Bug 1913275

Summary: [4.5] Unable to schedule a pod due to Insufficient ephemeral-storage
Product: OpenShift Container Platform Reporter: Jan Chaloupka <jchaloup>
Component: kube-schedulerAssignee: Jan Chaloupka <jchaloup>
Status: CLOSED EOL QA Contact: RamaKasturi <knarra>
Severity: medium Docs Contact:
Priority: medium    
Version: 4.5CC: aos-bugs, jchaloup, jmcmeek, jnordell, joshisa, knarra, maszulik, mfojtik
Target Milestone: ---Keywords: Reopened, UpcomingSprint
Target Release: 4.5.z   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: 1913263 Environment:
Last Closed: 2021-01-18 14:13:47 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On: 1913263    
Bug Blocks:    

Comment 1 Jan Chaloupka 2021-01-06 12:19:13 UTC
Origin PR https://github.com/openshift/origin/pull/25781

Comment 2 RamaKasturi 2021-01-12 12:58:41 UTC
Verified bug in the payload below and i see that it works.

[knarra@knarra openshift-client-linux-4.7.0-fc.2]$ oc get clusterversion
NAME      VERSION                                           AVAILABLE   PROGRESSING   SINCE   STATUS
version   4.5.0-0.ci.test-2021-01-12-111301-ci-ln-5zmmb2t   True        False         74m     Cluster version is 4.5.0-0.ci.test-2021-01-12-111301-ci-ln-5zmmb2t

Below are the steps followed to verify the bug:
=================================================
1) Launch cluster bot with the PR
2) Edit file cat /etc/kubernetes/manifests/kube-controller-manager-pod.yaml, add LocalStorageCapacityIsolation=false to --feature-gates line on all master nodes one by one, wait for the kube-scheduler pods to restart.
3) Now create a pod using the yaml file below.

[knarra@knarra openshift-client-linux-4.7.0-fc.2]$ cat /tmp/ephermal.yaml 
apiVersion: v1
kind: Pod
metadata:
  name: nginx
  labels:
    name: nginx
spec:
  containers:
  - name: nginx
    image: quay.io/openshifttest/nginx@sha256:3936fb3946790d711a68c58be93628e43cbca72439079e16d154b5db216b58da
    schedulerName: default-scheduler
    ports:
    - containerPort: 80
    resources:
      requests:
        ephemeral-storage: 4096M
      limits:
        ephemeral-storage: 4096M
  initContainers:
  - name: init-myservice
    image: quay.io/openshifttest/busybox@sha256:afe605d272837ce1732f390966166c2afff5391208ddd57de10942748694049d
    command: ['sh', '-c', "echo waiting for myservice; sleep 7;"]
    resources:
      requests:
        cpu: 500m
        ephemeral-storage: 2M
        memory: 1024M

4) Verify that pod is in running state and not in pending.

[knarra@knarra openshift-client-linux-4.7.0-fc.2]$ ./oc get pods -o wide
NAME    READY   STATUS    RESTARTS   AGE   IP            NODE                                       NOMINATED NODE   READINESS GATES
nginx   1/1     Running   0          34s   10.128.2.14   ci-ln-5zmmb2t-f76d1-9lspz-worker-b-p59dn   <none>           <none>

Adding /lgtm to the PR as the bug works well.

Comment 3 Jan Chaloupka 2021-01-15 11:07:25 UTC
PR ready to be merged, just waiting for the right labels.