Bug 2011958

Summary: [4.9] [tracker] Kubelet rejects pods that use resources that should be freed by completed pods
Product: OpenShift Container Platform Reporter: RHCOS Bug Bot <rhcos-bot>
Component: RHCOS Assignee: Micah Abbott <miabbott>
Status: CLOSED ERRATA QA Contact: HuijingHei <hhei>
Severity: urgent Docs Contact:
Priority: urgent    
Version: 4.9 CC: dornelas, hhei, jligon, miabbott, mrussell, nstielau
Target Milestone: ---   
Target Release: 4.9.0   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: 2011956 Environment:
Last Closed: 2021-10-18 17:52:32 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On: 2009653, 2011956    
Bug Blocks:    

Description RHCOS Bug Bot 2021-10-07 19:32:04 UTC
Backport the fix for bug 2011956 to 4.9.

Comment 1 Micah Abbott 2021-10-08 19:02:22 UTC
This tracker is a bit confusing; the 4.9 version of the kubelet BZ is https://bugzilla.redhat.com/show_bug.cgi?id=2011815

That BZ links to https://github.com/openshift/kubernetes/pull/1009

The fixed kubelet is built from commit 894a78b5538c3e96d7bdfabfa8b41430cbd6811e or newer.

The most recent RHCOS 4.9 build for x86_64 (49.84.202110081407-0) contains `openshift-hyperkube-4.9.0-202110080828.p0.git.894a78b.assembly.stream.el8`.

The short commit id in the NVR (894a78b) indicates that this is built from the correct commit.
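
As a rough sketch of that check, the snippet below pulls the short commit id out of the NVR string and confirms it is a prefix of the full fixed commit. The helper name `short_commit_from_nvr` and the `.git.<hash>.` pattern are assumptions inferred from this one NVR, not a documented naming contract.

```python
import re

# Values copied from this comment.
FIXED_COMMIT = "894a78b5538c3e96d7bdfabfa8b41430cbd6811e"
NVR = "openshift-hyperkube-4.9.0-202110080828.p0.git.894a78b.assembly.stream.el8"


def short_commit_from_nvr(nvr):
    """Return the hex string that follows '.git.' in an NVR, or None.

    Hypothetical helper: assumes the short commit is embedded as
    '.git.<hex>.' as it is in the NVR quoted above.
    """
    match = re.search(r"\.git\.([0-9a-f]{7,40})(?:\.|$)", nvr)
    return match.group(1) if match else None


short = short_commit_from_nvr(NVR)
# A short id identifies the fixed commit if the full hash starts with it.
matches_fix = short is not None and FIXED_COMMIT.startswith(short)
print(short, matches_fix)
```

A prefix match only shows that the package was built from exactly the fixed commit. Confirming an "or newer" build would still require checking the commit's ancestry in the repository.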

Additionally, an inspection of the `kubelet` itself on a running RHCOS node confirms the same:

```
[core@cosa-devsh ~]$ rpm-ostree status
State: idle
Deployments:
* ostree://3093d4596a48e37c9926dc53240af084c077e3bf2063ef2a8d8a81421b6e9987
                   Version: 49.84.202110081407-0 (2021-10-08T14:10:58Z)
[core@cosa-devsh ~]$ /usr/bin/kubelet --version
Kubernetes v1.22.0-rc.0+894a78b
```
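
The same comparison can be scripted against the `kubelet --version` output. This is a minimal sketch that assumes the version line always has the `Kubernetes v<version>+<short commit>` shape seen above. `kubelet_build_commit` is a hypothetical helper name.

```python
import re

# Full commit of the fix, as given earlier in this comment.
FIXED_COMMIT = "894a78b5538c3e96d7bdfabfa8b41430cbd6811e"


def kubelet_build_commit(version_output):
    """Return the hex id after '+' in a 'Kubernetes vX.Y.Z+<id>' line, or None.

    Hypothetical helper: assumes the output format shown in this comment.
    """
    match = re.match(r"Kubernetes v\S+?\+([0-9a-f]+)\s*$", version_output.strip())
    return match.group(1) if match else None


# Sample line copied from the node output above.
commit = kubelet_build_commit("Kubernetes v1.22.0-rc.0+894a78b")
built_from_fix = commit is not None and FIXED_COMMIT.startswith(commit)
print(commit, built_from_fix)
```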

Comment 2 RHCOS Bug Bot 2021-10-08 19:21:46 UTC
This bug has been reported fixed in a new RHCOS build and is ready for QE verification.  To mark the bug verified, set the Verified field to Tested.  This bug will automatically move to MODIFIED once the fix has landed in a new bootimage.

Comment 3 RHCOS Bug Bot 2021-10-08 21:18:25 UTC
The fix for this bug has landed in a bootimage bump, as tracked in bug 2009653 (now in status MODIFIED).  Moving this bug to MODIFIED.

Comment 6 HuijingHei 2021-10-11 09:16:56 UTC
Bug 2011815 has been resolved; moving this bug to VERIFIED.


rhcos: 49.84.202110081407-0 (https://github.com/openshift/installer/commit/2a75f9bd56df413d726e5b9b98dbfbbc0884ff0f)
OCP: 4.9.0-0.nightly-2021-10-08-232649

sh-4.4# chroot /host
sh-4.4# rpm-ostree status
State: idle
Deployments:
* pivot://quay.io/openshift-release-dev/ocp-v4.0-art-dev@sha256:eaaf9590ed870930313bb0275c6f87c012f64f6a5e784b119921b20369af783d
              CustomOrigin: Managed by machine-config-operator
                   Version: 49.84.202110081407-0 (2021-10-08T14:10:58Z)

  ostree://3093d4596a48e37c9926dc53240af084c077e3bf2063ef2a8d8a81421b6e9987
                   Version: 49.84.202110081407-0 (2021-10-08T14:10:58Z)
sh-4.4# kubelet --version
Kubernetes v1.22.0-rc.0+894a78b

Comment 8 errata-xmlrpc 2021-10-18 17:52:32 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Moderate: OpenShift Container Platform 4.9.0 bug fix and security update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2021:3759