Bug 1893511

Summary: All nodes are going into NotReady state intermittently. [vmware]
Product: OpenShift Container Platform Reporter: manisha <mdhanve>
Component: NodeAssignee: Ryan Phillips <rphillips>
Node sub component: Kubelet QA Contact: Sunil Choudhary <schoudha>
Status: CLOSED DUPLICATE Docs Contact:
Severity: urgent    
Priority: urgent CC: akashem, akhaire, aos-bugs, braander, mfojtik, prdeshpa, slaznick, sttts, xxia
Version: 4.5Flags: mdhanve: needinfo-
mdhanve: needinfo-
Target Milestone: ---   
Target Release: ---   
Hardware: Unspecified   
OS: Other   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2020-11-20 15:49:33 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Comment 2 Standa Laznicka 2020-11-02 07:52:55 UTC
You assigned the bugzilla to apiserver-auth, but included kubelet and crio logs which we won't help us troubleshoot the apiserver. Please add must-gather from when the cluster was down.

Comment 3 Standa Laznicka 2020-11-02 08:07:42 UTC
Also, this BZ looks a lot like https://bugzilla.redhat.com/show_bug.cgi?id=1873816, have you checked it before opening this BZ?

Comment 15 Abu Kashem 2020-11-18 19:56:07 UTC
This seems very similar to the known issue: https://github.com/kubernetes/kubernetes/issues/87615. As a workaround I would recommend restarting kubelet.

Comment 16 Abu Kashem 2020-11-18 19:59:12 UTC
Also we have a known BZ https://github.com/kubernetes/kubernetes/pull/96549 where we are tracking this. The upstream fix https://github.com/kubernetes/kubernetes/pull/96549 is going to land in kube 1.20.

Comment 17 Abu Kashem 2020-11-18 20:01:10 UTC
We will also need this patch: https://github.com/kubernetes/kubernetes/pull/95981.

Comment 18 Stefan Schimanski 2020-11-18 20:50:03 UTC
Typo from c16: the BZ is https://bugzilla.redhat.com/show_bug.cgi?id=1873114.

Comment 19 Ryan Phillips 2020-11-20 15:49:33 UTC
Going to mark this BZ as a duplicate. We are working in the other BZ to get the fixes in.

*** This bug has been marked as a duplicate of bug 1873114 ***