Bug 1915023

Summary: After upgrade from 4.5.17 to 4.6.8 stable, nodes get randomly NotReady after 30 minutes (Azure)
Product: OpenShift Container Platform Reporter: Valentino Uberti <vuberti>
Component: Node Assignee: Harshal Patil <harpatil>
Node sub component: Kubelet QA Contact: MinLi <minmli>
Status: CLOSED DUPLICATE Docs Contact:
Severity: high    
Priority: unspecified CC: aos-bugs, harpatil
Version: 4.6.z   
Target Milestone: ---   
Target Release: ---   
Hardware: All   
OS: Linux   
Last Closed: 2021-01-20 10:26:18 UTC Type: Bug

Description Valentino Uberti 2021-01-11 18:53:38 UTC
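Description of problem:

After upgrading from OCP 4.5.17 to 4.6.8 stable on Azure, nodes randomly go NotReady after 30 minutes.
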
Version-Release number of selected component (if applicable):


How reproducible:


Steps to Reproduce:
1. Install OCP 4.5.17 on Azure
2. Upgrade to OCP 4.6.8 stable


Actual results:

Nodes randomly enter the NotReady state after 30 minutes.


Expected results:

All nodes remain in the Ready state.

Additional info:

It is impossible to SSH into the failing node.
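
When a node cannot be reached over SSH, its last reported conditions can still be read from the API server. The following is a minimal sketch, not part of the original report, that lists nodes whose Ready condition is not "True". It assumes `oc` is on the PATH and logged in to the affected cluster; the function names `not_ready_nodes` and `fetch_nodes` are hypothetical helpers chosen for illustration.

```python
import json
import subprocess


def not_ready_nodes(node_list):
    """Return (name, status, reason) for each node whose Ready condition is not "True".

    `node_list` is the parsed JSON of a Kubernetes NodeList, as printed by
    `oc get nodes -o json`.
    """
    result = []
    for node in node_list.get("items", []):
        name = node["metadata"]["name"]
        conditions = node.get("status", {}).get("conditions", [])
        # Each node reports a list of conditions; the one of type "Ready"
        # has status "True", "False", or "Unknown".
        ready = next((c for c in conditions if c.get("type") == "Ready"), None)
        if ready is None:
            result.append((name, "Unknown", "no Ready condition reported"))
        elif ready.get("status") != "True":
            result.append((name, ready.get("status", "Unknown"), ready.get("reason", "")))
    return result


def fetch_nodes():
    """Fetch the NodeList from the cluster (assumption: `oc` is installed and logged in)."""
    completed = subprocess.run(
        ["oc", "get", "nodes", "-o", "json"],
        check=True,
        capture_output=True,
        text=True,
    )
    return json.loads(completed.stdout)


# Example usage against a live cluster:
#   for name, status, reason in not_ready_nodes(fetch_nodes()):
#       print(f"{name}\tReady={status}\t{reason}")
```

Running this periodically after the upgrade would show which nodes drop out and what reason is recorded for each, which may help correlate the failures with the 30-minute mark.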