Bug 1853400

Summary: [4.5.z] race condition during installation between nodes getting their hostnames and crio+kubelet starting
Product: OpenShift Container Platform
Reporter: Omri Hochman <ohochman>
Component: RHCOS
Assignee: Ben Howard <behoward>
Status: CLOSED ERRATA
QA Contact: Michael Nguyen <mnguyen>
Severity: high
Docs Contact:
Priority: high
Version: 4.5
CC: achernet, augol, bbreard, behoward, fsimonce, imcleod, jligon, kholtz, lucab, miabbott, mnguyen, mpatel, nstielau, ohochman, pibanezr, rgregory, rphillips, ykashtan
Target Milestone: ---
Target Release: 4.5.z
Hardware: x86_64
OS: Unspecified
Whiteboard:
Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: 1845885
Environment:
Last Closed: 2020-08-24 15:13:44 UTC
Type: ---
Regression: ---
Mount Type: ---
Documentation: ---
CRM:
Verified Versions:
Category: ---
oVirt Team: ---
RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: ---
Target Upstream Version:
Embargoed:
Bug Depends On: 1845885
Bug Blocks: 1850775

Comment 1 Omri Hochman 2020-07-02 15:07:38 UTC
Raising severity and adding Test-Blocker, as this race condition impacts Telco customers who will use 4.5.

Comment 5 Micah Abbott 2020-07-06 12:54:32 UTC
We'll need https://github.com/openshift/machine-config-operator/pull/1813/ cherry-picked to 4.5.

Comment 7 Ben Howard 2020-07-08 22:05:09 UTC
https://github.com/openshift/machine-config-operator/pull/1914 is aimed at fixing all the hostname issues, and I think it's a more holistic fix for this particular problem.

Comment 8 Micah Abbott 2020-07-31 13:12:04 UTC
The PR in comment #7 was targeted at master; the 4.5 cherry-pick pointed to https://github.com/openshift/machine-config-operator/pull/1901, which was ultimately closed in favor of https://github.com/openshift/machine-config-operator/pull/1939

The code in PR 1939 is merged, so moving this to MODIFIED.

Comment 17 errata-xmlrpc 2020-08-24 15:13:44 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (OpenShift Container Platform 4.5.7 bug fix update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2020:3436