Bug 2110505

Summary: [Upgrade]deployment openshift-machine-api/machine-api-operator has a replica failure FailedCreate
Product: OpenShift Container Platform Reporter: Ben Parees <bparees>
Component: Cloud Compute Assignee: OpenShift Cluster Infrastructure Bugs <cluster-infrastructure-bug-bot>
Cloud Compute sub component: Other Providers QA Contact: sunzhaohua <zhsun>
Status: CLOSED ERRATA Docs Contact:
Severity: high    
Priority: high CC: aos-team-ota, bleanhar, bparees, cluster-infrastructure-bug-bot, hongkliu, jack.ottofaro, lmohanty, slaznick, wking, yanyang, zhsun
Version: 4.12 Keywords: FastFix
Target Milestone: ---   
Target Release: 4.11.0   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: 2110501 Environment:
Last Closed: 2022-08-10 11:21:24 UTC Type: ---
Bug Depends On: 2110501    
Bug Blocks:    

Comment 1 Ben Parees 2022-07-25 13:57:09 UTC
Setting this as blocker+. This is the bug to be used to revert the MAO change that broke upgrades in 4.11.

Comment 4 sunzhaohua 2022-07-27 18:33:35 UTC
Verified

Upgraded the cluster along the path 4.3.18->4.4.33->4.5.41->4.6.60->4.7.55->4.8.46->4.9.43->4.10.24->4.11.0-0.nightly-2022-07-26-154822->4.12.0-0.nightly-2022-07-27-133042; the upgrade was successful. Cluster: https://mastern-jenkins-csb-openshift-qe.apps.ocp-c1.prod.psi.redhat.com/job/ocp-common/job/Flexy-install/124511/artifact/workdir/install-dir/auth/kubeconfig/*view*/

$ oc get clusterversion
NAME      VERSION                              AVAILABLE   PROGRESSING   SINCE   STATUS
version   4.12.0-0.nightly-2022-07-27-133042   True        False         166m    Cluster version is 4.12.0-0.nightly-2022-07-27-133042

In 4.11.0-0.nightly-2022-07-26-154822:
$ oc edit deploy machine-api-operator
      securityContext:
        runAsNonRoot: true
        runAsUser: 65534

In 4.12.0-0.nightly-2022-07-27-133042:
$ oc edit deploy machine-api-operator
      securityContext: {}

$ oc get pods -A | grep CreateContainerConfigError 
$
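
The checks above can also be scripted rather than read by eye. The following is a minimal sketch, not part of the original verification: `pods_with_status` is a hypothetical helper name, and it assumes the default `oc get pods -A` table layout, in which STATUS is the fourth column. The commented `jsonpath` query further assumes that the securityContext shown above is the pod-level one under `.spec.template.spec`.

```shell
# Hypothetical helper (not from the original report): print "namespace/name"
# for every pod whose STATUS column equals the given value.
# Reads `oc get pods -A` table output on stdin; assumes the default columns
# NAMESPACE NAME READY STATUS RESTARTS AGE, so STATUS is field 4.
pods_with_status() {
  awk -v want="$1" 'NR > 1 && $4 == want { print $1 "/" $2 }'
}

# Example usage against a live cluster (left commented out so the helper can
# be sourced without cluster access):
#   oc get pods -A | pods_with_status CreateContainerConfigError
#
# Non-interactive alternative to `oc edit` for reading the securityContext,
# assuming it is the pod-level field of the deployment:
#   oc -n openshift-machine-api get deploy machine-api-operator \
#     -o jsonpath='{.spec.template.spec.securityContext}'
```

Empty output from the helper corresponds to the empty `grep` result shown above. Unlike a plain `grep`, it matches only the STATUS column, so a pod whose name happens to contain the string is not reported.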

Comment 5 Jack Ottofaro 2022-08-02 22:06:40 UTC
Removing UpgradeBlocker since no request to block edges has been made and there are no obvious signs that a significant percentage of the fleet is impacted. This is based on the impact statement at https://bugzilla.redhat.com/show_bug.cgi?id=2108858#c8.

Comment 6 errata-xmlrpc 2022-08-10 11:21:24 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Important: OpenShift Container Platform 4.11.0 bug fix and security update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2022:5069