Bug 1959238
Summary: | CVO creating cloud-controller-manager too early causing upgrade failures | ||
---|---|---|---|
Product: | OpenShift Container Platform | Reporter: | OpenShift BugZilla Robot <openshift-bugzilla-robot> |
Component: | Cluster Version Operator | Assignee: | W. Trevor King <wking> |
Status: | CLOSED ERRATA | QA Contact: | Yang Yang <yanyang> |
Severity: | medium | Docs Contact: | |
Priority: | high | ||
Version: | 4.8 | CC: | aos-bugs, jokerman, vrutkovs, wking, yanyang |
Target Milestone: | --- | ||
Target Release: | 4.7.z | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | Bug Fix | |
Doc Text: |
Cause: The cluster-version operator had been pre-creating ClusterOperator resources at the beginning of the update phase.
Consequence: For new second-level operators who were deeper in in the update, that could leave their ClusterOperator without status conditions for many minutes. After ten minutes without conditions, the ClusterOperatorDown and/or ClusterOperatorDegraded alerts would begin firing.
Fix: Now the cluster-version operator pre-creates ClusterOperator resources much more closely to the creation of the new second-level operator's deployment.
Result: The new second-level operator has time to come up and set status.conditions on the ClusterOperator before alarms start firing.
|
Story Points: | --- |
Clone Of: | Environment: | ||
Last Closed: | 2021-06-01 04:50:27 UTC | Type: | --- |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: | |||
Bug Depends On: | 1957775 | ||
Bug Blocks: | 1943566 |
Comment 2
Yang Yang
2021-05-25 06:04:06 UTC
Performing regression test against 4.7.0-0.nightly-2021-05-25-192356 by upgrading to 4.8.0-0.nightly-2021-05-25-223219. # oc get clusterversion NAME VERSION AVAILABLE PROGRESSING SINCE STATUS version 4.7.0-0.nightly-2021-05-25-192356 True True 44m Working towards 4.8.0-0.nightly-2021-05-25-223219: 549 of 675 done (81% complete), waiting up to 40 minutes on dns # oc get clusterversion NAME VERSION AVAILABLE PROGRESSING SINCE STATUS version 4.8.0-0.nightly-2021-05-25-223219 True False 5m24s Cluster version is 4.8.0-0.nightly-2021-05-25-223219 There are no alert ClusterOperatorDown and/or ClusterOperatorDegraded fired. And upgrade is successful. Moving it to verified state. Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Moderate: OpenShift Container Platform 4.7.13 bug fix and security update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHSA-2021:2121 |