Bug 1771316
Summary: | cluster version object does not reflect the cluster's correct status | | |
---|---|---|---|
Product: | OpenShift Container Platform | Reporter: | liujia <jiajliu> |
Component: | Cluster Version Operator | Assignee: | W. Trevor King <wking> |
Status: | CLOSED ERRATA | QA Contact: | liujia <jiajliu> |
Severity: | medium | Priority: | low |
Version: | 4.3.0 | CC: | adahiya, aos-bugs, jokerman, wking |
Target Release: | 4.4.0 | Type: | Bug |
Hardware: | Unspecified | OS: | Unspecified |
Last Closed: | 2020-05-04 11:15:07 UTC | | |
Description
liujia
2019-11-12 07:26:00 UTC
Looking at the attached must-gather fragment:

$ sha1sum log.tgz
0aa3c5a2c7e8a1cae260ee8a9ce158f57d4088b3 log.tgz
$ tar xzf log.tgz
$ grep 'Running sync.*in state\|Result of work' log/namespaces/openshift-cluster-version/pods/cluster-version-operator-7446bc8f98-p4vjs/cluster-version-operator/cluster-version-operator/logs/current.log | head -n6
2019-11-12T04:22:23.005709831Z I1112 04:22:23.003748 1 sync_worker.go:464] Running sync registry.svc.ci.openshift.org/ocp/release:4.3.0-0.nightly-2019-11-12-000306 (force=true) on generation 2 in state Updating at attempt 0
2019-11-12T04:28:08.055820161Z I1112 04:28:08.055806 1 task_graph.go:611] Result of work: [Cluster operator monitoring is still updating]
2019-11-12T04:28:33.341970344Z I1112 04:28:33.341890 1 sync_worker.go:464] Running sync registry.svc.ci.openshift.org/ocp/release:4.3.0-0.nightly-2019-11-12-000306 (force=true) on generation 2 in state Updating at attempt 1
2019-11-12T04:33:49.389389153Z I1112 04:33:49.389357 1 task_graph.go:611] Result of work: []
2019-11-12T04:36:41.915469438Z I1112 04:36:41.915369 1 sync_worker.go:464] Running sync registry.svc.ci.openshift.org/ocp/release:4.3.0-0.nightly-2019-11-12-000306 (force=true) on generation 2 in state Reconciling at attempt 0
2019-11-12T04:37:14.276571228Z I1112 04:37:14.276551 1 task_graph.go:611] Result of work: []

So that seems reasonable, and 04:33Z matches the upgrade-complete entry from your comment 0 history. I'm looking to see if I can find anything from earlier...

What we might need is a dump of the CVO logs from when it is reporting the early 'Cluster version is ...', before that pod gets killed (probably because the control-plane machine it was on is being updated) and rescheduled elsewhere, which blows away the old logs.

OK, understood. I will capture more logs before the original CVO pod is killed when I hit it again.

Can we please confirm whether or not this still happens in both the latest 4.4 builds and the 4.3 release builds?
Monitored an upgrade from v4.3.0 to the latest 4.4.0-0.nightly-2020-02-04-002939 and did not hit the issue. Monitored another upgrade from v4.2.16 to the latest v4.3.0 and did not hit the issue.

Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2020:0581
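
For anyone re-checking a similar must-gather, the grep from the description can be turned into a small script that pairs each "Running sync" line with the "Result of work" line that follows it. This is only a rough sketch: the function name `summarize_sync_rounds` and the regular expressions are assumptions based on the six log lines quoted in this bug, not something taken from the CVO source, and other CVO versions may format these lines differently.

```python
import re

# Patterns are assumptions derived from the log lines quoted in this bug.
SYNC_RE = re.compile(
    r"^(?P<ts>\S+) .*Running sync (?P<image>\S+) \(force=(?P<force>\w+)\) "
    r"on generation (?P<generation>\d+) in state (?P<state>\w+) "
    r"at attempt (?P<attempt>\d+)"
)
RESULT_RE = re.compile(r"^(?P<ts>\S+) .*Result of work: \[(?P<errors>.*)\]")


def summarize_sync_rounds(lines):
    """Pair each 'Running sync' line with the next 'Result of work' line."""
    rounds = []
    current = None
    for line in lines:
        m = SYNC_RE.match(line)
        if m:
            current = {
                "started": m.group("ts"),
                "state": m.group("state"),
                "attempt": int(m.group("attempt")),
                "finished": None,
                "errors": None,  # stays None if no result line was logged
            }
            rounds.append(current)
            continue
        m = RESULT_RE.match(line)
        if m and current is not None:
            current["finished"] = m.group("ts")
            current["errors"] = m.group("errors")  # "" means no errors
            current = None
    return rounds


# The six lines quoted in the description, verbatim.
SAMPLE = """\
2019-11-12T04:22:23.005709831Z I1112 04:22:23.003748 1 sync_worker.go:464] Running sync registry.svc.ci.openshift.org/ocp/release:4.3.0-0.nightly-2019-11-12-000306 (force=true) on generation 2 in state Updating at attempt 0
2019-11-12T04:28:08.055820161Z I1112 04:28:08.055806 1 task_graph.go:611] Result of work: [Cluster operator monitoring is still updating]
2019-11-12T04:28:33.341970344Z I1112 04:28:33.341890 1 sync_worker.go:464] Running sync registry.svc.ci.openshift.org/ocp/release:4.3.0-0.nightly-2019-11-12-000306 (force=true) on generation 2 in state Updating at attempt 1
2019-11-12T04:33:49.389389153Z I1112 04:33:49.389357 1 task_graph.go:611] Result of work: []
2019-11-12T04:36:41.915469438Z I1112 04:36:41.915369 1 sync_worker.go:464] Running sync registry.svc.ci.openshift.org/ocp/release:4.3.0-0.nightly-2019-11-12-000306 (force=true) on generation 2 in state Reconciling at attempt 0
2019-11-12T04:37:14.276571228Z I1112 04:37:14.276551 1 task_graph.go:611] Result of work: []
"""

if __name__ == "__main__":
    for r in summarize_sync_rounds(SAMPLE.splitlines()):
        print(r["started"], r["state"], r["attempt"], repr(r["errors"]))
```

A round whose `errors` value is the empty string finished cleanly. The first `Updating` round that does so is the point to compare against the completion time recorded in the ClusterVersion history.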