Description of problem:
[sig-arch][Early] Managed cluster should start all core operators [Suite:openshift/conformance/parallel] expand_more
is occasionally failing in CI:
$ w3m -dump -cols 200 'https://search.ci.openshift.org/?maxAge=24h&type=junit&search=Some%20cluster%20operators%20are%20not%20ready:%20marketplace.*missing:%20Degraded' | grep 'failures match' | sort
pull-ci-openshift-cloud-credential-operator-master-e2e-aws - 3 runs, 100% failed, 33% of failures match
pull-ci-openshift-prometheus-master-e2e-aws - 5 runs, 100% failed, 20% of failures match
release-openshift-ocp-installer-e2e-aws-4.7 - 4 runs, 50% failed, 50% of failures match
release-openshift-ocp-installer-e2e-gcp-rt-4.7 - 2 runs, 50% failed, 100% of failures match
release-openshift-origin-installer-e2e-azure-shared-vpc-4.7 - 1 runs, 100% failed, 100% of failures match
Version-Release number of selected component (if applicable):
Steps to Reproduce:
1. Install a bunch of clusters.
See some of them whose marketplace operator is not setting Degraded.
Always have the Degraded condition set (ideally with status False)
Seems related to bug 1857680, but the logic discussed in that bug no longer exists in 4.7.
Example job :
$ curl -s https://gcsweb-ci.apps.ci.l2s4.p1.openshiftapps.com/gcs/origin-ci-test/logs/release-openshift-ocp-installer-e2e-aws-4.7/1325575034390974464/artifacts/e2e-aws/clusteroperators.json | jq -r '.items | select(.metadata.name == "marketplace").status.conditions | .lastTransitionTime + " " + .type + "=" + .status + " " + (.reason // "-") + ": " + (.message // "-")'
2020-11-08T23:23:55Z Progressing=False OperatorAvailable: Successfully progressed to release version: 4.7.0-0.nightly-2020-11-08-225909
2020-11-08T23:23:55Z Available=True OperatorAvailable: Available release version: 4.7.0-0.nightly-2020-11-08-225909
LGTM, verify it, example job:
[root@preserve-olm-env data]# curl -s https://gcsweb-ci.apps.ci.l2s4.p1.openshiftapps.com/gcs/origin-ci-test/logs/release-openshift-ocp-installer-e2e-aws-4.7/1326370102743928832/artifacts/e2e-aws/clusteroperators.json | jq -r '.items | select(.metadata.name == "marketplace").status.conditions | .lastTransitionTime + " " + .type + "=" + .status + " " + (.reason // "-") + ": " + (.message // "-")'
2020-11-11T04:03:38Z Progressing=False OperatorAvailable: Successfully progressed to release version: 4.7.0-0.nightly-2020-11-11-033756
2020-11-11T04:03:38Z Available=True OperatorAvailable: Available release version: 4.7.0-0.nightly-2020-11-11-033756
2020-11-11T04:03:38Z Upgradeable=True OperatorAvailable: Marketplace is upgradeable
2020-11-11T04:03:17Z Degraded=False OperatorAvailable: Successfully progressed to release version: 4.7.0-0.nightly-2020-11-11-033756
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.
For information on the advisory (Moderate: OpenShift Container Platform 4.7.0 security, bug fix, and enhancement update), and where to find the updated
files, follow the link below.
If the solution does not work for you, open a new bug report.