Bug 2042231
| Summary: | CVO hotloops on Updating PrometheusRule openshift-machine-api/machine-api-operator-prometheus-rules | | |
|---|---|---|---|
| Product: | OpenShift Container Platform | Reporter: | Junqi Zhao <juzhao> |
| Component: | Cluster Version Operator | Assignee: | Jack Ottofaro <jack.ottofaro> |
| Status: | CLOSED DUPLICATE | QA Contact: | Yang Yang <yanyang> |
| Severity: | medium | Docs Contact: | |
| Priority: | medium | | |
| Version: | 4.9 | CC: | dhurta, hongyli, jack.ottofaro, lmohanty, yanyang |
| Target Milestone: | --- | | |
| Target Release: | --- | | |
| Hardware: | Unspecified | | |
| OS: | Unspecified | | |
| Whiteboard: | | | |
| Fixed In Version: | | Doc Type: | If docs needed, set a value |
| Doc Text: | | Story Points: | --- |
| Clone Of: | | Environment: | |
| Last Closed: | 2022-08-31 12:54:32 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | | Category: | --- |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | | | |
| Attachments: | | | |
Description
Junqi Zhao
2022-01-19 04:21:10 UTC
This bug seems to be already fixed.
The CVO was trying to reconcile an incorrectly written manifest.
We can see in the diff that the CVO tries to add a missing field "annotation" to the manifest (for the whole log line see the previous comment [1]):
- "annotation": map[string]interface{}{
- "message": string("machine health check {{ $labels.name }} has been disabled by short circuit for more than 30 minutes"),
- },
However, the manifest that the CVO is trying to reconcile has a typo: "annotation" should be "annotations" (for the custom resource definition, see [2]). The CVO was trying to apply a change that the API server would never accept, which resulted in hot-looping on the CVO side.
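The loop described above can be illustrated with a small toy model. This is only a sketch, not the CVO's actual code: the `ALLOWED_RULE_KEYS` set, the `server_store` and `reconcile` helpers, and the `ExampleAlert` name are all hypothetical, and the model assumes the server simply never persists the misspelled key.

```python
import copy

# Hypothetical, simplified set of keys a rule may carry (assumption).
ALLOWED_RULE_KEYS = {"alert", "expr", "for", "labels", "annotations"}

MESSAGE = ("machine health check {{ $labels.name }} has been disabled "
           "by short circuit for more than 30 minutes")


def server_store(rule):
    """Model a server that never persists keys it does not recognize."""
    return {k: copy.deepcopy(v) for k, v in rule.items()
            if k in ALLOWED_RULE_KEYS}


def reconcile(manifest_rule, max_passes=5):
    """Count update attempts until stored state equals the manifest.

    The count is capped at max_passes so a non-converging manifest
    does not loop forever in this toy model.
    """
    stored = {}
    updates = 0
    for _ in range(max_passes):
        if stored == manifest_rule:
            break  # nothing left to reconcile
        stored = server_store(manifest_rule)
        updates += 1
    return updates


# Manifest with the typo: the key is never stored, so the diff never clears.
typo_rule = {"alert": "ExampleAlert", "annotation": {"message": MESSAGE}}
# Corrected manifest: one update and the states match.
fixed_rule = {"alert": "ExampleAlert", "annotations": {"message": MESSAGE}}

print(reconcile(typo_rule), reconcile(fixed_rule))
```

With the misspelled key the model issues an update on every pass until the cap is reached, while the corrected key converges after a single update.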
This issue was already fixed by pull request [3] for bug [4] in the repository [5] that the manifest comes from. A CVO log file from a newer version (in my case 4.12.0-0.ci-2022-08-29-170215) no longer shows this hot-looping.
[1] https://bugzilla.redhat.com/show_bug.cgi?id=2042231#c0
[2] https://github.com/openshift/api/blob/b21e86c742e740c2e2c55288a0e6b68cf3afee4d/monitoring/v1alpha1/0000_50_monitoring_01_alertingrules.crd.yaml
[3] https://github.com/openshift/machine-api-operator/pull/942
[4] https://bugzilla.redhat.com/show_bug.cgi?id=2010368
[5] https://github.com/openshift/machine-api-operator
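One way to check a CVO log file for this kind of hot-looping is to count how often each PrometheusRule is reported as being updated. The sketch below is an assumption-laden helper, not an official tool: it only assumes that the relevant log lines contain the text "Updating PrometheusRule " followed by the resource name, as in the summary of this bug, and the `count_updates` name and the sample lines are made up for illustration.

```python
from collections import Counter

# Text taken from the bug summary; the rest of the log format is not assumed.
MARKER = "Updating PrometheusRule "


def count_updates(log_lines):
    """Count update messages per PrometheusRule name."""
    counts = Counter()
    for line in log_lines:
        idx = line.find(MARKER)
        if idx == -1:
            continue  # not an update message
        rest = line[idx + len(MARKER):].split()
        if rest:
            counts[rest[0]] += 1  # first token after the marker is the name
    return counts


# Made-up sample lines for illustration only.
sample = [
    "Updating PrometheusRule openshift-machine-api/machine-api-operator-prometheus-rules due to diff",
    "some unrelated line",
    "Updating PrometheusRule openshift-machine-api/machine-api-operator-prometheus-rules due to diff",
]

for name, n in count_updates(sample).most_common():
    print(n, name)
```

A resource whose count keeps growing across sync cycles is a candidate for the hot-loop described in this bug. To run it on a real log, pass an open file object instead of `sample`.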
Per https://bugzilla.redhat.com/show_bug.cgi?id=2042231#c2, closing as a duplicate of https://bugzilla.redhat.com/show_bug.cgi?id=2010368.
*** This bug has been marked as a duplicate of bug 2010368 ***