Description of problem:
In the ClusterOperatorDegraded alert, the description annotation reads "The {{ $labels.name }} operator id degraded because"; "id" should be "is".

# oc -n openshift-monitoring get cm prometheus-k8s-rulefiles-0 -oyaml
.....
    - alert: ClusterOperatorDegraded
      annotations:
        description: The {{ $labels.name }} operator id degraded because {{ $labels.reason }}, and the components it manages may have reduced quality of service. Cluster upgrades may not complete. For more information refer to 'oc get -o yaml clusteroperator {{ $labels.name }}'{{ with $console_url := "console_url" | query }}{{ if ne (len (label "url" (first $console_url ) ) ) 0}} or {{ label "url" (first $console_url ) }}/settings/cluster/{{ end }}{{ end }}.
        summary: Cluster operator has been degraded for 30 minutes.
      expr: |
        (
          cluster_operator_conditions{job="cluster-version-operator", condition="Degraded"}
          or on (name)
          group by (name) (cluster_operator_up{job="cluster-version-operator"})
        ) == 1
      for: 30m
      labels:
        severity: warning
.....

Version-Release number of selected component (if applicable):
4.9.0-0.nightly-2021-08-02-145924

How reproducible:
Only with the 4.9 payload.

Steps to Reproduce:
1. See the description.
2.
3.

Actual results:

Expected results:

Additional info:
Checked with 4.9.0-0.nightly-2021-08-07-175228, issue is fixed.

# oc -n openshift-monitoring get cm prometheus-k8s-rulefiles-0 -oyaml | grep ClusterOperatorDegraded -A17
    - alert: ClusterOperatorDegraded
      annotations:
        description: The {{ $labels.name }} operator is degraded because {{ $labels.reason }}, and the components it manages may have reduced quality of service. Cluster upgrades may not complete. For more information refer to 'oc get -o yaml clusteroperator {{ $labels.name }}'{{ with $console_url := "console_url" | query }}{{ if ne (len (label "url" (first $console_url ) ) ) 0}} or {{ label "url" (first $console_url ) }}/settings/cluster/{{ end }}{{ end }}.
        summary: Cluster operator has been degraded for 30 minutes.
      expr: |
        (
          cluster_operator_conditions{job="cluster-version-operator", condition="Degraded"}
          or on (name)
          group by (name) (cluster_operator_up{job="cluster-version-operator"})
        ) == 1
      for: 30m
      labels:
        severity: warning
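
The manual check above could also be scripted. The following is only a minimal sketch: `check_degraded_description` is a hypothetical helper name (not part of `oc` or of the cluster-version-operator), and it assumes the rule file content is piped in on stdin. It only looks for the misspelled phrase reported in this bug.

```shell
# Hypothetical helper: reads rule YAML on stdin and reports whether the
# ClusterOperatorDegraded description still contains the "id" typo.
check_degraded_description() {
  # grep -q reads stdin and exits 0 when the misspelled phrase is found.
  if grep -q 'operator id degraded'; then
    echo "typo present"
    return 1
  fi
  echo "ok"
}

# Usage against a live cluster (needs oc and cluster access):
# oc -n openshift-monitoring get cm prometheus-k8s-rulefiles-0 -oyaml | check_degraded_description
```

The helper returns a non-zero status when the typo is found, so it can gate a verification script.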
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA.

For information on the advisory (Moderate: OpenShift Container Platform 4.9.0 bug fix and security update), and where to find the updated files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2021:3759