Bug 1989496

Summary: typo in ClusterOperatorDegraded alert description part
Product: OpenShift Container Platform Reporter: Junqi Zhao <juzhao>
Component: Cluster Version OperatorAssignee: Jan Fajerski <jfajersk>
Status: CLOSED ERRATA QA Contact: Yang Yang <yanyang>
Severity: low Docs Contact:
Priority: unspecified    
Version: 4.9CC: amuller, anpicker, aos-bugs, erooth, jfajersk, jokerman, yanyang
Target Milestone: ---   
Target Release: 4.9.0   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2021-10-18 17:44:09 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Junqi Zhao 2021-08-03 10:46:18 UTC
Description of problem:
ClusterOperatorDegraded alert, description part
The {{ $labels.name }} operator id degraded because
-> id should be is
# oc -n openshift-monitoring get cm prometheus-k8s-rulefiles-0 -oyaml
.....
      - alert: ClusterOperatorDegraded
        annotations:
          description: The {{ $labels.name }} operator id degraded because {{ $labels.reason
            }}, and the components it manages may have reduced quality of service.  Cluster
            upgrades may not complete. For more information refer to 'oc get -o yaml clusteroperator
            {{ $labels.name }}'{{ with $console_url := "console_url" | query }}{{ if ne
            (len (label "url" (first $console_url ) ) ) 0}} or {{ label "url" (first $console_url
            ) }}/settings/cluster/{{ end }}{{ end }}.
          summary: Cluster operator has been degraded for 30 minutes.
        expr: |
          (
            cluster_operator_conditions{job="cluster-version-operator", condition="Degraded"}
            or on (name)
            group by (name) (cluster_operator_up{job="cluster-version-operator"})
          ) == 1
        for: 30m
        labels:
          severity: warning
.....
Version-Release number of selected component (if applicable):
4.9.0-0.nightly-2021-08-02-145924

How reproducible:
only with 4.9 payload

Steps to Reproduce:
1. see the description
2.
3.

Actual results:


Expected results:


Additional info:

Comment 2 Junqi Zhao 2021-08-09 05:16:50 UTC
checked with 4.9.0-0.nightly-2021-08-07-175228, issue is fixed
# oc -n openshift-monitoring get cm prometheus-k8s-rulefiles-0 -oyaml | grep ClusterOperatorDegraded -A17
      - alert: ClusterOperatorDegraded
        annotations:
          description: The {{ $labels.name }} operator is degraded because {{ $labels.reason
            }}, and the components it manages may have reduced quality of service.  Cluster
            upgrades may not complete. For more information refer to 'oc get -o yaml clusteroperator
            {{ $labels.name }}'{{ with $console_url := "console_url" | query }}{{ if ne
            (len (label "url" (first $console_url ) ) ) 0}} or {{ label "url" (first $console_url
            ) }}/settings/cluster/{{ end }}{{ end }}.
          summary: Cluster operator has been degraded for 30 minutes.
        expr: |
          (
            cluster_operator_conditions{job="cluster-version-operator", condition="Degraded"}
            or on (name)
            group by (name) (cluster_operator_up{job="cluster-version-operator"})
          ) == 1
        for: 30m
        labels:
          severity: warning

Comment 9 errata-xmlrpc 2021-10-18 17:44:09 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Moderate: OpenShift Container Platform 4.9.0 bug fix and security update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2021:3759