Bug 1989496

Summary:	typo in ClusterOperatorDegraded alert description part
Product:	OpenShift Container Platform	Reporter:	Junqi Zhao <juzhao>
Component:	Cluster Version Operator	Assignee:	Jan Fajerski <jfajersk>
Status:	CLOSED ERRATA	QA Contact:	Yang Yang <yanyang>
Severity:	low	Docs Contact:
Priority:	unspecified
Version:	4.9	CC:	amuller, anpicker, aos-bugs, erooth, jfajersk, jokerman, yanyang
Target Milestone:	---
Target Release:	4.9.0
Hardware:	Unspecified
OS:	Unspecified
Whiteboard:
Fixed In Version:		Doc Type:	If docs needed, set a value
Doc Text:		Story Points:	---
Clone Of:		Environment:
Last Closed:	2021-10-18 17:44:09 UTC	Type:	Bug
Regression:	---	Mount Type:	---
Documentation:	---	CRM:
Verified Versions:		Category:	---
oVirt Team:	---	RHEL 7.3 requirements from Atomic Host:
Cloudforms Team:	---	Target Upstream Version:
Embargoed:

Description Junqi Zhao 2021-08-03 10:46:18 UTC

Description of problem:
The description annotation of the ClusterOperatorDegraded alert contains a typo:
The {{ $labels.name }} operator id degraded because
-> "id" should be "is"
# oc -n openshift-monitoring get cm prometheus-k8s-rulefiles-0 -oyaml
.....
      - alert: ClusterOperatorDegraded
        annotations:
          description: The {{ $labels.name }} operator id degraded because {{ $labels.reason
            }}, and the components it manages may have reduced quality of service.  Cluster
            upgrades may not complete. For more information refer to 'oc get -o yaml clusteroperator
            {{ $labels.name }}'{{ with $console_url := "console_url" | query }}{{ if ne
            (len (label "url" (first $console_url ) ) ) 0}} or {{ label "url" (first $console_url
            ) }}/settings/cluster/{{ end }}{{ end }}.
          summary: Cluster operator has been degraded for 30 minutes.
        expr: |
          (
            cluster_operator_conditions{job="cluster-version-operator", condition="Degraded"}
            or on (name)
            group by (name) (cluster_operator_up{job="cluster-version-operator"})
          ) == 1
        for: 30m
        labels:
          severity: warning
.....
Version-Release number of selected component (if applicable):
4.9.0-0.nightly-2021-08-02-145924

How reproducible:
only with 4.9 payload

Steps to Reproduce:
1. Inspect the ClusterOperatorDegraded rule with the oc command shown in the description.

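Actual results:
The alert description reads "The {{ $labels.name }} operator id degraded because".

Expected results:
The alert description reads "The {{ $labels.name }} operator is degraded because".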

Additional info:

Comment 2 Junqi Zhao 2021-08-09 05:16:50 UTC

Checked with 4.9.0-0.nightly-2021-08-07-175228; the issue is fixed.
# oc -n openshift-monitoring get cm prometheus-k8s-rulefiles-0 -oyaml | grep ClusterOperatorDegraded -A17
      - alert: ClusterOperatorDegraded
        annotations:
          description: The {{ $labels.name }} operator is degraded because {{ $labels.reason
            }}, and the components it manages may have reduced quality of service.  Cluster
            upgrades may not complete. For more information refer to 'oc get -o yaml clusteroperator
            {{ $labels.name }}'{{ with $console_url := "console_url" | query }}{{ if ne
            (len (label "url" (first $console_url ) ) ) 0}} or {{ label "url" (first $console_url
            ) }}/settings/cluster/{{ end }}{{ end }}.
          summary: Cluster operator has been degraded for 30 minutes.
        expr: |
          (
            cluster_operator_conditions{job="cluster-version-operator", condition="Degraded"}
            or on (name)
            group by (name) (cluster_operator_up{job="cluster-version-operator"})
          ) == 1
        for: 30m
        labels:
          severity: warning
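
The same verification could be scripted instead of read off by eye. The following is a minimal sketch, assuming a bash-like shell with `tr` available; the function name check_degraded_typo is hypothetical and is not part of oc or the cluster-version-operator. It reads the rule-file YAML on stdin and collapses whitespace first, so the phrase is still matched if the folded description happens to wrap in the middle of it:

```shell
# Hypothetical helper (not part of oc): reads rule-file YAML on stdin and
# reports whether the "operator id degraded" typo is still present.
check_degraded_typo() {
  local text
  # Turn newlines and runs of whitespace into single spaces so that a
  # folded YAML scalar is searched as one continuous string.
  text=$(tr -s '[:space:]' ' ')
  case "$text" in
    *'operator id degraded'*)
      echo "typo present"
      return 1
      ;;
    *)
      echo "typo not found"
      return 0
      ;;
  esac
}
```

Usage, assuming access to a cluster:
oc -n openshift-monitoring get cm prometheus-k8s-rulefiles-0 -oyaml | check_degraded_typo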

Comment 9 errata-xmlrpc 2021-10-18 17:44:09 UTC

Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Moderate: OpenShift Container Platform
4.9.0 bug fix and security update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2021:3759