Bug 2010365
| Summary: | OpenShift Alerting Rules Style-Guide Compliance | | |
|---|---|---|---|
| Product: | OpenShift Container Platform | Reporter: | Brad Ison <brad.ison> |
| Component: | Cluster Version Operator | Assignee: | David Hurta <dhurta> |
| Status: | CLOSED ERRATA | QA Contact: | liujia <jiajliu> |
| Severity: | low | Docs Contact: | |
| Priority: | low | | |
| Version: | 4.10 | CC: | aos-bugs, dhurta, jiajliu, vrutkovs |
| Target Milestone: | --- | | |
| Target Release: | 4.12.0 | | |
| Hardware: | Unspecified | | |
| OS: | Unspecified | | |
| Whiteboard: | | | |
| Fixed In Version: | | Doc Type: | No Doc Update |
| Doc Text: | | Story Points: | --- |
| Clone Of: | | Environment: | |
| Last Closed: | 2023-01-17 19:46:45 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | | Category: | --- |
Description
Brad Ison
2021-10-04 13:59:29 UTC
*** Bug 2021130 has been marked as a duplicate of this bug. ***

> * Alerts found to not include a namespace label:
>   - ClusterNotUpgradeable
>   - ClusterOperatorDegraded

Tried to reproduce on v4.10.26.

1. Trigger the ClusterOperatorDegraded alert.

    # curl -s -k -H "Authorization: Bearer $token" https://$route/api/v1/alerts | jq -r '.data.alerts[]| select(.labels.alertname == "ClusterOperatorDegraded").labels'
    {
      "alertname": "ClusterOperatorDegraded",
      "condition": "Degraded",
      "endpoint": "metrics",
      "instance": "10.0.0.6:9099",
      "job": "cluster-version-operator",
      "name": "authentication",
      "namespace": "openshift-cluster-version", // namespace label is already included in the ClusterOperatorDegraded alert
      "pod": "cluster-version-operator-64bb7d76f4-bn2hx",
      "reason": "OAuthServerConfigObservation_Error",
      "service": "cluster-version-operator",
      "severity": "warning"
    }

2. Trigger the ClusterNotUpgradeable alert.

    # curl -s -k -H "Authorization: Bearer $token" https://$route/api/v1/alerts | jq -r '.data.alerts[]| select(.labels.alertname == "ClusterNotUpgradeable").labels'
    {
      "alertname": "ClusterNotUpgradeable",
      "condition": "Upgradeable",
      "endpoint": "metrics",
      "name": "version",
      "severity": "info"
    }
    // namespace label is missing from the ClusterNotUpgradeable alert

3. Trigger the ClusterOperatorDown alert.

    # curl -s -k -H "Authorization: Bearer $token" https://$route/api/v1/alerts | jq -r '.data.alerts[]| select(.labels.alertname == "ClusterOperatorDown").labels'
    {
      "alertname": "ClusterOperatorDown",
      "endpoint": "metrics",
      "instance": "10.0.0.6:9099",
      "job": "cluster-version-operator",
      "name": "machine-config",
      "namespace": "openshift-cluster-version", // namespace label is already included in the ClusterOperatorDown alert
      "pod": "cluster-version-operator-64bb7d76f4-bn2hx",
      "service": "cluster-version-operator",
      "severity": "critical",
      "version": "4.10.26"
    }

4. Trigger the CannotRetrieveUpdates alert.

    # curl -s -k -H "Authorization: Bearer $token" https://$route/api/v1/alerts | jq -r '.data.alerts[]| select(.labels.alertname == "CannotRetrieveUpdates")|.labels'
    {
      "alertname": "CannotRetrieveUpdates",
      "endpoint": "metrics",
      "instance": "10.0.0.6:9099",
      "job": "cluster-version-operator",
      "namespace": "openshift-cluster-version", // namespace label is already included in the CannotRetrieveUpdates alert
      "pod": "cluster-version-operator-64bb7d76f4-bn2hx",
      "service": "cluster-version-operator",
      "severity": "warning"
    }

Based on the reproduction above, only the ClusterNotUpgradeable alert needs the namespace label added.

@Brad Could you confirm the ClusterOperatorDegraded alert issue from the bug description? QE can only reproduce it for the ClusterNotUpgradeable alert.

The reporter, Brad Ison from the Monitoring team, appears to be unavailable now (deactivated account). QE plans to verify the ClusterNotUpgradeable alert, since it turned out to be the only one missing the namespace label.

Verified on 4.12.0-0.nightly-2022-08-17-053740

    # curl -s -k -H "Authorization: Bearer $token" https://$route/api/v1/alerts | jq -r '.data.alerts[]| select(.labels.alertname == "ClusterNotUpgradeable")|.labels'
    {
      "alertname": "ClusterNotUpgradeable",
      "condition": "Upgradeable",
      "endpoint": "metrics",
      "name": "version",
      "namespace": "openshift-cluster-version",
      "severity": "info"
    }

Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA.

For information on the advisory (Moderate: OpenShift Container Platform 4.12.0 bug fix and security update), and where to find the updated files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2022:7399
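
The reproduction in this report queries one alert at a time and inspects its labels by eye. As a possible shortcut, the same check could be done in a single pass that lists every firing alert whose label set has no `namespace` key. This is only a sketch and not part of the original report: the function name `alerts_missing_namespace` is hypothetical, and it assumes `jq` is installed and that the input has the same `.data.alerts[].labels` shape as the responses shown in the report.

```shell
#!/bin/sh
# Hypothetical helper (not from the bug report).
# Reads an /api/v1/alerts JSON response on stdin and prints the name of
# every alert whose labels lack a "namespace" key, sorted and de-duplicated,
# one name per line. Prints nothing if every alert carries the label.
alerts_missing_namespace() {
  jq -r '[.data.alerts[]
          | select(.labels | has("namespace") | not)
          | .labels.alertname]
         | unique
         | .[]'
}
```

Against a live cluster it would presumably be fed the same way as the commands in the report, for example `curl -s -k -H "Authorization: Bearer $token" https://$route/api/v1/alerts | alerts_missing_namespace`, where `$token` and `$route` are the same placeholders the report uses. An alert only shows up while it is firing, so each alert still has to be triggered first.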