Bug 1933805
| Summary: | TargetDown alert fires during upgrades because of normal upgrade behavior | | |
|---|---|---|---|
| Product: | OpenShift Container Platform | Reporter: | Clayton Coleman <ccoleman> |
| Component: | Machine Config Operator | Assignee: | Clayton Coleman <ccoleman> |
| Status: | CLOSED ERRATA | QA Contact: | Michael Nguyen <mnguyen> |
| Severity: | high | Docs Contact: | |
| Priority: | unspecified | | |
| Version: | 4.8 | CC: | alegrand, anpicker, erooth, jerzhang, kakkoyun, lcosic, miabbott, pkrupa, rioliu, surbania |
| Target Milestone: | --- | | |
| Target Release: | 4.8.0 | | |
| Hardware: | Unspecified | | |
| OS: | Unspecified | | |
| Whiteboard: | | | |
| Fixed In Version: | | Doc Type: | If docs needed, set a value |
| Doc Text: | | Story Points: | --- |
| Clone Of: | | Environment: | |
| Last Closed: | 2021-07-27 22:48:44 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | | Category: | --- |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | | | |
| Attachments: | | | |
Description
Clayton Coleman
2021-03-01 18:44:28 UTC
Created attachment 1764135
TargetDown alert rule definition in console UI
Sanity verification was done with 4.8.0-0.nightly-2021-03-17-123640 on AWS.
Confirmed that the TargetDown alerting rule was properly updated (see attached screenshot) and that the `relabelings` rule was added to the machine-config-daemon (MCD) ServiceMonitor.
```
$ oc get clusterversion
NAME      VERSION                             AVAILABLE   PROGRESSING   SINCE   STATUS
version   4.8.0-0.nightly-2021-03-17-123640   True        False         14m     Cluster version is 4.8.0-0.nightly-2021-03-17-123640
$ oc -n openshift-machine-config-operator get servicemonitor/machine-config-daemon -o json | jq .spec.endpoints
[
  {
    "bearerTokenFile": "/var/run/secrets/kubernetes.io/serviceaccount/token",
    "interval": "30s",
    "path": "/metrics",
    "port": "metrics",
    "relabelings": [
      {
        "action": "replace",
        "regex": ";(.*)",
        "replacement": "$1",
        "separator": ";",
        "sourceLabels": [
          "node",
          "__meta_kubernetes_pod_node_name"
        ],
        "targetLabel": "node"
      }
    ],
    "scheme": "https",
    "tlsConfig": {
      "caFile": "/etc/prometheus/configmaps/serving-certs-ca-bundle/service-ca.crt",
      "serverName": "machine-config-daemon.openshift-machine-config-operator.svc"
    }
  }
]
```
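To make the effect of the `relabelings` entry easier to follow, here is a minimal Python sketch of how a Prometheus `replace` relabel action behaves with these exact settings. It is an illustration under stated assumptions, not Prometheus code: the helper `relabel_replace` and the node names `worker-0` and `worker-1` are hypothetical, and the `$1` handling is simplified to numeric group references only.

```python
import re


def relabel_replace(labels, source_labels, separator, regex, replacement, target_label):
    """Simulate a Prometheus `replace` relabel action on a dict of labels.

    Hypothetical helper for illustration only; not part of any library.
    """
    # Source label values are joined with the separator; a missing label counts as "".
    joined = separator.join(labels.get(name, "") for name in source_labels)

    # Prometheus anchors relabel regexes at both ends; fullmatch reproduces that.
    match = re.fullmatch(regex, joined, flags=re.DOTALL)
    if match is None:
        return dict(labels)  # no match: the label set is left unchanged

    # Simplification: translate "$1"-style references to Python's "\g<1>" form.
    template = re.sub(r"\$(\d+)", r"\\g<\1>", replacement)
    value = match.expand(template)

    result = dict(labels)
    if value == "":
        result.pop(target_label, None)  # an empty result removes the target label
    else:
        result[target_label] = value
    return result


# Settings copied from the ServiceMonitor output above.
mcd_rule = dict(
    source_labels=["node", "__meta_kubernetes_pod_node_name"],
    separator=";",
    regex=";(.*)",
    replacement="$1",
    target_label="node",
)

# Hypothetical targets: one without a `node` label, one that already has it.
without_node = {"__meta_kubernetes_pod_node_name": "worker-0"}
with_node = {"node": "worker-1", "__meta_kubernetes_pod_node_name": "worker-0"}

print(relabel_replace(without_node, **mcd_rule))
print(relabel_replace(with_node, **mcd_rule))
```

Because the regex starts with the separator, it matches only when the existing `node` value is empty. In that case `node` is filled in from `__meta_kubernetes_pod_node_name`; a target that already carries a `node` label is left as it is.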
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Moderate: OpenShift Container Platform 4.8.2 bug fix and security update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report.
https://access.redhat.com/errata/RHSA-2021:2438