Bug 1996941
| Summary: | Monitoring operator is degraded because expected 8 ready pods for "node-exporter" daemonset but got 6 when upgrading windows cluster to 4.9 | ||
|---|---|---|---|
| Product: | OpenShift Container Platform | Reporter: | Yang Yang <yanyang> |
| Component: | Monitoring | Assignee: | Prashant Balachandran <pnair> |
| Status: | CLOSED ERRATA | QA Contact: | Junqi Zhao <juzhao> |
| Severity: | high | Docs Contact: | |
| Priority: | high | ||
| Version: | 4.9 | CC: | amuller, anpicker, aos-bugs, arajkuma, erooth, hongyli, spasquie |
| Target Milestone: | --- | Keywords: | Upgrades |
| Target Release: | 4.9.0 | ||
| Hardware: | Unspecified | ||
| OS: | Unspecified | ||
| Whiteboard: | |||
| Fixed In Version: | Doc Type: | If docs needed, set a value | |
| Doc Text: | Story Points: | --- | |
| Clone Of: | Environment: | ||
| Last Closed: | 2021-10-18 17:48:10 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | Category: | --- | |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | |||
|
Description
Yang Yang
2021-08-24 04:15:16 UTC
Profile: 53_IPI on AWS & OVN & WindowsContainer
upgraded from 4.8.6 to 4.9.0-0.nightly-2021-08-26-164418 with windows nodes, monitoring upgrade is successful
# oc get co monitoring
NAME VERSION AVAILABLE PROGRESSING DEGRADED SINCE MESSAGE
monitoring 4.9.0-0.nightly-2021-08-26-164418 True False False 157m
# oc get node -o wide
NAME STATUS ROLES AGE VERSION INTERNAL-IP EXTERNAL-IP OS-IMAGE KERNEL-VERSION CONTAINER-RUNTIME
ip-10-0-131-61.us-east-2.compute.internal Ready worker 116m v1.21.1-1397+a678cfd2c37e87 10.0.131.61 <none> Windows Server 2019 Datacenter 10.0.17763.2114 docker://20.10.6
ip-10-0-149-100.us-east-2.compute.internal Ready worker 110m v1.21.1-1397+a678cfd2c37e87 10.0.149.100 <none> Windows Server 2019 Datacenter 10.0.17763.2114 docker://20.10.6
ip-10-0-153-190.us-east-2.compute.internal Ready master 161m v1.21.1+9807387 10.0.153.190 <none> Red Hat Enterprise Linux CoreOS 48.84.202108161759-0 (Ootpa) 4.18.0-305.12.1.el8_4.x86_64 cri-o://1.21.2-11.rhaos4.8.git5d31399.el8
ip-10-0-153-192.us-east-2.compute.internal Ready worker 152m v1.21.1+9807387 10.0.153.192 <none> Red Hat Enterprise Linux CoreOS 48.84.202108161759-0 (Ootpa) 4.18.0-305.12.1.el8_4.x86_64 cri-o://1.21.2-11.rhaos4.8.git5d31399.el8
ip-10-0-164-193.us-east-2.compute.internal Ready worker 154m v1.21.1+9807387 10.0.164.193 <none> Red Hat Enterprise Linux CoreOS 48.84.202108161759-0 (Ootpa) 4.18.0-305.12.1.el8_4.x86_64 cri-o://1.21.2-11.rhaos4.8.git5d31399.el8
ip-10-0-170-80.us-east-2.compute.internal Ready master 161m v1.21.1+9807387 10.0.170.80 <none> Red Hat Enterprise Linux CoreOS 48.84.202108161759-0 (Ootpa) 4.18.0-305.12.1.el8_4.x86_64 cri-o://1.21.2-11.rhaos4.8.git5d31399.el8
ip-10-0-195-55.us-east-2.compute.internal Ready master 161m v1.21.1+9807387 10.0.195.55 <none> Red Hat Enterprise Linux CoreOS 48.84.202108161759-0 (Ootpa) 4.18.0-305.12.1.el8_4.x86_64 cri-o://1.21.2-11.rhaos4.8.git5d31399.el8
ip-10-0-213-141.us-east-2.compute.internal Ready worker 152m v1.21.1+9807387 10.0.213.141 <none> Red Hat Enterprise Linux CoreOS 48.84.202108161759-0 (Ootpa) 4.18.0-305.12.1.el8_4.x86_64 cri-o://1.21.2-11.rhaos4.8.git5d31399.el8
# oc -n openshift-monitoring get ds
NAME DESIRED CURRENT READY UP-TO-DATE AVAILABLE NODE SELECTOR AGE
node-exporter 6 6 6 6 6 kubernetes.io/os=linux 159m
# oc get node -l kubernetes.io/os=linux
NAME STATUS ROLES AGE VERSION
ip-10-0-153-190.us-east-2.compute.internal Ready master 163m v1.21.1+9807387
ip-10-0-153-192.us-east-2.compute.internal Ready worker 153m v1.21.1+9807387
ip-10-0-164-193.us-east-2.compute.internal Ready worker 155m v1.21.1+9807387
ip-10-0-170-80.us-east-2.compute.internal Ready master 163m v1.21.1+9807387
ip-10-0-195-55.us-east-2.compute.internal Ready master 163m v1.21.1+9807387
ip-10-0-213-141.us-east-2.compute.internal Ready worker 153m v1.21.1+9807387
# oc -n openshift-monitoring get pod -o wide | grep node-exporter | awk '{print $7}'
ip-10-0-213-141.us-east-2.compute.internal
ip-10-0-195-55.us-east-2.compute.internal
ip-10-0-164-193.us-east-2.compute.internal
ip-10-0-170-80.us-east-2.compute.internal
ip-10-0-153-190.us-east-2.compute.internal
ip-10-0-153-192.us-east-2.compute.internal
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Moderate: OpenShift Container Platform 4.9.0 bug fix and security update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHSA-2021:3759 |