Bug 1917667
| Summary: | Master machine config pool updates are stalled during the migration from SDN to OVNKube. | ||
|---|---|---|---|
| Product: | OpenShift Container Platform | Reporter: | Archana Prabhakar <aprabhak> |
| Component: | Networking | Assignee: | Peng Liu <pliu> |
| Networking sub component: | openshift-sdn | QA Contact: | huirwang |
| Status: | CLOSED ERRATA | Docs Contact: | |
| Severity: | high | ||
| Priority: | medium | CC: | aconstan, danili, dosmith, kgarriso, lmcfadde, mtarsel, pdsilva, pliu, psundara, tdale |
| Version: | 4.7 | ||
| Target Milestone: | --- | ||
| Target Release: | 4.8.0 | ||
| Hardware: | ppc64le | ||
| OS: | Linux | ||
| Whiteboard: | |||
| Fixed In Version: | Doc Type: | If docs needed, set a value | |
| Doc Text: | Story Points: | --- | |
| Clone Of: | Environment: | ||
| Last Closed: | 2021-07-27 22:36:15 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | Category: | --- | |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | |||
Description
Archana Prabhakar
2021-01-19 05:50:20 UTC
Additional info:
# Adding node and MCO related events output. This data shows that the node drain does not happen on master-1 and master-2, so the machine config pool update cannot start on those nodes.
```
# oc get events
5h9m Normal OperatorVersionChanged /machine-config clusteroperator/machine-config-operator started a version change from [] to [{operator 4.7.0-0.nightly-ppc64le-2021-01-18-024748}]
5h7m Normal OperatorVersionChanged /machine-config clusteroperator/machine-config-operator version changed from [] to [{operator 4.7.0-0.nightly-ppc64le-2021-01-18-024748}]
5h1m Normal NodeHasSufficientMemory node/master-0 Node master-0 status is now: NodeHasSufficientMemory
5h2m Normal NodeHasNoDiskPressure node/master-0 Node master-0 status is now: NodeHasNoDiskPressure
5h10m Normal RegisteredNode node/master-0 Node master-0 event: Registered Node master-0 in Controller
5h9m Normal Starting node/master-0 openshift-sdn done initializing node networking.
5h8m Normal NodeDone node/master-0 Setting node master-0, currentConfig rendered-master-0b5c44dc253e57776c4ead0f3bf7fc43 to Done
5h6m Normal RegisteredNode node/master-0 Node master-0 event: Registered Node master-0 in Controller
5h4m Normal RegisteredNode node/master-0 Node master-0 event: Registered Node master-0 in Controller
5h4m Normal NodeNotReady node/master-0 Node master-0 status is now: NodeNotReady
5h1m Normal RegisteredNode node/master-0 Node master-0 event: Registered Node master-0 in Controller
5h1m Normal NodeNotReady node/master-0 Node master-0 status is now: NodeNotReady
5h Normal RegisteredNode node/master-0 Node master-0 event: Registered Node master-0 in Controller
4h58m Normal RegisteredNode node/master-0 Node master-0 event: Registered Node master-0 in Controller
4h57m Normal RegisteredNode node/master-0 Node master-0 event: Registered Node master-0 in Controller
3h37m Normal Drain node/master-0 Draining node to update config.
3h37m Normal NodeNotSchedulable node/master-0 Node master-0 status is now: NodeNotSchedulable
3h35m Normal OSUpdateStarted node/master-0
3h35m Normal OSUpdateStaged node/master-0 Changes to OS staged
3h52m Normal PendingConfig node/master-0 Written pending config rendered-master-fbe0b855c426cca34f099204f8149f73
3h52m Normal SkipReboot node/master-0 Config changes do not require reboot.
3h52m Normal NodeDone node/master-0 Setting node master-0, currentConfig rendered-master-fbe0b855c426cca34f099204f8149f73 to Done
3h35m Normal NodeSchedulable node/master-0 Node master-0 status is now: NodeSchedulable
3h35m Normal PendingConfig node/master-0 Written pending config rendered-master-7fd61bf26aa8bc5527e461c69134d6e4
3h35m Normal SkipReboot node/master-0 Config changes do not require reboot. Service crio was reloaded.
3h35m Normal NodeDone node/master-0 Setting node master-0, currentConfig rendered-master-7fd61bf26aa8bc5527e461c69134d6e4 to Done
178m Normal RegisteredNode node/master-0 Node master-0 event: Registered Node master-0 in Controller
177m Normal NodeNotReady node/master-0 Node master-0 status is now: NodeNotReady
175m Normal Starting node/master-0 Starting kubelet.
175m Normal NodeHasSufficientMemory node/master-0 Node master-0 status is now: NodeHasSufficientMemory
175m Normal NodeHasNoDiskPressure node/master-0 Node master-0 status is now: NodeHasNoDiskPressure
175m Normal NodeHasSufficientPID node/master-0 Node master-0 status is now: NodeHasSufficientPID
175m Warning Rebooted node/master-0 Node master-0 has been rebooted, boot id: ee20a5b0-1730-4c1f-998c-9b814bebcb72
175m Normal NodeNotReady node/master-0 Node master-0 status is now: NodeNotReady
175m Normal NodeAllocatableEnforced node/master-0 Updated Node Allocatable limit across pods
174m Normal NodeReady node/master-0 Node master-0 status is now: NodeReady
170m Normal RegisteredNode node/master-0 Node master-0 event: Registered Node master-0 in Controller
160m Normal RegisteredNode node/master-0 Node master-0 event: Registered Node master-0 in Controller
59m Normal Drain node/master-0 Draining node to update config.
130m Normal NodeNotSchedulable node/master-0 Node master-0 status is now: NodeNotSchedulable
59m Warning FailedToDrain node/master-0 5 tries: error when evicting pod "etcd-quorum-guard-7db666dcff-p4sf7": global timeout reached: 1m30s
49m Normal OSUpdateStarted node/master-0
49m Normal OSUpdateStaged node/master-0 Changes to OS staged
49m Normal PendingConfig node/master-0 Written pending config rendered-master-8b39ed05a2ffacc7c762c92801ed688d
49m Normal Reboot node/master-0 Node will reboot into config rendered-master-8b39ed05a2ffacc7c762c92801ed688d
44m Normal Starting node/master-0 Starting kubelet.
44m Normal NodeHasSufficientMemory node/master-0 Node master-0 status is now: NodeHasSufficientMemory
44m Normal NodeHasNoDiskPressure node/master-0 Node master-0 status is now: NodeHasNoDiskPressure
44m Normal NodeHasSufficientPID node/master-0 Node master-0 status is now: NodeHasSufficientPID
44m Normal NodeAllocatableEnforced node/master-0 Updated Node Allocatable limit across pods
43m Normal RegisteredNode node/master-0 Node master-0 event: Registered Node master-0 in Controller
42m Normal NodeDone node/master-0 Setting node master-0, currentConfig rendered-master-8b39ed05a2ffacc7c762c92801ed688d to Done
40m Normal RegisteredNode node/master-0 Node master-0 event: Registered Node master-0 in Controller
5h1m Normal NodeHasSufficientMemory node/master-1 Node master-1 status is now: NodeHasSufficientMemory
5h1m Normal NodeHasNoDiskPressure node/master-1 Node master-1 status is now: NodeHasNoDiskPressure
5h10m Normal RegisteredNode node/master-1 Node master-1 event: Registered Node master-1 in Controller
5h9m Normal Starting node/master-1 openshift-sdn done initializing node networking.
5h8m Normal NodeDone node/master-1 Setting node master-1, currentConfig rendered-master-0b5c44dc253e57776c4ead0f3bf7fc43 to Done
5h6m Normal RegisteredNode node/master-1 Node master-1 event: Registered Node master-1 in Controller
5h4m Normal RegisteredNode node/master-1 Node master-1 event: Registered Node master-1 in Controller
5h1m Normal RegisteredNode node/master-1 Node master-1 event: Registered Node master-1 in Controller
5h1m Normal NodeNotReady node/master-1 Node master-1 status is now: NodeNotReady
5h Normal RegisteredNode node/master-1 Node master-1 event: Registered Node master-1 in Controller
4h58m Normal RegisteredNode node/master-1 Node master-1 event: Registered Node master-1 in Controller
4h57m Normal RegisteredNode node/master-1 Node master-1 event: Registered Node master-1 in Controller
3h34m Normal Drain node/master-1 Draining node to update config.
3h34m Normal NodeNotSchedulable node/master-1 Node master-1 status is now: NodeNotSchedulable
3h30m Normal OSUpdateStarted node/master-1
3h30m Normal OSUpdateStaged node/master-1 Changes to OS staged
3h46m Normal PendingConfig node/master-1 Written pending config rendered-master-fbe0b855c426cca34f099204f8149f73
3h46m Normal SkipReboot node/master-1 Config changes do not require reboot.
3h46m Normal NodeDone node/master-1 Setting node master-1, currentConfig rendered-master-fbe0b855c426cca34f099204f8149f73 to Done
3h30m Normal NodeSchedulable node/master-1 Node master-1 status is now: NodeSchedulable
3h30m Normal PendingConfig node/master-1 Written pending config rendered-master-7fd61bf26aa8bc5527e461c69134d6e4
3h30m Normal SkipReboot node/master-1 Config changes do not require reboot. Service crio was reloaded.
3h30m Normal NodeDone node/master-1 Setting node master-1, currentConfig rendered-master-7fd61bf26aa8bc5527e461c69134d6e4 to Done
178m Normal RegisteredNode node/master-1 Node master-1 event: Registered Node master-1 in Controller
170m Normal RegisteredNode node/master-1 Node master-1 event: Registered Node master-1 in Controller
170m Normal NodeNotReady node/master-1 Node master-1 status is now: NodeNotReady
168m Normal Starting node/master-1 Starting kubelet.
168m Normal NodeHasSufficientMemory node/master-1 Node master-1 status is now: NodeHasSufficientMemory
168m Normal NodeHasNoDiskPressure node/master-1 Node master-1 status is now: NodeHasNoDiskPressure
168m Normal NodeHasSufficientPID node/master-1 Node master-1 status is now: NodeHasSufficientPID
168m Warning Rebooted node/master-1 Node master-1 has been rebooted, boot id: 4315fec9-3dc0-4bae-bcb7-8e882e3ad211
168m Normal NodeNotReady node/master-1 Node master-1 status is now: NodeNotReady
168m Normal NodeAllocatableEnforced node/master-1 Updated Node Allocatable limit across pods
167m Normal NodeReady node/master-1 Node master-1 status is now: NodeReady
160m Normal RegisteredNode node/master-1 Node master-1 event: Registered Node master-1 in Controller
43m Normal RegisteredNode node/master-1 Node master-1 event: Registered Node master-1 in Controller
40m Normal RegisteredNode node/master-1 Node master-1 event: Registered Node master-1 in Controller
5h9m Normal RegisteredNode node/master-2 Node master-2 event: Registered Node master-2 in Controller
5h9m Normal Starting node/master-2 openshift-sdn done initializing node networking.
5h8m Normal NodeDone node/master-2 Setting node master-2, currentConfig rendered-master-0b5c44dc253e57776c4ead0f3bf7fc43 to Done
5h6m Normal RegisteredNode node/master-2 Node master-2 event: Registered Node master-2 in Controller
5h4m Normal RegisteredNode node/master-2 Node master-2 event: Registered Node master-2 in Controller
5h1m Normal RegisteredNode node/master-2 Node master-2 event: Registered Node master-2 in Controller
5h Normal RegisteredNode node/master-2 Node master-2 event: Registered Node master-2 in Controller
4h58m Normal RegisteredNode node/master-2 Node master-2 event: Registered Node master-2 in Controller
4h57m Normal RegisteredNode node/master-2 Node master-2 event: Registered Node master-2 in Controller
3h35m Normal Drain node/master-2 Draining node to update config.
3h35m Normal NodeNotSchedulable node/master-2 Node master-2 status is now: NodeNotSchedulable
3h34m Normal OSUpdateStarted node/master-2
3h34m Normal OSUpdateStaged node/master-2 Changes to OS staged
3h45m Normal PendingConfig node/master-2 Written pending config rendered-master-fbe0b855c426cca34f099204f8149f73
3h45m Normal SkipReboot node/master-2 Config changes do not require reboot.
3h45m Normal NodeDone node/master-2 Setting node master-2, currentConfig rendered-master-fbe0b855c426cca34f099204f8149f73 to Done
3h34m Normal NodeSchedulable node/master-2 Node master-2 status is now: NodeSchedulable
3h34m Normal PendingConfig node/master-2 Written pending config rendered-master-7fd61bf26aa8bc5527e461c69134d6e4
3h34m Normal SkipReboot node/master-2 Config changes do not require reboot. Service crio was reloaded.
3h34m Normal NodeDone node/master-2 Setting node master-2, currentConfig rendered-master-7fd61bf26aa8bc5527e461c69134d6e4 to Done
178m Normal RegisteredNode node/master-2 Node master-2 event: Registered Node master-2 in Controller
170m Normal RegisteredNode node/master-2 Node master-2 event: Registered Node master-2 in Controller
160m Normal Starting node/master-2 Starting kubelet.
160m Normal NodeHasSufficientMemory node/master-2 Node master-2 status is now: NodeHasSufficientMemory
160m Normal NodeHasNoDiskPressure node/master-2 Node master-2 status is now: NodeHasNoDiskPressure
160m Normal NodeHasSufficientPID node/master-2 Node master-2 status is now: NodeHasSufficientPID
160m Normal NodeAllocatableEnforced node/master-2 Updated Node Allocatable limit across pods
160m Normal RegisteredNode node/master-2 Node master-2 event: Registered Node master-2 in Controller
43m Normal RegisteredNode node/master-2 Node master-2 event: Registered Node master-2 in Controller
40m Normal RegisteredNode node/master-2 Node master-2 event: Registered Node master-2 in Controller
5h8m Normal AnnotationChange machineconfigpool/master Node master-2 now has machineconfiguration.openshift.io/state=Degraded
5h8m Normal AnnotationChange machineconfigpool/master Node master-1 now has machineconfiguration.openshift.io/state=Done
5h8m Normal AnnotationChange machineconfigpool/master Node master-2 now has machineconfiguration.openshift.io/state=Done
5h8m Normal AnnotationChange machineconfigpool/master Node master-0 now has machineconfiguration.openshift.io/state=Done
3h53m Normal SetDesiredConfig machineconfigpool/master Targeted node master-0 to config rendered-master-fbe0b855c426cca34f099204f8149f73
3h53m Normal AnnotationChange machineconfigpool/master Node master-0 now has machineconfiguration.openshift.io/desiredConfig=rendered-master-fbe0b855c426cca34f099204f8149f73
3h53m Normal AnnotationChange machineconfigpool/master Node master-0 now has machineconfiguration.openshift.io/state=Working
3h52m Normal SetDesiredConfig machineconfigpool/master Targeted node master-1 to config rendered-master-fbe0b855c426cca34f099204f8149f73
3h52m Normal AnnotationChange machineconfigpool/master Node master-1 now has machineconfiguration.openshift.io/desiredConfig=rendered-master-fbe0b855c426cca34f099204f8149f73
3h52m Normal AnnotationChange machineconfigpool/master Node master-1 now has machineconfiguration.openshift.io/state=Working
3h46m Normal SetDesiredConfig machineconfigpool/master Targeted node master-2 to config rendered-master-fbe0b855c426cca34f099204f8149f73
3h46m Normal AnnotationChange machineconfigpool/master Node master-2 now has machineconfiguration.openshift.io/desiredConfig=rendered-master-fbe0b855c426cca34f099204f8149f73
3h46m Normal AnnotationChange machineconfigpool/master Node master-2 now has machineconfiguration.openshift.io/state=Working
3h37m Normal SetDesiredConfig machineconfigpool/master Targeted node master-0 to config rendered-master-7fd61bf26aa8bc5527e461c69134d6e4
3h37m Normal AnnotationChange machineconfigpool/master Node master-0 now has machineconfiguration.openshift.io/desiredConfig=rendered-master-7fd61bf26aa8bc5527e461c69134d6e4
3h37m Normal AnnotationChange machineconfigpool/master Node master-0 now has machineconfiguration.openshift.io/state=Working
3h35m Normal SetDesiredConfig machineconfigpool/master Targeted node master-2 to config rendered-master-7fd61bf26aa8bc5527e461c69134d6e4
3h35m Normal AnnotationChange machineconfigpool/master Node master-2 now has machineconfiguration.openshift.io/desiredConfig=rendered-master-7fd61bf26aa8bc5527e461c69134d6e4
3h35m Normal AnnotationChange machineconfigpool/master Node master-2 now has machineconfiguration.openshift.io/state=Working
3h34m Normal SetDesiredConfig machineconfigpool/master Targeted node master-1 to config rendered-master-7fd61bf26aa8bc5527e461c69134d6e4
3h34m Normal AnnotationChange machineconfigpool/master Node master-1 now has machineconfiguration.openshift.io/desiredConfig=rendered-master-7fd61bf26aa8bc5527e461c69134d6e4
3h34m Normal AnnotationChange machineconfigpool/master Node master-1 now has machineconfiguration.openshift.io/state=Working
130m Normal SetDesiredConfig machineconfigpool/master Targeted node master-0 to config rendered-master-8b39ed05a2ffacc7c762c92801ed688d
130m Normal AnnotationChange machineconfigpool/master Node master-0 now has machineconfiguration.openshift.io/desiredConfig=rendered-master-8b39ed05a2ffacc7c762c92801ed688d
130m Normal AnnotationChange machineconfigpool/master Node master-0 now has machineconfiguration.openshift.io/state=Working
120m Normal AnnotationChange machineconfigpool/master Node master-0 now has machineconfiguration.openshift.io/state=Degraded
36m Normal SetDesiredConfig machineconfigpool/master Targeted node master-1 to config rendered-master-8b39ed05a2ffacc7c762c92801ed688d
36m Normal AnnotationChange machineconfigpool/master Node master-1 now has machineconfiguration.openshift.io/desiredConfig=rendered-master-8b39ed05a2ffacc7c762c92801ed688d
4h54m Normal RegisteredNode node/worker-0 Node worker-0 event: Registered Node worker-0 in Controller
4h53m Normal Starting node/worker-0 openshift-sdn done initializing node networking.
4h53m Normal NodeDone node/worker-0 Setting node worker-0, currentConfig rendered-worker-aa2999cca8e237b2a24cf4c1d5123a72 to Done
3h37m Normal Drain node/worker-0 Draining node to update config.
3h37m Normal NodeNotSchedulable node/worker-0 Node worker-0 status is now: NodeNotSchedulable
3h35m Normal OSUpdateStarted node/worker-0
3h35m Normal OSUpdateStaged node/worker-0 Changes to OS staged
3h52m Normal PendingConfig node/worker-0 Written pending config rendered-worker-53036b57fbb35b65691bd0423ad209ef
3h52m Normal SkipReboot node/worker-0 Config changes do not require reboot.
3h52m Normal NodeDone node/worker-0 Setting node worker-0, currentConfig rendered-worker-53036b57fbb35b65691bd0423ad209ef to Done
3h35m Normal NodeSchedulable node/worker-0 Node worker-0 status is now: NodeSchedulable
3h35m Normal PendingConfig node/worker-0 Written pending config rendered-worker-b9fd2121252e22fbaa0bcdd29b67f5eb
3h35m Normal SkipReboot node/worker-0 Config changes do not require reboot. Service crio was reloaded.
3h35m Normal NodeDone node/worker-0 Setting node worker-0, currentConfig rendered-worker-b9fd2121252e22fbaa0bcdd29b67f5eb to Done
178m Normal RegisteredNode node/worker-0 Node worker-0 event: Registered Node worker-0 in Controller
170m Normal RegisteredNode node/worker-0 Node worker-0 event: Registered Node worker-0 in Controller
160m Normal RegisteredNode node/worker-0 Node worker-0 event: Registered Node worker-0 in Controller
156m Normal NodeNotReady node/worker-0 Node worker-0 status is now: NodeNotReady
153m Normal Starting node/worker-0 Starting kubelet.
153m Normal NodeHasSufficientMemory node/worker-0 Node worker-0 status is now: NodeHasSufficientMemory
153m Normal NodeHasNoDiskPressure node/worker-0 Node worker-0 status is now: NodeHasNoDiskPressure
153m Normal NodeHasSufficientPID node/worker-0 Node worker-0 status is now: NodeHasSufficientPID
153m Warning Rebooted node/worker-0 Node worker-0 has been rebooted, boot id: 447c8830-4cb8-485f-a958-fc64f3ad36e6
153m Normal NodeNotReady node/worker-0 Node worker-0 status is now: NodeNotReady
153m Normal NodeAllocatableEnforced node/worker-0 Updated Node Allocatable limit across pods
153m Normal NodeReady node/worker-0 Node worker-0 status is now: NodeReady
43m Normal RegisteredNode node/worker-0 Node worker-0 event: Registered Node worker-0 in Controller
40m Normal RegisteredNode node/worker-0 Node worker-0 event: Registered Node worker-0 in Controller
36m Normal Drain node/worker-0 Draining node to update config.
36m Normal NodeNotSchedulable node/worker-0 Node worker-0 status is now: NodeNotSchedulable
34m Normal OSUpdateStarted node/worker-0
34m Normal OSUpdateStaged node/worker-0 Changes to OS staged
34m Normal PendingConfig node/worker-0 Written pending config rendered-worker-d048988f4915c29580fcd159da4c91bf
34m Normal Reboot node/worker-0 Node will reboot into config rendered-worker-d048988f4915c29580fcd159da4c91bf
33m Normal NodeNotReady node/worker-0 Node worker-0 status is now: NodeNotReady
28m Normal Starting node/worker-0 Starting kubelet.
28m Normal NodeHasSufficientMemory node/worker-0 Node worker-0 status is now: NodeHasSufficientMemory
28m Normal NodeHasNoDiskPressure node/worker-0 Node worker-0 status is now: NodeHasNoDiskPressure
28m Normal NodeHasSufficientPID node/worker-0 Node worker-0 status is now: NodeHasSufficientPID
28m Warning Rebooted node/worker-0 Node worker-0 has been rebooted, boot id: becd0304-0c9a-4028-80ca-b655f21b960c
28m Normal NodeNotReady node/worker-0 Node worker-0 status is now: NodeNotReady
28m Normal NodeNotSchedulable node/worker-0 Node worker-0 status is now: NodeNotSchedulable
28m Normal NodeAllocatableEnforced node/worker-0 Updated Node Allocatable limit across pods
28m Normal NodeReady node/worker-0 Node worker-0 status is now: NodeReady
27m Normal NodeDone node/worker-0 Setting node worker-0, currentConfig rendered-worker-d048988f4915c29580fcd159da4c91bf to Done
27m Normal NodeSchedulable node/worker-0 Node worker-0 status is now: NodeSchedulable
4h53m Normal RegisteredNode node/worker-1 Node worker-1 event: Registered Node worker-1 in Controller
4h52m Normal Starting node/worker-1 openshift-sdn done initializing node networking.
4h52m Normal NodeDone node/worker-1 Setting node worker-1, currentConfig rendered-worker-aa2999cca8e237b2a24cf4c1d5123a72 to Done
3h35m Normal Drain node/worker-1 Draining node to update config.
3h35m Normal NodeNotSchedulable node/worker-1 Node worker-1 status is now: NodeNotSchedulable
3h29m Normal OSUpdateStarted node/worker-1
3h29m Normal OSUpdateStaged node/worker-1 Changes to OS staged
3h46m Normal PendingConfig node/worker-1 Written pending config rendered-worker-53036b57fbb35b65691bd0423ad209ef
3h46m Normal SkipReboot node/worker-1 Config changes do not require reboot.
3h46m Normal NodeDone node/worker-1 Setting node worker-1, currentConfig rendered-worker-53036b57fbb35b65691bd0423ad209ef to Done
3h29m Normal NodeSchedulable node/worker-1 Node worker-1 status is now: NodeSchedulable
3h29m Normal PendingConfig node/worker-1 Written pending config rendered-worker-b9fd2121252e22fbaa0bcdd29b67f5eb
3h29m Normal SkipReboot node/worker-1 Config changes do not require reboot. Service crio was reloaded.
3h29m Normal NodeDone node/worker-1 Setting node worker-1, currentConfig rendered-worker-b9fd2121252e22fbaa0bcdd29b67f5eb to Done
178m Normal RegisteredNode node/worker-1 Node worker-1 event: Registered Node worker-1 in Controller
170m Normal RegisteredNode node/worker-1 Node worker-1 event: Registered Node worker-1 in Controller
160m Normal RegisteredNode node/worker-1 Node worker-1 event: Registered Node worker-1 in Controller
149m Normal NodeNotReady node/worker-1 Node worker-1 status is now: NodeNotReady
146m Normal Starting node/worker-1 Starting kubelet.
146m Normal NodeHasSufficientMemory node/worker-1 Node worker-1 status is now: NodeHasSufficientMemory
146m Normal NodeHasNoDiskPressure node/worker-1 Node worker-1 status is now: NodeHasNoDiskPressure
146m Normal NodeHasSufficientPID node/worker-1 Node worker-1 status is now: NodeHasSufficientPID
146m Warning Rebooted node/worker-1 Node worker-1 has been rebooted, boot id: 385523b8-f1c6-4c98-aff7-6016ab9d6bc0
146m Normal NodeNotReady node/worker-1 Node worker-1 status is now: NodeNotReady
146m Normal NodeAllocatableEnforced node/worker-1 Updated Node Allocatable limit across pods
146m Normal NodeReady node/worker-1 Node worker-1 status is now: NodeReady
43m Normal RegisteredNode node/worker-1 Node worker-1 event: Registered Node worker-1 in Controller
40m Normal RegisteredNode node/worker-1 Node worker-1 event: Registered Node worker-1 in Controller
27m Normal Drain node/worker-1 Draining node to update config.
27m Normal NodeNotSchedulable node/worker-1 Node worker-1 status is now: NodeNotSchedulable
25m Normal OSUpdateStarted node/worker-1
25m Normal OSUpdateStaged node/worker-1 Changes to OS staged
25m Normal PendingConfig node/worker-1 Written pending config rendered-worker-d048988f4915c29580fcd159da4c91bf
25m Normal Reboot node/worker-1 Node will reboot into config rendered-worker-d048988f4915c29580fcd159da4c91bf
25m Normal NodeNotReady node/worker-1 Node worker-1 status is now: NodeNotReady
20m Normal Starting node/worker-1 Starting kubelet.
20m Normal NodeHasSufficientMemory node/worker-1 Node worker-1 status is now: NodeHasSufficientMemory
20m Normal NodeHasNoDiskPressure node/worker-1 Node worker-1 status is now: NodeHasNoDiskPressure
20m Normal NodeHasSufficientPID node/worker-1 Node worker-1 status is now: NodeHasSufficientPID
20m Warning Rebooted node/worker-1 Node worker-1 has been rebooted, boot id: 16df475e-a341-4e96-ae65-af47294868ac
20m Normal NodeNotReady node/worker-1 Node worker-1 status is now: NodeNotReady
20m Normal NodeNotSchedulable node/worker-1 Node worker-1 status is now: NodeNotSchedulable
20m Normal NodeAllocatableEnforced node/worker-1 Updated Node Allocatable limit across pods
20m Normal NodeReady node/worker-1 Node worker-1 status is now: NodeReady
19m Normal NodeDone node/worker-1 Setting node worker-1, currentConfig rendered-worker-d048988f4915c29580fcd159da4c91bf to Done
19m Normal NodeSchedulable node/worker-1 Node worker-1 status is now: NodeSchedulable
3h53m Normal SetDesiredConfig machineconfigpool/worker Targeted node worker-0 to config rendered-worker-53036b57fbb35b65691bd0423ad209ef
3h52m Normal SetDesiredConfig machineconfigpool/worker Targeted node worker-1 to config rendered-worker-53036b57fbb35b65691bd0423ad209ef
3h37m Normal SetDesiredConfig machineconfigpool/worker Targeted node worker-0 to config rendered-worker-b9fd2121252e22fbaa0bcdd29b67f5eb
3h35m Normal SetDesiredConfig machineconfigpool/worker Targeted node worker-1 to config rendered-worker-b9fd2121252e22fbaa0bcdd29b67f5eb
36m Normal SetDesiredConfig machineconfigpool/worker Targeted node worker-0 to config rendered-worker-d048988f4915c29580fcd159da4c91bf
27m Normal SetDesiredConfig machineconfigpool/worker Targeted node worker-1 to config
```
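One way to check the stall against the events above is to compare, for a single rendered config, which nodes the pool targeted (`SetDesiredConfig`) and which nodes reported `NodeDone` for it. The following is only a minimal sketch: `config_progress` is a hypothetical helper name, and it assumes the default `oc get events` column layout shown above (age, type, reason, object, message) and the exact message wording that appears in this output.

```shell
# Hypothetical helper (not part of oc): reads `oc get events` text on stdin.
# Argument 1 is a rendered config name. For every node the pool targeted with
# that config, prints "<node> done" or "<node> pending".
config_progress() {
  awk -v cfg="$1" '
    # "Targeted node <node> to config <cfg>": field 7 is the node name.
    $3 == "SetDesiredConfig" && $NF == cfg { targeted[$7] = 1 }
    # "Setting node <node>, currentConfig <cfg> to Done": field 9 is the config.
    $3 == "NodeDone" && $9 == cfg {
      node = $4
      sub(/^node\//, "", node)   # object column looks like node/master-0
      finished[node] = 1
    }
    END {
      for (n in targeted)
        print n, ((n in finished) ? "done" : "pending")
    }
  ' | sort
}
```

With the events above saved to a file (the file name is up to you), `config_progress rendered-master-8b39ed05a2ffacc7c762c92801ed688d < events.txt` should list master-0 as done and master-1 as pending. master-2 does not appear at all because the pool never targeted it with that config.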
Please add a must gather from this cluster.

> Even if MCO update for 1 node is stuck, it should pick up other nodes until the error in the faulty node is fixed.

This is false.

Sorry hit send too soon. We don't keep rolling out to other nodes for safety. But I'd like to see a must gather to get to the bottom of what's happening as there seem to be a few errors in the info pasted above and more detailed logs in the must gather will help us figure it out.

pushed to 4.8, as we will support UPI clusters then.

Since the MCO updates got stuck mid way, cluster is unhealthy and must-gather fails to complete as shown below.

```
[root@arc-npv-ovn-bastion ~]# oc adm must-gather
[must-gather ] OUT Using must-gather plug-in image: quay.io/openshift-release-dev/ocp-v4.0-art-dev@sha256:b990e9178c45dd579115ef7f51b4bbfb79f1fa8c6bde525c1ed0b9718fdf39f7
[must-gather ] OUT namespace/openshift-must-gather-f4vqb created
[must-gather ] OUT clusterrolebinding.rbac.authorization.k8s.io/must-gather-bjt7q created
[must-gather ] OUT pod for plug-in image quay.io/openshift-release-dev/ocp-v4.0-art-dev@sha256:b990e9178c45dd579115ef7f51b4bbfb79f1fa8c6bde525c1ed0b9718fdf39f7 created
[must-gather-mgsrd] OUT gather logs unavailable: Get "https://9.114.98.140:10250/containerLogs/openshift-must-gather-f4vqb/must-gather-mgsrd/gather?follow=true": x509: certificate signed by unknown authority
[must-gather-mgsrd] OUT waiting for gather to complete
[must-gather-mgsrd] OUT downloading gather output
WARNING: cannot use rsync: rsync not available in container
WARNING: cannot use tar: tar not available in container
[must-gather-mgsrd] OUT gather output not downloaded: No available strategies to copy.
[must-gather-mgsrd] OUT
[must-gather ] OUT clusterrolebinding.rbac.authorization.k8s.io/must-gather-bjt7q deleted
[must-gather ] OUT namespace/openshift-must-gather-f4vqb deleted
error: unable to download output from pod must-gather-mgsrd: No available strategies to copy.
```

Please let me know if there is any specific log or command output you want me to pick up.

@pliu regarding your comment above "pushed to 4.8, as we will support UPI clusters then.", is this now in plan for 4.8?

(In reply to lmcfadde from comment #8)
> @pliu regarding your comment above "pushed to 4.8, as we will
> support UPI clusters then.", is this now in plan for 4.8?

Yes, I'm working on it.

Hi @Peng, since the target release for this bug is 4.8 and this bug and BZ 1937594 are preventing the regression testing of OVNKube from completion, should we set the "Blocker?" flag to "Blocker+" for this bug instead for 4.8?

@pliu will the fix for https://bugzilla.redhat.com/show_bug.cgi?id=1937594 also fix this BZ and is this still considered a priority?

Yes, I think both BZ can be fixed after the PR merged. Please follow the new migration procedure https://github.com/openshift/openshift-docs/pull/31089

Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Moderate: OpenShift Container Platform 4.8.2 bug fix and security update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2021:2438