Note: This bug is displayed in read-only format because the product is no longer active in Red Hat Bugzilla.

Bug 2072418

Summary: Windows node failed to become ready after a while.
Product: OpenShift Container Platform Reporter: pawankum <pawankum>
Component: Documentation Assignee: Michael Burke <mburke>
Status: CLOSED CURRENTRELEASE QA Contact: Xiaoli Tian <xtian>
Severity: medium Docs Contact: Latha S <lmurthy>
Priority: medium    
Version: 4.9 CC: aliryan, jflorist, jtanenba, jvaldes, mankulka, mohashai
Target Milestone: ---   
Target Release: 4.13.0   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2022-12-09 20:48:42 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description pawankum 2022-04-06 09:14:57 UTC
Must gather logs:

1. Issue: 
2022-04-04T22:31:03.565344145Z {"level":"error","ts":1649111463.5605578,"logger":"controller-runtime.manager.controller.configmap","msg":"Reconciler error","reconciler group":"","reconciler kind":"ConfigMap","name":"windows-instances","namespace":"openshift-windows-machine-config-operator","error":"error configuring host with address 10.49.XXX.xxx: configuring node network failed: error waiting for k8s.ovn.org/hybrid-overlay-distributed-router-gateway-mac node annotation for usddceqap71215: timeout waiting for k8s.ovn.org/hybrid-overlay-distributed-router-gateway-mac node annotation: timed out waiting for the condition","errorVerbose":"timed out waiting for the condition\ntimeout waiting for k8s.ovn.org/hybrid-overlay-distributed-router-gateway-mac node annotation\ngithub.com/openshift/windows-machine-config-operator/pkg/nodeconfig.(*nodeConfig).waitForNodeAnnotation\n\t/remote-source/build/windows-machine-config-operator/pkg/nodeconfig/nodeconfig.go:352\ngithub.com/openshift/windows-machine-config-operator/pkg/nodeconfig.(*nodeConfig).configureNetwork\n\t/remote-source/build/windows-machine-config-operator/pkg/nodeconfig/nodeconfig.go:283\ngithub.com


2. WMCO & OpenShift Version 
OCP 4.9 on BM UPI BYOH.

3. Platform - AWS/Azure/vSphere/Platform=none
Baremetal (Platform=none)
4. If the platform is vSphere, what is the VMware tools version?
5. Is it a new test case or an old test case?
   If it is an old test case, is it a regression or tested for the first time?
   Is it platform-specific or consistent across all platforms?
6. Steps to Reproduce
7. Actual Result and Expected Result
Actual: the Windows node does not become Ready. Expected: the node should be in the Ready state.
8. Has a possible workaround been tried? Is there a way to recover from the issue?
Not working.

9. Logs
WMCO pod logs:
2022-04-04T22:18:17.697597006Z {"level":"info","ts":1649110697.6975124,"logger":"wc 10.49.168.80","msg":"configure","service":"hybrid-overlay-node","args":"--node usddceqap71215 --k8s-kubeconfig c:\\k\\kubeconfig --windows-service --logfile C:\\var\\log\\hybrid-overlay\\hybrid-overlay.log"}
2022-04-04T22:20:18.300356816Z {"level":"info","ts":1649110818.2968366,"logger":"wc 10.49.168.80","msg":"configured","service":"hybrid-overlay-node","args":"--node usddceqap71215 --k8s-kubeconfig c:\\k\\kubeconfig --windows-service --logfile C:\\var\\log\\hybrid-overlay\\hybrid-overlay.log"}
2022-04-04T22:31:03.565344145Z {"level":"error","ts":1649111463.5605578,"logger":"controller-runtime.manager.controller.configmap","msg":"Reconciler error","reconciler group":"","reconciler kind":"ConfigMap","name":"windows-instances","namespace":"openshift-windows-machine-config-operator","error":"error configuring host with address 10.49.xx.xx: configuring node network failed: error waiting for k8s.ovn.org/hybrid-overlay-distributed-router-gateway-mac node annotation for usddceqap71215: timeout waiting for k8s.ovn.org/hybrid-overlay-distributed-router-gateway-mac node annotation: timed out waiting for the condition","errorVerbose":"timed out waiting for the condition\ntimeout waiting for k8s.ovn.org/hybrid-overlay-distributed-router-gateway-mac node annotation\ngithub.com/openshift/windows-machine-config-operator/pkg/nodeconfig.(*nodeConfig).waitForNodeAnnotation\n\t/remote-source/build/windows-machine-config-operator/pkg/nodeconfig/nodeconfig.go:352\ngithub.com/openshift/windows-machine-config-operator/pkg/nodeconfig.(*nodeConfig).configureNetwork\n\t/remote-source/build/windows-machine-config-operator/pkg/nodeconfig/nodeconfig.go:283\ngithub.com
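
To relate the entries above, the WMCO log lines can be read as an RFC 3339 timestamp followed by a JSON object. The following is a minimal sketch, not part of WMCO, assuming that layout holds for every line. The helper names parse_wmco_line and ts_to_utc are hypothetical. Because the error line is truncated in this report and is not valid JSON as pasted, its "ts" value is copied in as a literal rather than parsed.

```python
import json
from datetime import datetime, timezone


def parse_wmco_line(line):
    """Split '<timestamp> <JSON object>' into (timestamp string, dict).

    Hypothetical helper: assumes the first space separates the prefix
    timestamp from the JSON payload, as in the lines quoted in this bug.
    """
    stamp, _, payload = line.partition(" ")
    return stamp, json.loads(payload)


def ts_to_utc(ts):
    """Convert the epoch-seconds 'ts' field to a timezone-aware UTC datetime."""
    return datetime.fromtimestamp(ts, tz=timezone.utc)


# The complete "configured" entry from the WMCO pod logs in this report.
CONFIGURED_LINE = r'''2022-04-04T22:20:18.300356816Z {"level":"info","ts":1649110818.2968366,"logger":"wc 10.49.168.80","msg":"configured","service":"hybrid-overlay-node","args":"--node usddceqap71215 --k8s-kubeconfig c:\\k\\kubeconfig --windows-service --logfile C:\\var\\log\\hybrid-overlay\\hybrid-overlay.log"}'''

# "ts" of the "Reconciler error" entry, copied by hand because that line
# is cut off in the report and cannot be parsed as JSON.
ERROR_TS = 1649111463.5605578

stamp, entry = parse_wmco_line(CONFIGURED_LINE)
elapsed = ERROR_TS - entry["ts"]

print("service:", entry["service"], "msg:", entry["msg"])
print("configured at:", ts_to_utc(entry["ts"]).isoformat())
print("error at:     ", ts_to_utc(ERROR_TS).isoformat())
print("seconds between entries:", round(elapsed))
```

With the values quoted in this report, the gap between the hybrid-overlay-node service being reported as configured and the reconciler error is about 645 seconds (roughly 10 minutes 45 seconds). This only measures the distance between the two log entries; it does not by itself establish the operator's configured timeout.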


spec:
  cloudConfig:
    name: ''
  platformSpec:
    type: None
status:
  apiServerInternalURI: https://api-int.np-ocp.us.kworld.xxxxx.com:6443
  apiServerURL: https://api.np-ocp.us.kworld.xxxx.com:6443
  controlPlaneTopology: HighlyAvailable
  etcdDiscoveryDomain: ''
  infrastructureName: np-ocp-wj2xn
  infrastructureTopology: HighlyAvailable
  platform: None
  platformStatus:
    type: None


NAME            STATUS                    ROLES   AGE    VERSION                      INTERNAL-IP  EXTERNAL-IP  OS-IMAGE                      KERNEL-VERSION   CONTAINER-RUNTIME
usddceqap71215  Ready,SchedulingDisabled  worker  2h18m  v1.22.1-1739+c8538fcbd98efa  10.49.xx.xx  <none>       Windows Server 2019 Standard  10.0.17763.2686  docker://20.10.9


 Optional logs:
    Anything that can be useful to debug the issue.

Comment 14 jvaldes 2022-08-23 16:18:48 UTC
Still relevant and planned for a future sprint.

Comment 18 Red Hat Bugzilla 2023-09-18 04:34:53 UTC
The needinfo request[s] on this closed bug have been removed as they have been unresolved for 120 days