Description of problem:
After installation of 4.5.5, the cluster faces stability issues when attempting to deploy pods or applications.

Steps to Reproduce:
1. Install via IPv6 provisioning and baremetal network.
2. Install the catalog for the offline OperatorHub.

Actual results:
Failed deployment

Expected results:
Successful deployment

Additional info:

OVN Node logs:

I0826 13:51:52.665052 23048 cni.go:157] [openshift-marketplace/redhat-operators-catalog-4nkfl] CNI request &{ADD openshift-marketplace redhat-operators-catalog-4nkfl 47407e500d139206d880bf795b3b00091db94c6629a9557bbb6f5865ecda047a /var/run/netns/7029e80f-61cb-4d17-a688-33269100ee23 eth0 0xc000846e10}, result "", err failed to configure pod interface: failed to open netns "/var/run/netns/7029e80f-61cb-4d17-a688-33269100ee23": failed to Statfs "/var/run/netns/7029e80f-61cb-4d17-a688-33269100ee23": no such file or directory

I0826 13:51:52.814902 23048 request.go:565] Throttling request took 527.975409ms, request: GET:https://api-int.mgmt-spoke2.e2e.bos.redhat.com:6443/api/v1/namespaces/openshift-marketplace/pods/redhat-operators-catalog-4nkfl

I0826 13:51:52.818495 23048 cni.go:157] [openshift-marketplace/redhat-operators-catalog-4nkfl] CNI request &{ADD openshift-marketplace redhat-operators-catalog-4nkfl 0dbc5db92811559fc3b470797bed24f63fb7c6905ce5376e5e33c34311c8bc8d /var/run/netns/74b2ce67-52f2-4163-82c2-af40ff9bc31f eth0 0xc000846b40}, result "", err failed to configure pod interface: failed to open netns "/var/run/netns/74b2ce67-52f2-4163-82c2-af40ff9bc31f": failed to Statfs "/var/run/netns/74b2ce67-52f2-4163-82c2-af40ff9bc31f": no such file or directory

I0826 13:51:54.093247 23048 cni.go:157] [openshift-marketplace/redhat-operators-catalog-4nkfl] CNI request &{ADD openshift-marketplace redhat-operators-catalog-4nkfl fba375004b12c2866b3383c8c4649dcde4e8f211444b4ee67baa074b3899bd33 /var/run/netns/dd3cdf31-6f2e-4676-b21c-61a432f56527 eth0 0xc0008462d0}, result "{\"Result\":{\"interfaces\":[{\"name\":\"fba375004b12c28\",\"mac\":\"c2:f7:f4:9e:47:a0\"},{\"name\":\"eth0\",\"mac\":\"6e:78:6b:00:00:90\",\"sandbox\":\"/var/run/netns/dd3cdf31-6f2e-4676-b21c-61a432f56527\"}],\"ips\":[{\"version\":\"6\",\"interface\":1,\"address\":\"fd01::3:6c78:6bff:fe00:90/64\",\"gateway\":\"fd01:0:0:3::1\"}],\"dns\":{}},\"PodIFInfo\":null}", err <nil>

Event logs from deployment namespace:

Failed to create pod sandbox: rpc error: code = Unknown desc = failed to create pod network sandbox k8s_redhat-operators-catalog-4nkfl_openshift-marketplace_9e27afb9-028b-459f-82ac-62405950ad0d_0(67d1475d35019a670e54f78da022f07b8172e4d459a5ea6537528c3fee5932ca): netplugin failed with no error message

Can provide kubeconfig or any other logs.
Hi,

Could you provide some logs, a must-gather, or a kubeconfig?

Thanks in advance!
/Alex
It appears that all of the failing pods are on master-1, where the kubelet has stopped posting node status. So something is wrong on that node, and it cannot easily be debugged from within OCP, since you can't schedule a pod there.
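For anyone triaging a similar node log, here is a minimal Python sketch that tallies successful and failed CNI requests per pod. It assumes the line format matches the "CNI request" samples quoted in the description (a `[namespace/pod]` prefix and a trailing `, err ...` field, with `<nil>` meaning success). The function names `parse_cni_line` and `summarize` are made up for illustration and are not part of ovn-kubernetes.

```python
import re
from collections import Counter

# Prefix of the "CNI request" lines quoted in the description:
# "[<namespace>/<pod>] CNI request &{<command> ..."
# (format assumed from those samples only)
CNI_HEADER = re.compile(r"\[([^/\]]+)/([^\]]+)\] CNI request &\{(\w+) ")


def parse_cni_line(line):
    """Return a dict for a CNI request log line, or None for any other line."""
    match = CNI_HEADER.search(line)
    if match is None:
        return None
    namespace, pod, command = match.groups()
    # The error text follows the last ", err " marker; "<nil>" means success.
    _, marker, err = line.rpartition(", err ")
    if not marker:
        return None
    err = err.strip()
    return {
        "namespace": namespace,
        "pod": pod,
        "command": command,
        "error": None if err == "<nil>" else err,
    }


def summarize(lines):
    """Count CNI requests per (namespace, pod, command, outcome)."""
    counts = Counter()
    for line in lines:
        entry = parse_cni_line(line)
        if entry is None:
            continue  # skip unrelated lines such as "Throttling request"
        outcome = "ok" if entry["error"] is None else "failed"
        counts[(entry["namespace"], entry["pod"], entry["command"], outcome)] += 1
    return counts
```

Passing an open log file (or any iterable of lines) to `summarize` gives a quick view of which pods keep failing their `ADD` requests. The sketch only reads the log text, so it cannot say which node the pods ran on.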
I'm going to close this BZ as a duplicate of https://bugzilla.redhat.com/show_bug.cgi?id=1873114

Thank you all for looking into it so quickly!

*** This bug has been marked as a duplicate of bug 1873114 ***