Description of problem: Cluster is failing to come up frequently with the following error when OVNKubernetes is used: Cluster operator etcd Degraded is True with InstallerPodContainerWaiting_ContainerCreating::InstallerPodNetworking_FailedCreatePodSandBox: InstallerPodContainerWaitingDegraded: Pod \"installer-3-ip-10-0-214-132.us-east-2.compute.internal\" on node \"ip-10-0-214-132.us-east-2.compute.internal\" container \"installer\" is waiting for 29m13.564092979s because \"\"\nInstallerPodNetworkingDegraded: Pod \"installer-3-ip-10-0-214-132.us-east-2.compute.internal\" on node \"ip-10-0-214-132.us-east-2.compute.internal\" observed degraded networking: Failed to create pod sandbox: rpc error: code = Unknown desc = failed to create pod network sandbox k8s_installer-3-ip-10-0-214-132.us-east-2.compute.internal_openshift-etcd_344b6f2c-def3-4be4-971b-e4e0257a4d80_0(b6409b47cb699f331600ad52233d5ccb808d8e6c98361c33c2edea22a620ccc1): Multus: [openshift-etcd/installer-3-ip-10-0-214-132.us-east-2.compute.internal]: error adding container to network \"ovn-kubernetes\": delegateAdd: error invoking confAdd - \"ovn-k8s-cni-overlay\": error in getting result from AddNetwork: CNI request failed with status 400: '[openshift-etcd/installer-3-ip-10-0-214-132.us-east-2.compute.internal] failed to get pod annotation: timed out waiting for the condition\nInstallerPodNetworkingDegraded: We're noticing this frequently in our CI as well: https://search.apps.build01.ci.devcluster.openshift.com/?search=error+in+getting+result+from+AddNetwork%3A+CNI+request+failed+with+status+400%3A&maxAge=48h&context=1&type=bug%2Bjunit&name=&maxMatches=5&maxBytes=20971520&groupBy=job Version-Release number of selected component (if applicable): How reproducible: Very frequently. Steps to Reproduce: 1. 2. 3. Actual results: OCP cluster creation fails Expected results: OCP comes up with OVNKubernetes as network type Additional info:
Worth to wait for https://github.com/openshift/machine-config-operator/pull/1830 to present in CI or nightly and check installation again.
CI has already picked it up and we still the issue in our PRs. The nightly has still not picked it up but that should not affect the PRs.
Thanks Aravindh. I had a successful install on 4.6.0-0.ci-2020-06-22-171752 for just networkType: OVNKubernetes on AWS. Does this bug pertained to hybrid config along with OVNKubrnetes networktype? So two questions 1) Was this bug opened on a specific platform? 2) Did your install-config used hybrid config as well?
> Thanks Aravindh. I had a successful install on 4.6.0-0.ci-2020-06-22-171752 > for just networkType: OVNKubernetes on AWS. Does this bug pertained to > hybrid config along with OVNKubrnetes networktype? This pertains to OVN hybrid. > > So two questions > > 1) Was this bug opened on a specific platform? Given we are seeing this in CI, it would be OVN hybrid on AWS. > 2) Did your install-config used hybrid config as well? Yes
Correcting the bug title to reflect actual issue
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (OpenShift Container Platform 4.6 GA Images), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2020:4196