Bug 1809327 - Confusing error message from Multus
Summary: Confusing error message from Multus
Keywords:
Status: CLOSED DUPLICATE of bug 1777040
Alias: None
Product: OpenShift Container Platform
Classification: Red Hat
Component: Networking
Version: 4.4
Hardware: Unspecified
OS: Unspecified
Priority: unspecified
Severity: medium
Target Milestone: ---
Target Release: 4.5.0
Assignee: Douglas Smith
QA Contact: Weibin Liang
URL:
Whiteboard: SDN-CI-IMPACT,SDN-STALE
Depends On:
Blocks:
 
Reported: 2020-03-02 20:53 UTC by Ben Bennett
Modified: 2020-05-08 23:19 UTC
CC List: 1 user

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2020-05-08 23:19:16 UTC
Target Upstream Version:
Embargoed:



Description Ben Bennett 2020-03-02 20:53:54 UTC
Description of problem:

[As seen in https://bugzilla.redhat.com/show_bug.cgi?id=1804681]

level=error msg="Cluster operator kube-controller-manager Degraded is True with MultipleConditionsMatching: NodeControllerDegraded: The master nodes not ready: node \"ci-op-n856n-m-0.c.openshift-gce-devel-ci.internal\" not ready since 2020-02-19 05:31:39 +0000 UTC because KubeletNotReady (runtime network not ready: NetworkReady=false reason:NetworkPluginNotReady message:Network plugin returns error: Missing CNI default network)\nNodeInstallerDegraded: 1 nodes are failing on revision 3:\nNodeInstallerDegraded: \nInstallerPodContainerWaitingDegraded: Pod \"installer-4-ci-op-n856n-m-2.c.openshift-gce-devel-ci.internal\" on node \"ci-op-n856n-m-2.c.openshift-gce-devel-ci.internal\" container \"installer\" is waiting for 36m3.68513657s because \"\"\nInstallerPodNetworkingDegraded: Pod \"installer-4-ci-op-n856n-m-2.c.openshift-gce-devel-ci.internal\" on node \"ci-op-n856n-m-2.c.openshift-gce-devel-ci.internal\" observed degraded networking: Failed create pod sandbox: rpc error: code = Unknown desc = failed to create pod network sandbox k8s_installer-4-ci-op-n856n-m-2.c.openshift-gce-devel-ci.internal_openshift-kube-controller-manager_45778f91-2fc1-4cf7-a05c-6bc2dd79bf9f_0(15cdbe4051ec6ae29b1c89ee64ffa369f36d9fc3fbd846c4adc4352261e28937): Multus: error adding pod to network \"ovn-kubernetes\": delegateAdd: error invoking DelegateAdd - \"ovn-k8s-cni-overlay\": error in getting result from AddNetwork: CNI request failed with status 400: 'failed to get pod annotation: timed out waiting for the condition\nInstallerPodNetworkingDegraded: '"


Version-Release number of selected component (if applicable):

Seen in 4.3


Expected results:

The error message above should identify the problem more clearly. If it is just expected behavior because the pod has not started yet, it should say so; if something unrecoverable happened, we should be shouting more clearly that there is a networking problem.

The issue is that "Missing CNI default network" is expected when a node starts and pods are created before the networking is ready. But the message above looks a bit different and may be a separate problem. If it is, we need to make it clear what to look at.
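
For illustration only, a minimal sketch in Go of what a clearer message could look like (this is not the actual Multus code; the helper name classifyDelegateErr and the string match on the delegate error are assumptions). It rewords the delegate error so a reader can tell the expected "default network not ready yet" case apart from a hard failure:

package main

import (
	"errors"
	"fmt"
	"strings"
)

// classifyDelegateErr is a hypothetical helper (illustrative names only).
// It rewords the delegate error so a reader can distinguish "default network
// not ready yet" from an unrecoverable networking failure.
func classifyDelegateErr(err error) error {
	if err == nil {
		return nil
	}
	// In the log above, "timed out waiting for the condition" means the pod
	// annotation carrying the network status never appeared, i.e. the default
	// network was not ready when the sandbox was created.
	if strings.Contains(err.Error(), "timed out waiting for the condition") {
		return fmt.Errorf("default network not ready (pod network annotation missing); "+
			"this is expected briefly during node startup, but persistent failures "+
			"indicate a problem with the default network plugin: %w", err)
	}
	return fmt.Errorf("Multus delegate failed: %w", err)
}

func main() {
	orig := errors.New("failed to get pod annotation: timed out waiting for the condition")
	fmt.Println(classifyDelegateErr(orig))
}

Run against the error quoted above, this would print a message that names the likely cause instead of surfacing only the raw CNI failure.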


Additional info:

Comment 1 Douglas Smith 2020-05-05 19:47:29 UTC
I believe this is actually a duplicate of https://bugzilla.redhat.com/show_bug.cgi?id=1777040 -- which covers updating the "Missing CNI default network" language.

Comment 2 Alexander Constantinescu 2020-05-07 14:53:49 UTC
Hi

This bug has had no updates for a couple of months. What is the impact of this? Can we close it?

1) The "Missing CNI default network" message might have disappeared since.
2) If comment 1 is true, then this should be closed.

-Alex

Comment 3 Ben Bennett 2020-05-08 23:19:16 UTC

*** This bug has been marked as a duplicate of bug 1777040 ***

