Bug 1881188

Summary: sdn crashloops
Product: OpenShift Container Platform Reporter: David Eads <deads>
Component: NetworkingAssignee: Ben Bennett <bbennett>
Networking sub component: openshift-sdn QA Contact: zhaozhanqi <zzhao>
Status: CLOSED DUPLICATE Docs Contact:
Severity: high    
Priority: high CC: anbhat
Version: 4.6   
Target Milestone: ---   
Target Release: 4.6.0   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2020-09-22 13:06:58 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description David Eads 2020-09-21 18:09:48 UTC
This shows up as `Cluster operator network Degraded is True with RolloutHung: DaemonSet \"openshift-sdn/sdn\" rollout is not making progress`

See the example in https://prow.ci.openshift.org/view/gs/origin-ci-test/logs/release-openshift-ocp-installer-e2e-aws-4.6/1307687521215320064#1:build-log.txt%3A47

 "lastState": {
                            "terminated": {
                                "containerID": "cri-o://7d5502e2676ccf111ae75a4bbccb57cf0d7c5d9a437e748374706783e2873b6b",
                                "exitCode": 255,
                                "finishedAt": "2020-09-20T18:03:44Z",
                                "message": "ontroller.go:139] [SDN setup] full SDN setup required (plugin is not setup)\nI0920 18:03:00.890664  297205 ovs.go:158] Error executing ovs-vsctl: 2020-09-20T18:03:00Z|00002|fatal_signal|WARN|terminating with signal 14 (Alarm clock)\nI0920 18:03:31.429678  297205 ovs.go:158] Error executing ovs-vsctl: 2020-09-20T18:03:31Z|00002|fatal_signal|WARN|terminating with signal 14 (Alarm clock)\nI0920 18:03:31.939935  297205 ovs.go:158] Error executing ovs-ofctl: ovs-ofctl: /var/run/openvswitch/br0.mgmt: failed to open socket (Connection refused)\nI0920 18:03:32.444816  297205 ovs.go:158] Error executing ovs-ofctl: ovs-ofctl: /var/run/openvswitch/br0.mgmt: failed to open socket (Connection refused)\nI0920 18:03:33.074465  297205 ovs.go:158] Error executing ovs-ofctl: ovs-ofctl: /var/run/openvswitch/br0.mgmt: failed to open socket (Connection refused)\nI0920 18:03:33.861792  297205 ovs.go:158] Error executing ovs-ofctl: ovs-ofctl: /var/run/openvswitch/br0.mgmt: failed to open socket (Connection refused)\nI0920 18:03:34.842590  297205 ovs.go:158] Error executing ovs-ofctl: ovs-ofctl: /var/run/openvswitch/br0.mgmt: failed to open socket (Connection refused)\nI0920 18:03:36.068178  297205 ovs.go:158] Error executing ovs-ofctl: ovs-ofctl: /var/run/openvswitch/br0.mgmt: failed to open socket (Connection refused)\nI0920 18:03:37.598544  297205 ovs.go:158] Error executing ovs-ofctl: ovs-ofctl: /var/run/openvswitch/br0.mgmt: failed to open socket (Connection refused)\nI0920 18:03:39.510458  297205 ovs.go:158] Error executing ovs-ofctl: ovs-ofctl: /var/run/openvswitch/br0.mgmt: failed to open socket (Connection refused)\nI0920 18:03:41.899215  297205 ovs.go:158] Error executing ovs-ofctl: ovs-ofctl: /var/run/openvswitch/br0.mgmt: failed to open socket (Connection refused)\nI0920 18:03:44.884228  297205 ovs.go:158] Error executing ovs-ofctl: ovs-ofctl: /var/run/openvswitch/br0.mgmt: failed to open socket (Connection refused)\nF0920 18:03:44.884277  297205 cmd.go:111] Failed to start sdn: node SDN setup failed: timed out waiting for the condition\n",
                                "reason": "Error",
                                "startedAt": "2020-09-20T18:02:17Z"
                            }
                        },
                        "name": "sdn",
                        "ready": false,
                        "restartCount": 33,
                        "started": false,
                        "state": {
                            "waiting": {
                                "message": "back-off 5m0s restarting failed container=sdn pod=sdn-8fjrn_openshift-sdn(05fa1220-60de-4dff-aad5-502d367958dd)",
                                "reason": "CrashLoopBackOff"
                            }
                        }

Comment 1 Aniket Bhat 2020-09-21 18:11:21 UTC
Looks like a dup of 1874696.

Comment 2 Ben Bennett 2020-09-22 13:06:58 UTC

*** This bug has been marked as a duplicate of bug 1874696 ***