Bug 1755910

Summary: Nodes become NotReady because the ovn-node pod fails to start with the error "No chassis ID configured" when using the OVN plugin
Product: OpenShift Container Platform
Component: Networking
Networking sub component: ovn-kubernetes
Version: 4.2.0
Target Release: 4.3.0
Status: CLOSED DUPLICATE
Severity: unspecified
Priority: unspecified
Hardware: Unspecified
OS: Unspecified
Reporter: Ben Bennett <bbennett>
Assignee: Casey Callendrello <cdc>
QA Contact: zhaozhanqi <zzhao>
CC: bbennett, cdc, dcbw, schoudha, zzhao
Clone Of: 1755784
Bug Depends On: 1755784
Last Closed: 2019-11-08 16:34:02 UTC

Description Ben Bennett 2019-09-26 12:44:41 UTC
+++ This bug was initially created as a clone of Bug #1755784 +++

Description of problem:

Nodes become NotReady on Azure with the OVN plugin because the CNI default network is missing.
When new nodes are added to the cluster through autoscaling, they stay in the NotReady state with the following message:

runtime network not ready: NetworkReady=false reason:NetworkPluginNotReady message:Network plugin returns error: Missing CNI default network
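
One way to spot affected nodes is to inspect the Ready condition in the output of `oc get nodes -o json`. The following is a minimal sketch, not part of the original report: the helper name `not_ready_nodes`, the node names, and the sample data are all hypothetical, and it assumes only the standard Kubernetes node status layout (`items[].status.conditions[]`).

```python
import json


def not_ready_nodes(nodes_json: str) -> dict:
    """Map each node whose Ready condition is not "True" to its condition message."""
    result = {}
    for node in json.loads(nodes_json).get("items", []):
        name = node["metadata"]["name"]
        conditions = node.get("status", {}).get("conditions", [])
        ready = next((c for c in conditions if c.get("type") == "Ready"), None)
        if ready is None:
            # A node that reports no Ready condition is treated as not ready.
            result[name] = "no Ready condition reported"
        elif ready.get("status") != "True":
            result[name] = ready.get("message", "")
    return result


# Hypothetical sample shaped like `oc get nodes -o json` output.
SAMPLE = json.dumps({
    "items": [
        {
            "metadata": {"name": "worker-a"},
            "status": {"conditions": [{"type": "Ready", "status": "True"}]},
        },
        {
            "metadata": {"name": "worker-b"},
            "status": {"conditions": [{
                "type": "Ready",
                "status": "False",
                "message": "runtime network not ready: NetworkReady=false",
            }]},
        },
    ]
})
```

With real cluster output, the JSON text from `oc get nodes -o json` would be passed in place of `SAMPLE`.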

Version-Release number of selected component (if applicable):

$ oc get clusterversion
NAME      VERSION                             AVAILABLE   PROGRESSING   SINCE   STATUS
version   4.2.0-0.nightly-2019-09-25-191732   True        False         4h21m   Cluster version is 4.2.0-0.nightly-2019-09-25-191732

How reproducible:



Steps to Reproduce:
1. Set up an IPI cluster on Azure with networkType: "OVNKubernetes"
2. Add nodes through autoscaling or manual scaling.

Actual results:

The ovnkube-node logs show the following panic:

# oc logs -n openshift-ovn-kubernetes ovnkube-node-xdzhq -c ovn-node
================== ovnkube.sh --- version: 3 ================
 ==================== command: ovn-node
 =================== hostname: shared-42-upgrade-j48bc-worker-centralus3-6m6qq
 =================== daemonset version 3
 =================== Image built from ovn-kubernetes ref: refs/heads/rhaos-4.2-rhel-7  commit: 93f3f5b6d94ebcc16b869697f84b5d6f988d7cf8
=============== ovn-node - (wait for ovs)
=============== ovn-node - (wait for ready_to_start_node)
ovn_nbdb tcp://10.0.0.5:9641   ovn_sbdb tcp://10.0.0.5:9642  ovn_nbdb_test tcp:10.0.0.5:9641
=============== ovn-node - (ovn-node  wait for ovn-controller.pid)
=============== ovn-node   --init-node
info: Waiting for process_ready ovnkube to come up, waiting 1s ...
panic: failed to localnet gateway: No chassis ID configured for node shared-42-upgrade-j48bc-worker-centralus3-6m6qq

goroutine 1 [running]:
main.runOvnKube(0xc00029a580, 0x0, 0x0)
	/go-controller/_output/go/src/github.com/ovn-org/ovn-kubernetes/go-controller/cmd/ovnkube/ovnkube.go:234 +0x9fe
main.main.func1(0xc00029a580, 0xc00029a580, 0xc000111cf7)
	/go-controller/_output/go/src/github.com/ovn-org/ovn-kubernetes/go-controller/cmd/ovnkube/ovnkube.go:100 +0x2b
github.com/ovn-org/ovn-kubernetes/go-controller/vendor/github.com/urfave/cli.HandleAction(0x126ec40, 0x14d3d50, 0xc00029a580, 0xc000284d80, 0x0)
	/go-controller/_output/go/src/github.com/ovn-org/ovn-kubernetes/go-controller/vendor/github.com/urfave/cli/app.go:502 +0xbe
github.com/ovn-org/ovn-kubernetes/go-controller/vendor/github.com/urfave/cli.(*App).Run(0xc000109880, 0xc0000ae000, 0xf, 0xf, 0x0, 0x0)
	/go-controller/_output/go/src/github.com/ovn-org/ovn-kubernetes/go-controller/vendor/github.com/urfave/cli/app.go:268 +0x5b7
main.main()
	/go-controller/_output/go/src/github.com/ovn-org/ovn-kubernetes/go-controller/cmd/ovnkube/ovnkube.go:103 +0x59b
info: Waiting for process_ready ovnkube to come up, waiting 5s ...
[... the line above repeats 30 more times ...]
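
When triaging several ovnkube-node pods, it can help to scan their logs for this specific panic. The sketch below is illustrative only and not part of the original report: the function name `find_chassis_panic` and the regular expression are assumptions derived from the panic line in the log above.

```python
import re
from typing import Optional

# Pattern modeled on the panic line seen in the ovn-node container log.
CHASSIS_PANIC = re.compile(
    r"^panic: .*No chassis ID configured for node (?P<node>\S+)"
)


def find_chassis_panic(log_text: str) -> Optional[str]:
    """Return the node name from the first matching panic line, or None."""
    for line in log_text.splitlines():
        match = CHASSIS_PANIC.match(line.strip())
        if match:
            return match.group("node")
    return None


# Excerpt copied from the log in this report.
LOG_EXCERPT = """\
=============== ovn-node   --init-node
info: Waiting for process_ready ovnkube to come up, waiting 1s ...
panic: failed to localnet gateway: No chassis ID configured for node shared-42-upgrade-j48bc-worker-centralus3-6m6qq
"""

if __name__ == "__main__":
    print(find_chassis_panic(LOG_EXCERPT))
```

The log text could come from `oc logs -n openshift-ovn-kubernetes <pod> -c ovn-node`, as in the command shown earlier in this report.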


Expected results:

Newly added nodes should reach the Ready state.

Additional info:

Comment 1 Dan Williams 2019-10-17 15:16:43 UTC
Fixed upstream by https://github.com/ovn-org/ovn-kubernetes/pull/799; the fix should already be in 4.3.

Comment 2 Ben Bennett 2019-11-08 16:34:02 UTC

*** This bug has been marked as a duplicate of bug 1755784 ***