Bug 1888075
Summary: | 4.5.14 -> 4.6.rc3 in OVN cluster failed with controller version mismatch | ||
---|---|---|---|
Product: | OpenShift Container Platform | Reporter: | Mike Fiedler <mifiedle> |
Component: | Machine Config Operator | Assignee: | Antonio Murdaca <amurdaca> |
Status: | CLOSED DUPLICATE | QA Contact: | Michael Nguyen <mnguyen> |
Severity: | high | Docs Contact: | |
Priority: | high | ||
Version: | 4.6 | CC: | trozet, yanyang |
Target Milestone: | --- | ||
Target Release: | 4.6.0 | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | If docs needed, set a value | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2020-10-14 15:22:03 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Description
Mike Fiedler
2020-10-14 00:44:21 UTC
adding Tim to assess weather those nodes just lose connectivity (ready 0) 12 hours later the cluster is still in this state - I'll put a kubeconfig location in a private comment. root@ip-172-31-64-58: ~ # oc get nodes NAME STATUS ROLES AGE VERSION mffiedler1013b-zk5xv-m-0.c.openshift-qe.internal Ready master 16h v1.18.3+970c1b3 mffiedler1013b-zk5xv-m-1.c.openshift-qe.internal Ready master 16h v1.18.3+970c1b3 mffiedler1013b-zk5xv-m-2.c.openshift-qe.internal Ready master 16h v1.18.3+970c1b3 mffiedler1013b-zk5xv-worker-a-2cngf.c.openshift-qe.internal Ready worker 16h v1.19.0+d59ce34 mffiedler1013b-zk5xv-worker-b-hvln5.c.openshift-qe.internal Ready worker 16h v1.19.0+d59ce34 mffiedler1013b-zk5xv-worker-c-ghllj.c.openshift-qe.internal Ready worker 16h v1.19.0+d59ce34 root@ip-172-31-64-58: ~ # oc get co NAME VERSION AVAILABLE PROGRESSING DEGRADED SINCE authentication 4.6.0-rc.3 True False False 6h10m cloud-credential 4.6.0-rc.3 True False False 16h cluster-autoscaler 4.6.0-rc.3 True False False 16h config-operator 4.6.0-rc.3 True False False 15h console 4.6.0-rc.3 True False False 14h csi-snapshot-controller 4.6.0-rc.3 True False False 14h dns 4.6.0-rc.3 True False False 14h etcd 4.6.0-rc.3 True False False 16h image-registry 4.6.0-rc.3 True False False 16h ingress 4.6.0-rc.3 True False False 14h insights 4.6.0-rc.3 True False False 16h kube-apiserver 4.6.0-rc.3 True False False 16h kube-controller-manager 4.6.0-rc.3 True False False 16h kube-scheduler 4.6.0-rc.3 True False False 16h kube-storage-version-migrator 4.6.0-rc.3 True False False 13h machine-api 4.6.0-rc.3 True False False 16h machine-approver 4.6.0-rc.3 True False False 15h machine-config 4.5.14 False True True 14h marketplace 4.6.0-rc.3 True False False 14h monitoring 4.6.0-rc.3 True False False 14h network 4.6.0-rc.3 True False False 16h node-tuning 4.6.0-rc.3 True False False 14h openshift-apiserver 4.6.0-rc.3 True False False 14h openshift-controller-manager 4.6.0-rc.3 True False False 16h openshift-samples 4.6.0-rc.3 True False False 14h operator-lifecycle-manager 4.6.0-rc.3 True False False 16h operator-lifecycle-manager-catalog 4.6.0-rc.3 True False False 16h operator-lifecycle-manager-packageserver 4.6.0-rc.3 True False False 14h service-ca 4.6.0-rc.3 True False False 16h storage 4.6.0-rc.3 True False False 14h This looks the same as the other upgrade issues with OVN, the node running 4.5 has a screwed up br-local bridge with an extra patch port so kapi access wont work: [root@mffiedler1013b-zk5xv-m-0 ~]# ovs-vsctl show df305a14-74e2-4694-8e42-bebcc55fe21d Bridge br-local Port patch-lnet-node_local_switch-to-br-int Interface patch-lnet-node_local_switch-to-br-int type: patch options: {peer=patch-br-int-to-lnet-node_local_switch} Port ovn-k8s-gw0 Interface ovn-k8s-gw0 type: internal Port br-local Interface br-local type: internal Port patch--to-br-int Interface patch--to-br-int type: patch options: {peer=patch-br-int-to-} Port patch-br-local_mffiedler1013b-zk5xv-m-0.c.openshift-qe.internal-to-br-int Interface patch-br-local_mffiedler1013b-zk5xv-m-0.c.openshift-qe.internal-to-br-int type: patch options: {peer=patch-br-int-to-br-local_mffiedler1013b-zk5xv-m-0.c.openshift-qe.internal} *** This bug has been marked as a duplicate of bug 1880591 *** Removing UpgradeBlocker from this older bug, to remove it from the suspect queue described in [1]. If you feel like this bug still needs to be a suspect, please add keyword again. [1]: https://github.com/openshift/enhancements/pull/475 |