Note: This bug is displayed in read-only format because the product is no longer active in Red Hat Bugzilla.

Bug 2028159

Summary: OVN migration to 2nd interface on IPv6 with bond fails
Product: OpenShift Container Platform Reporter: Victor Voronkov <vvoronko>
Component: NetworkingAssignee: Jaime Caamaño Ruiz <jcaamano>
Networking sub component: ovn-kubernetes QA Contact: Anurag saxena <anusaxen>
Status: CLOSED DUPLICATE Docs Contact:
Severity: high    
Priority: medium CC: aos-bugs, bnemec, bpickard, eglottma, jcaamano, rhalle, trozet, yprokule
Version: 4.8Keywords: Reopened
Target Milestone: ---Flags: vvoronko: needinfo-
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2022-11-14 16:49:10 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On: 2100181    
Bug Blocks:    

Description Victor Voronkov 2021-12-01 15:49:59 UTC
Description of problem:
ovs-migration.service failed


How reproducible:
Deploy BM OCP with bond on IPv6 control plane and trigger OVN migration

Steps to Reproduce:
1.
2.
3.

Actual results:
Migration fails, one of masters stuck with status NotReady,SchedulingDisabled

Expected results:
Migration successfully finished

Additional info:
ClusterID: e75a9bc3-ec72-4be9-93d9-9be9af4bfcdb
ClusterVersion: Stable at "4.8.0-0.nightly-2021-11-26-050203"

[kni@provisionhost-0-0 ~]$ oc get nodes
NAME                                              STATUS                        ROLES    AGE   VERSION
master-0-0.ocp-edge-cluster-0.qe.lab.redhat.com   NotReady,SchedulingDisabled   master   43h   v1.21.6+935ba91
master-0-1.ocp-edge-cluster-0.qe.lab.redhat.com   Ready                         master   43h   v1.21.6+935ba91
master-0-2.ocp-edge-cluster-0.qe.lab.redhat.com   Ready                         master   43h   v1.21.6+935ba91
worker-0-0.ocp-edge-cluster-0.qe.lab.redhat.com   Ready                         worker   43h   v1.21.6+935ba91
worker-0-1.ocp-edge-cluster-0.qe.lab.redhat.com   Ready                         worker   43h   v1.21.6+935ba91

[core@master-0-0 ~]$ systemctl status ovs-configuration.service
● ovs-configuration.service - Configures OVS with proper host networking configuration
   Loaded: loaded (/etc/systemd/system/ovs-configuration.service; enabled; vendor preset: disabled)
   Active: inactive (dead) since Mon 2021-11-29 10:12:07 UTC; 20h ago
 Main PID: 3115 (code=exited, status=0/SUCCESS)
      CPU: 155ms

Nov 29 10:12:07 master-0-0.ocp-edge-cluster-0.qe.lab.redhat.com configure-ovs.sh[3115]: + echo 'Driver name is'
Nov 29 10:12:07 master-0-0.ocp-edge-cluster-0.qe.lab.redhat.com configure-ovs.sh[3115]: Driver name is
Nov 29 10:12:07 master-0-0.ocp-edge-cluster-0.qe.lab.redhat.com configure-ovs.sh[3115]: + '[' '' = vmxnet3 ']'
Nov 29 10:12:07 master-0-0.ocp-edge-cluster-0.qe.lab.redhat.com configure-ovs.sh[3115]: + echo 'Networking already configured and up for br-ex!'
Nov 29 10:12:07 master-0-0.ocp-edge-cluster-0.qe.lab.redhat.com configure-ovs.sh[3115]: Networking already configured and up for br-ex!
Nov 29 10:12:07 master-0-0.ocp-edge-cluster-0.qe.lab.redhat.com configure-ovs.sh[3115]: + ovs-vsctl --timeout=30 --if-exists del-br br0
Nov 29 10:12:07 master-0-0.ocp-edge-cluster-0.qe.lab.redhat.com systemd[1]: ovs-configuration.service: Succeeded.
Nov 29 10:12:07 master-0-0.ocp-edge-cluster-0.qe.lab.redhat.com configure-ovs.sh[3115]: + exit 0
Nov 29 10:12:07 master-0-0.ocp-edge-cluster-0.qe.lab.redhat.com systemd[1]: Started Configures OVS with proper host networking configuration.
Nov 29 10:12:07 master-0-0.ocp-edge-cluster-0.qe.lab.redhat.com systemd[1]: ovs-configuration.service: Consumed 155ms CPU time

[core@master-0-0 ~]$ systemctl status ovs-migration.service
● ovs-migration.service - Migrates OVS configuration to use a new interface on the host
   Loaded: loaded (/etc/systemd/system/ovs-migration.service; enabled; vendor preset: disabled)
   Active: failed (Result: exit-code) since Mon 2021-11-29 10:12:44 UTC; 20h ago
 Main PID: 3177 (code=exited, status=1/FAILURE)
      CPU: 1.381s

Nov 29 10:12:44 master-0-0.ocp-edge-cluster-0.qe.lab.redhat.com migrateOVN.sh[3177]:      state:      0
Nov 29 10:12:44 master-0-0.ocp-edge-cluster-0.qe.lab.redhat.com migrateOVN.sh[3177]:      speed: 0 Mbps now, 0 Mbps max
Nov 29 10:12:44 master-0-0.ocp-edge-cluster-0.qe.lab.redhat.com migrateOVN.sh[3177]: OFPT_GET_CONFIG_REPLY (xid=0x4): frags=normal miss_send_len=0
Nov 29 10:12:44 master-0-0.ocp-edge-cluster-0.qe.lab.redhat.com migrateOVN.sh[3177]: + ovs-vsctl list port bond0.373
Nov 29 10:12:44 master-0-0.ocp-edge-cluster-0.qe.lab.redhat.com migrateOVN.sh[3177]: ovs-vsctl: no row "bond0.373" in table Port
Nov 29 10:12:44 master-0-0.ocp-edge-cluster-0.qe.lab.redhat.com migrateOVN.sh[3177]: + exit 1
Nov 29 10:12:44 master-0-0.ocp-edge-cluster-0.qe.lab.redhat.com systemd[1]: ovs-migration.service: Main process exited, code=exited, status=1/FAILURE
Nov 29 10:12:44 master-0-0.ocp-edge-cluster-0.qe.lab.redhat.com systemd[1]: ovs-migration.service: Failed with result 'exit-code'.
Nov 29 10:12:44 master-0-0.ocp-edge-cluster-0.qe.lab.redhat.com systemd[1]: Failed to start Migrates OVS configuration to use a new interface on the host.
Nov 29 10:12:44 master-0-0.ocp-edge-cluster-0.qe.lab.redhat.com systemd[1]: ovs-migration.service: Consumed 1.381s CPU time

Comment 2 Ben Nemec 2021-12-02 15:48:11 UTC
Moving to OVNK since this appears to be an issue with configure-ovs.

Comment 31 Victor Voronkov 2022-11-10 09:34:11 UTC
@jcaamano I can't help with this (moved to another project), check with yprokule

Comment 32 Jaime Caamaño Ruiz 2022-11-14 16:49:10 UTC
I am going to close this issue due to lack of progress. Please, refer to https://bugzilla.redhat.com/show_bug.cgi?id=2100181 or reopen if you think there is additional stuff that needs to be addressed with this BZ.

*** This bug has been marked as a duplicate of bug 2100181 ***