Bug 2139775 - [OVN][16.1] Migration from OVS to OVN hangs on "Sync neutron db with OVN db" task
Summary: [OVN][16.1] Migration from OVS to OVN hangs on "Sync neutron db with OVN db" ...
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat OpenStack
Classification: Red Hat
Component: python-networking-ovn
Version: 16.1 (Train)
Hardware: Unspecified
OS: Unspecified
high
high
Target Milestone: z9
: 16.1 (Train on RHEL 8.2)
Assignee: Arnau Verdaguer
QA Contact: Roman Safronov
URL:
Whiteboard:
Depends On:
Blocks: 2129882
TreeView+ depends on / blocked
 
Reported: 2022-11-03 11:34 UTC by Roman Safronov
Modified: 2022-12-07 20:28 UTC (History)
12 users (show)

Fixed In Version: python-networking-ovn-7.3.1-1.20221013173227.4e24f4c.el8ost
Doc Type: No Doc Update
Doc Text:
Clone Of:
Environment:
Last Closed: 2022-12-07 20:27:53 UTC
Target Upstream Version:
Embargoed:
gurpsing: needinfo-


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Red Hat Issue Tracker OSP-19885 0 None None None 2022-11-03 11:56:19 UTC
Red Hat Product Errata RHBA-2022:8795 0 None None None 2022-12-07 20:28:31 UTC

Description Roman Safronov 2022-11-03 11:34:37 UTC
Description of problem:
OVN migration hangs on "Sync neutron db with OVN db" task.
Dowstream CI job is not able to complete OVN migration during 10 hours and then timeouts.
From the ovn migration tool log:
TASK [migration : Sync neutron db with OVN db (container) - Run 1] *************
task path: /home/stack/ovn_migration/playbooks/roles/migration/tasks/sync-dbs.yml:7
Wednesday 02 November 2022  19:15:38 +0000 (0:00:01.182)       0:45:48.529 **** 
META: noop
META: noop

We had the similar situation with OSP17.0 and the reason was a regression caused by https://review.opendev.org/c/openstack/neutron/+/781555
and it was fixed upstream by https://review.opendev.org/c/openstack/neutron/+/817637 or or https://review.opendev.org/c/openstack/neutron/+/805768


Version-Release number of selected component (if applicable):
RHOS-16.1-RHEL-8-20221031.n.1
python3-networking-ovn-7.3.1-1.20221013173226.4e24f4c

How reproducible:
100%

Steps to Reproduce:
1.Deploy OSP16.1 HA environment (3 controllers+ 2 computes)  with ML2OVS neutron backend
2. Create a workload, router, network, VM connected to the network
3. Try to run migration to ML2OVN according to the official procedure
https://access.redhat.com/documentation/en-us/red_hat_openstack_platform/16.2/html-single/migrating_the_networking_service_to_the_ml2ovn_mechanism_driver/index


Actual results:
OVN migration script get stuck on "Sync neutron db with OVN db" task.

Expected results:
OVN migration does not get stuck and finishes successfully.

Additional info:

Comment 6 Gurpreet Singh 2022-11-13 23:28:06 UTC
Hi Eran 

We can skip 16.1, but we need to support 16.2 for OVN migration. So this bug needs to be addressed for 16.2.z

Regards
Gurpreet

Comment 7 Eran Kuris 2022-11-14 07:02:39 UTC
(In reply to Gurpreet Singh from comment #6)
> Hi Eran 
> 
> We can skip 16.1, but we need to support 16.2 for OVN migration. So this bug
> needs to be addressed for 16.2.z
> 
> Regards
> Gurpreet

Thanks, Gurpreet. 
so we have the approval to drop OVN migration support from OSP16.1 

The support for 16.2 will stay. This issue is tracked for 16.2  via this BZ:
https://bugzilla.redhat.com/show_bug.cgi?id=2139674

James, we need to make sure this change will be documented.

Comment 13 Roman Safronov 2022-11-20 21:33:49 UTC
Verified on RHOS-16.1-RHEL-8-20221116.n.1  with  python3-networking-ovn-7.3.1-1.20221013173227.4e24f4c.el8ost.noarch.rpm
Verified by running ovs2ovn d/s CI job. Migration to OVN did not hang and completed successfully.

Comment 22 errata-xmlrpc 2022-12-07 20:27:53 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Red Hat OpenStack Platform 16.1.9 bug fix and enhancement advisory), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2022:8795


Note You need to log in before you can comment on or make changes to this bug.