Bug 1948579 - [ovn] Migration tool is not resilient enough to errors
Summary: [ovn] Migration tool is not resilient enough to errors
Keywords:
Status: CLOSED DUPLICATE of bug 1823324
Alias: None
Product: Red Hat OpenStack
Classification: Red Hat
Component: python-networking-ovn
Version: 16.1 (Train)
Hardware: Unspecified
OS: Unspecified
Priority: unspecified
Severity: unspecified
Target Milestone: ---
Target Release: ---
Assignee: OSP Team
QA Contact: Eran Kuris
URL:
Whiteboard:
Depends On:
Blocks:
 
Reported: 2021-04-12 14:04 UTC by Daniel Alvarez Sanchez
Modified: 2023-09-18 00:25 UTC

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2022-01-06 14:51:29 UTC
Target Upstream Version:
Embargoed:



Links
System                 ID        Private  Priority  Status  Summary  Last Updated
Red Hat Issue Tracker  OSP-2317  0        None      None    None     2022-01-06 14:52:40 UTC

Description Daniel Alvarez Sanchez 2021-04-12 14:04:20 UTC
Today, if the migration process fails at any point, manual intervention is needed because the tool/process lacks resilience.

For example, if an issue occurs in the activate-ovn phase [0], or in the middle of the process, such as in [1] when changing the network type to Geneve, the cloud is left in an intermediate state that is neither ML2/OVS nor ML2/OVN, and engineering intervention is required to complete the process.

A daemon/agent running on all the nodes could help by tracking the current state of the migration, retrying on errors, and maybe even reverting things back if all goes wrong.

[0] https://github.com/openstack/neutron/blob/master/tools/ovn_migration/tripleo_environment/playbooks/roles/migration/templates/activate-ovn.sh.j2

[1] https://github.com/openstack/neutron/blob/master/tools/ovn_migration/tripleo_environment/playbooks/ovn-migration.yml#L39
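
To illustrate the daemon/agent idea above, here is a minimal Python sketch of a runner that records which phases have completed, retries a failing phase, and reverts everything if the retries are exhausted. It is not how the migration tool works today (the real process is driven by the playbook and script linked in [0] and [1]); the names Phase, MigrationRunner, state_file, max_attempts and retry_delay are all hypothetical, and the sketch assumes each phase's revert callable is safe to run on a partially applied phase.

```python
import json
import time
from dataclasses import dataclass
from pathlib import Path
from typing import Callable, List


@dataclass
class Phase:
    """One migration step plus a way to undo it (hypothetical structure)."""
    name: str
    apply: Callable[[], None]
    revert: Callable[[], None]


class MigrationRunner:
    """Runs phases in order, persisting progress so a rerun can resume."""

    def __init__(self, phases: List[Phase], state_file: str,
                 max_attempts: int = 3, retry_delay: float = 0.0):
        self.phases = phases
        self.state_file = Path(state_file)
        self.max_attempts = max_attempts
        self.retry_delay = retry_delay

    def _load_done(self) -> List[str]:
        # Names of phases completed by a previous (possibly interrupted) run.
        if self.state_file.exists():
            return json.loads(self.state_file.read_text())["done"]
        return []

    def _save_done(self, done: List[str]) -> None:
        self.state_file.write_text(json.dumps({"done": done}))

    def _try_phase(self, phase: Phase) -> bool:
        # Retry the phase up to max_attempts times before giving up.
        for attempt in range(self.max_attempts):
            try:
                phase.apply()
                return True
            except Exception:
                if attempt + 1 < self.max_attempts:
                    time.sleep(self.retry_delay)
        return False

    def _rollback(self, done: List[str]) -> None:
        # Undo completed phases in reverse order, updating state as we go.
        by_name = {p.name: p for p in self.phases}
        while done:
            by_name[done.pop()].revert()
            self._save_done(done)

    def run(self) -> bool:
        """Return True if every phase completed, False if reverted."""
        done = self._load_done()
        for phase in self.phases:
            if phase.name in done:
                continue  # already finished in an earlier run
            if not self._try_phase(phase):
                # Assumption: revert is safe on a partially applied phase.
                phase.revert()
                self._rollback(done)
                return False
            done.append(phase.name)
            self._save_done(done)
        return True
```

Because progress is written after every phase, rerunning the runner after a crash skips the phases that already finished, which avoids repeating work on a cloud that is halfway between ML2/OVS and ML2/OVN.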

Comment 3 Jakub Libosvar 2022-01-06 14:51:29 UTC
If the migration fails, we should be able to revert to OVS using the revert mechanism tracked in bug 1823324.

*** This bug has been marked as a duplicate of bug 1823324 ***

Comment 4 Red Hat Bugzilla 2023-09-18 00:25:46 UTC
The needinfo request[s] on this closed bug have been removed as they have been unresolved for 120 days

