Bug 2192580

Summary: [17.1][OVN migration]Trunk VM subport connectivity is broken after VM stop/start (or VM migration)
Product: Red Hat OpenStack Reporter: Eran Kuris <ekuris>
Component: openstack-neutronAssignee: Jakub Libosvar <jlibosva>
Status: CLOSED ERRATA QA Contact: Roman Safronov <rsafrono>
Severity: high Docs Contact:
Priority: high    
Version: 17.1 (Wallaby)CC: apevec, averdagu, bcafarel, chrisw, egarciar, eshulman, gthiemon, ihrachys, jamsmith, jlibosva, jschluet, lhh, majopela, pgrist, rsafrono, scohen, twilson, vkhitrin, ykarel
Target Milestone: gaKeywords: TestBlocker, Triaged
Target Release: 17.1   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: openstack-neutron-18.6.1-1.20230518200971.el9ost Doc Type: No Doc Update
Doc Text:
Story Points: ---
Clone Of: 2073182 Environment:
Last Closed: 2023-08-16 01:14:53 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On: 2073182    
Bug Blocks: 1857652, 2052341    

Comment 27 Ihar Hrachyshka 2023-07-06 14:41:04 UTC
Three points to add here:

- neutron should definitely clean up tpi- interfaces from pre-ovn-migration after live migration is complete. Otherwise ovn-controller on the old host will try to claim the subport there too.

- neutron could work around the issue by also setting requested-chassis= on subport to point to the same chassis as the parent. (Though I don't think it will need to do it once the cleanup mentioned above is applied. I also can't guarantee that such change wouldn't affect something else, e.g. introduce race conditions during a switch of a chassis that would now have to be synchronized for both parent and all its subports in a single transaction. Plus that would require a db sync migration rule in neutron maintenance thread to populate the option for all subports. Overall, I don't recommend doing it.)

- it may be argued that ovn-controller should be smart enough to look at parent port requested-chassis field and not claim subports for parent ports that have a different requested-chassis. This is something to be discussed with ovn team to see if there's a reason it's not done already.

Comment 38 Roman Safronov 2023-07-24 07:30:52 UTC
Verified on RHOS-17.1-RHEL-9-20230719.n.1 with openstack-neutron-ovn-migration-tool-18.6.1-1.20230518200971.el9ost.noarch
Verified that trunk subports are accessible after trunk vm migration or trunk vm reboot on environment where neutron migrated from ovs to ovn. More than 10 builds already passed without this issue.

Comment 44 errata-xmlrpc 2023-08-16 01:14:53 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Release of components for Red Hat OpenStack Platform 17.1 (Wallaby)), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHEA-2023:4577