Created attachment 1874467 [details] Instance life cycle for test including live migration failure Description of problem: Multiple live migration tests are failing across several jobs for RHOS 17 with RHEL 9 with error: Error monitoring migration: Cannot load 'vifs' in the base class: NotImplementedError: Cannot load 'vifs' in the base class. Full logs of instance life cycle and migration failure has been attached. Version-Release number of selected component (if applicable): RHOS-17.0-RHEL-9-20220414.n.1 How reproducible: 100% over the last two CI phase3 runs Steps to Reproduce: 1. Deploy a RHOS 17 environment with RHEL 9 and run standard compute tempest suite with live migration parameters enabled 2. 3. Actual results: Live migration is failing for multiple tempest tests Expected results: Live migration passes Additional info: Can provide an environment if necessary. Overall Job Link: https://rhos-ci-jenkins.lab.eng.tlv2.redhat.com/job/DFG-compute-nova-17.0_director-rhel-virthost-1cont_2comp-ipv4-geneve-live-migration-config-drive-iso9660-phase3 Build Link: http://rhos-ci-logs.lab.eng.tlv2.redhat.com/logs/rcj/DFG-compute-nova-17.0_director-rhel-virthost-1cont_2comp-ipv4-geneve-live-migration-config-drive-iso9660-phase3/5 Deployment Logs: http://rhos-ci-logs.lab.eng.tlv2.redhat.com/logs/rcj/DFG-compute-nova-17.0_director-rhel-virthost-1cont_2comp-ipv4-geneve-live-migration-config-drive-iso9660-phase3/5 Test Report: https://rhos-ci-jenkins.lab.eng.tlv2.redhat.com//job/DFG-compute-nova-17.0_director-rhel-virthost-1cont_2comp-ipv4-geneve-live-migration-config-drive-iso9660-phase3/5//testReport/
Rollback failure reproduced upstream in functional test here: https://review.opendev.org/c/openstack/nova/+/839091 This doesn't address why the live migration didn't work in the first place, but at least we know we have an edge case in the rollback code.
Has the
Moving to z1 as this isn't really a blocker - it's an edge case when rolling back failed live migrations when Neutron doesn't have the multiple port bindings extension.
*** Bug 2110486 has been marked as a duplicate of this bug. ***
We've merged the fix [1] but we're still not convinced that this is a blocker. Again, I'd like to make clear that this BZ is only tracking live migration *rollback* failures, not live migration failures. In addition, this is latent behaviour that exited in all previous versions, but apparently has only been hit in CI now with the live migration failures that we're seeing (tracked in [2] and [3]). So unless new CI has been stood up for 17 that specifically tests rollback, we're not sure how this is blocking anything. [1] https://code.engineering.redhat.com/gerrit/c/nova/+/411586 [2] https://bugzilla.redhat.com/show_bug.cgi?id=2089520 [3] https://bugzilla.redhat.com/show_bug.cgi?id=2110556
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Red Hat OpenStack Platform 17.0.1 bug fix and enhancement advisory), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2023:0271