Bug 2077964

Summary: Live migration fails due to Cannot load 'vifs' in the base class: NotImplementedError: Cannot load 'vifs' in the base class
Product: Red Hat OpenStack Reporter: James Parker <jparker>
Component: openstack-novaAssignee: Artom Lifshitz <alifshit>
Status: CLOSED ERRATA QA Contact: OSP DFG:Compute <osp-dfg-compute>
Severity: medium Docs Contact:
Priority: medium    
Version: 17.0 (Wallaby)CC: alifshit, bdobreli, dasmith, eglynn, ekuris, eolivare, jhakimra, jschluet, kchamart, mblue, mkrcmari, oblaut, sbauza, sgordon, smooney, vromanso
Target Milestone: z1Keywords: AutomationBlocker, Patch, Regression, Triaged
Target Release: 17.0   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: openstack-nova-23.2.2-0.20221121200753.7074ac0.el9ost Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2023-01-25 12:28:47 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Attachments:
Description Flags
Instance life cycle for test including live migration failure none

Description James Parker 2022-04-22 17:48:19 UTC
Created attachment 1874467 [details]
Instance life cycle for test including live migration failure

Description of problem: Multiple live migration tests are failing across several jobs for RHOS 17 with RHEL 9 with error: Error monitoring migration: Cannot load 'vifs' in the base class: NotImplementedError: Cannot load 'vifs' in the base class.  Full logs of instance life cycle and migration failure has been attached. 

Version-Release number of selected component (if applicable):
RHOS-17.0-RHEL-9-20220414.n.1

How reproducible:
100% over the last two CI phase3 runs


Steps to Reproduce:
1. Deploy a RHOS 17 environment with RHEL 9 and run standard compute tempest suite with live migration parameters enabled
2.
3.

Actual results:
Live migration is failing for multiple tempest tests

Expected results:
Live migration passes


Additional info:
Can provide an environment if necessary.
Overall Job Link: https://rhos-ci-jenkins.lab.eng.tlv2.redhat.com/job/DFG-compute-nova-17.0_director-rhel-virthost-1cont_2comp-ipv4-geneve-live-migration-config-drive-iso9660-phase3
Build Link: http://rhos-ci-logs.lab.eng.tlv2.redhat.com/logs/rcj/DFG-compute-nova-17.0_director-rhel-virthost-1cont_2comp-ipv4-geneve-live-migration-config-drive-iso9660-phase3/5
Deployment Logs: http://rhos-ci-logs.lab.eng.tlv2.redhat.com/logs/rcj/DFG-compute-nova-17.0_director-rhel-virthost-1cont_2comp-ipv4-geneve-live-migration-config-drive-iso9660-phase3/5
Test Report: https://rhos-ci-jenkins.lab.eng.tlv2.redhat.com//job/DFG-compute-nova-17.0_director-rhel-virthost-1cont_2comp-ipv4-geneve-live-migration-config-drive-iso9660-phase3/5//testReport/

Comment 1 Artom Lifshitz 2022-04-25 13:47:17 UTC
Rollback failure reproduced upstream in functional test here: https://review.opendev.org/c/openstack/nova/+/839091

This doesn't address why the live migration didn't work in the first place, but at least we know we have an edge case in the rollback code.

Comment 2 Marian Krcmarik 2022-06-06 15:31:56 UTC
Has the

Comment 3 Artom Lifshitz 2022-07-07 15:22:35 UTC
Moving to z1 as this isn't really a blocker - it's an edge case when rolling back failed live migrations when Neutron doesn't have the multiple port bindings extension.

Comment 6 Maor 2022-07-25 13:28:26 UTC
*** Bug 2110486 has been marked as a duplicate of this bug. ***

Comment 10 Artom Lifshitz 2022-07-26 17:31:35 UTC
We've merged the fix [1] but we're still not convinced that this is a blocker. Again, I'd like to make clear that this BZ is only tracking live migration *rollback* failures, not live migration failures. In addition, this is latent behaviour that exited in all previous versions, but apparently has only been hit in CI now with the live migration failures that we're seeing (tracked in [2] and [3]). So unless new CI has been stood up for 17 that specifically tests rollback, we're not sure how this is blocking anything.

[1] https://code.engineering.redhat.com/gerrit/c/nova/+/411586
[2] https://bugzilla.redhat.com/show_bug.cgi?id=2089520
[3] https://bugzilla.redhat.com/show_bug.cgi?id=2110556

Comment 22 errata-xmlrpc 2023-01-25 12:28:47 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Red Hat OpenStack Platform 17.0.1 bug fix and enhancement advisory), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2023:0271