Bug 1360970
Summary: | [RFE][TestOnly] SR-IOV live migration | |||
---|---|---|---|---|
Product: | Red Hat OpenStack | Reporter: | VIKRANT <vaggarwa> | |
Component: | openstack-nova | Assignee: | smooney | |
Status: | CLOSED ERRATA | QA Contact: | James Parker <jparker> | |
Severity: | urgent | Docs Contact: | ||
Priority: | urgent | |||
Version: | 7.0 (Kilo) | CC: | amodi, brault, ccopello, dasmith, egallen, eglynn, gkadam, gregraka, joflynn, jraju, kchamart, lyarwood, mbooth, nlevinki, sbauza, scohen, sgordon, smooney, sputhenp, srevivo, stephenfin | |
Target Milestone: | beta | Keywords: | FutureFeature, TestOnly, Triaged, ZStream | |
Target Release: | 16.0 (Train on RHEL 8.1) | |||
Hardware: | All | |||
OS: | Linux | |||
Whiteboard: | ||||
Fixed In Version: | openstack-nova-19.1.0-0.20190607104234.d7bad34.el8ost | Doc Type: | Enhancement | |
Doc Text: |
With this enhancement, support for live migration of instances with attached SR-IOV-based neutron interfaces has been added. Neutron SR-IOV interfaces can be grouped into two categories: direct mode and indirect mode. Direct mode SR-IOV interfaces are directly attached to the guest and exposed to the guest OS. Indirect mode SR-IOV interfaces have a software interface, for example, a macvtap, between the guest and the SR-IOV device. This feature enables transparent live migration for instances with indirect mode SR-IOV devices. As there is no generic way to copy hardware state during a live migration, direct mode migration is not transparent to the guest. For direct mode interfaces, mimic the workflow already in place for suspend and resume. For example, with SR-IOV devices, detach the direct mode interfaces before migration and re-attach them after the migration. As a result, instances with direct mode SR-IOV ports lose network connectivity during a migration unless a bond with a live migratable interface is created within the guest.
Previously, it was not possible to live migrate instances with SR-IOV-based network interfaces. This was problematic as live migration is frequently used for host maintenance and similar actions. Previously, the instance had to be cold migrated which involves downtime for the guest.
This enhancement results in the live migration of instances with SR-IOV-based network interfaces.
|
Story Points: | --- | |
Clone Of: | ||||
: | 1631723 1780417 (view as bug list) | Environment: | ||
Last Closed: | 2020-02-06 14:37:21 UTC | Type: | Bug | |
Regression: | --- | Mount Type: | --- | |
Documentation: | --- | CRM: | ||
Verified Versions: | Category: | --- | ||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | ||
Cloudforms Team: | --- | Target Upstream Version: | ||
Embargoed: | ||||
Bug Depends On: | 1631723 | |||
Bug Blocks: | 1732913, 1756916, 1780417 |
Description
VIKRANT
2016-07-28 05:13:15 UTC
*** This bug has been marked as a duplicate of bug 1222414 *** This issue is related to live-migrate instances with SRIOV and should not be considered as the same as live-migrating with CPU pinning. Both can work independently and even is the fix handle both situations (which i have some doubts) QA will have to test them independently. (In reply to Sahid Ferdjaoui from comment #13) > This issue is related to live-migrate instances with SRIOV and should not be > considered as the same as live-migrating with CPU pinning. Both can work > independently and even is the fix handle both situations (which i have some > doubts) QA will have to test them independently. OK, thanks for the context. I've updated the title accordingly. Requirements: - Support live migration with passthrough of full PCI NIC. - Support live migration with passthrough of PF. - Support live migration with passthrough of VF. - In all cases, performance of networking in general VM lifecycle should not be impacted. Performance degradation during live migration is acceptable. (In reply to Stephen Gordon from comment #15) > Requirements: > > - Support live migration with passthrough of full PCI NIC. > - Support live migration with passthrough of PF. > - Support live migration with passthrough of VF. > - In all cases, performance of networking in general VM lifecycle should not > be impacted. Performance degradation during live migration is acceptable. Achieving this from a technical POV, would require a multi-nic setup in the guest with bonding/teaming. ie every guest would need to have 2 NICs, one SRIOV based and one emulated, both connected to same host network. At migration the SRIOV device would have to be hot-removed, and a new one added afterwards. IOW, as well as impacting guest network performance, you need to mandate special guest setup, and guest cooperation for hot-unplug at start of migration. The implication is if the guest OS is crashed, is in early boot up phase, or otherwise non-responsive, live migration still won't be possible as it won't be responding to the initial hot-unplug request. Not a showstopper though - largely a documentation / expectation setting problem. *** Bug 1631723 has been marked as a duplicate of this bug. *** Note this feature is being targeted for OSP 16 and will not be backportable. all functional code releated to this RFE has merged upstream in master. there is 1 minor follow up patch still pending to adress some code style nits https://review.opendev.org/#/c/659101/ and a docs only patch that needs to be written but this RFE is now feature commplete upstream and testing can start. Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHEA-2020:0283 |