Bug 1222414
| Summary: | [RFE] Enable live migration for pinned instances | | |
|---|---|---|---|
| Product: | Red Hat OpenStack | Reporter: | Itzik Brown <itbrown> |
| Component: | openstack-nova | Assignee: | Artom Lifshitz <alifshit> |
| Status: | CLOSED ERRATA | QA Contact: | James Parker <jparker> |
| Severity: | urgent | Docs Contact: | |
| Priority: | urgent | | |
| Version: | 7.0 (Kilo) | CC: | acanan, aherr, akarlsso, alifshit, amodi, asimonel, assaf.eylath, brault, ccopello, cswanson, dasmith, djuran, dvd, ealcaniz, eelena, egallen, eglynn, fbaudin, fherrman, fsoppels, houyatao, itbrown, Jing.C.Zhang, jjoyce, joflynn, jraju, jschluet, kchamart, lyarwood, marjones, maydin, mbooth, michael.or, mmethot, myllynen, nlevinki, sbauza, sclewis, scohen, sgordon, sputhenp, srevivo, stephenfin, tamar.inbar-shelach, tvvcox, vaggarwa, vcojot, vromanso, weiyongjun |
| Target Milestone: | beta | Keywords: | FutureFeature, Triaged |
| Target Release: | 16.0 (Train on RHEL 8.1) | | |
| Hardware: | Unspecified | | |
| OS: | Linux | | |
| Whiteboard: | | | |
| Fixed In Version: | openstack-nova-20.0.1-0.20191025043858.390db63.el8ost | Doc Type: | Enhancement |
| Doc Text: | With this enhancement, support for live migration of instances with a NUMA topology has been added. Previously, this action was disabled by default. It could be enabled using the '[workarounds] enable_numa_live_migration' config option, but this defaulted to False because live migrating such instances resulted in them being moved to the destination host without updating any of the underlying NUMA guest-to-host mappings or the resource usage. With the new NUMA-aware live migration feature, if the instance cannot fit on the destination, the live migration will be attempted on an alternate destination if the request is set up to have alternates. If the instance can fit on the destination, the NUMA guest-to-host mappings will be re-calculated to reflect its new host, and its resource usage updated. | Story Points: | --- |
| Clone Of: | | | |
| : | 1780366 | Environment: | |
| Last Closed: | 2020-02-06 14:37:21 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | | Category: | --- |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | | | |
| Bug Depends On: | | | |
| Bug Blocks: | 1281573, 1339866, 1414999, 1431627, 1442136, 1478186, 1500145, 1500557, 1595325, 1669579, 1732913, 1756916, 1769425, 1780366 | | |
| Attachments: | | | |
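The behavior summarized in the Doc Text (try the requested destination, fall back to alternates if the instance does not fit, then recalculate the guest-to-host mappings and update resource usage) can be illustrated with a minimal Python sketch. This is not Nova's code: `Host`, `Instance`, `fit`, and `live_migrate` are hypothetical names, and the NUMA topology is simplified to a flat set of free physical CPUs per host.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional, Sequence, Set


@dataclass
class Host:
    """Hypothetical compute host, reduced to its set of free physical CPUs."""
    name: str
    free_pcpus: Set[int] = field(default_factory=set)


@dataclass
class Instance:
    """Hypothetical pinned instance with a guest vCPU -> host pCPU mapping."""
    name: str
    host: str
    pinning: Dict[int, int]


def fit(instance: Instance, host: Host) -> Optional[Dict[int, int]]:
    """Return a fresh vCPU -> pCPU mapping on `host`, or None if it cannot fit."""
    vcpus = sorted(instance.pinning)
    free = sorted(host.free_pcpus)
    if len(free) < len(vcpus):
        return None
    return dict(zip(vcpus, free))


def live_migrate(
    instance: Instance,
    source: Host,
    destination: Host,
    alternates: Sequence[Host] = (),
) -> Optional[Host]:
    """Try the destination, then each alternate; return the host used, or None."""
    for candidate in [destination, *alternates]:
        mapping = fit(instance, candidate)
        if mapping is None:
            continue  # does not fit here, try the next alternate
        # Update resource usage: claim on the new host, release on the source.
        candidate.free_pcpus -= set(mapping.values())
        source.free_pcpus |= set(instance.pinning.values())
        # Recalculate the guest-to-host mapping for the new host.
        instance.pinning = mapping
        instance.host = candidate.name
        return candidate
    return None  # no candidate could host the instance; nothing was changed
```

In this simplified model, the pre-fix behavior corresponds to changing `instance.host` while leaving `instance.pinning` and both hosts' `free_pcpus` untouched, which is why the workaround option defaulted to False.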
Description
Itzik Brown
2015-05-18 07:47:59 UTC
This is a well-known issue upstream. There is a blueprint proposed to fix this (not approved for the Liberty release at this point, but likely to be).

The fix (as described at https://review.openstack.org/#/c/193576/) is very invasive and unlikely to be easily backportable.

We should probably add a release note saying that live migration is not supported for instances with CPU pinning (in addition, we might want to disable it outright).

If we decide to disable it, then it makes sense to keep this as a blocker and do it for GA. Otherwise, we should not block on this; we should add a release note and clone the bug for the next release of RHOS, where it will get properly fixed (upstream in Liberty).

Moving out to A1 as it's not a regression and not a blocker for GA.

(In reply to Nikola Dipanov from comment #3)
> This is a well-known issue upstream. There is a blueprint proposed (not
> approved for Liberty release at this point, but likely to get) to fix this.
>
> The fix is (as is described on https://review.openstack.org/#/c/193576/)
> very invasive and unlikely to be easily backportable.
>
> We should probably add a release note for this saying that live migration is
> not supported for instances with CPU pinning, (in addition we might want to
> outright disable it).
>
> If we decide to disable it - then it makes sense to keep this as a blocker
> and do it for GA, otherwise we should not block on this, relnote, and clone
> the bug for the next release of RHOS where it will get properly fixed
> (upstream in Liberty).

Based on the above, and my understanding that this was not in fact fixed in Liberty, I am moving the flags to rhos-9.0 (Mitaka). Let me know if my interpretation is incorrect.

The patch is well-developed, but dependent on review traction to land.

*** Bug 1319385 has been marked as a duplicate of this bug. ***

Hi Sahid,

Is there any chance of this being accepted in the rc-* phase given it's treated as a bug upstream, or should I move this to Pike?

Thanks,

Steve

(In reply to Stephen Gordon from comment #18)
> Hi Sahid,
>
> Is there any chance of this being accepted in the rc-* phase given it's
> treated as a bug upstream, or should I move this to Pike?
>
> Thanks,
>
> Steve

Nothing is really moving upstream. I guess you should move it to Pike.

*** Bug 1360970 has been marked as a duplicate of this bug. ***

*** Bug 1559314 has been marked as a duplicate of this bug. ***

*** Bug 1585068 has been marked as a duplicate of this bug. ***

Is there any estimate of which version this issue will be resolved in?

*** Bug 1703734 has been marked as a duplicate of this bug. ***

Feature freeze upstream is September 12th. The series is under active review, and has a decent chance of landing before then. If it lands, it'll be in the OSP16 release, but is not backportable to previous releases.

*** Bug 1565129 has been marked as a duplicate of this bug. ***

I'm going to set HasTestAutomation, since we have test cases in upstream whitebox [1]. We could probably add more, but what we currently have at least tests the happy path. We also have functional tests [2] up for review that cover the Nova-specific bits (rollback, rolling upgrade, etc).

[1] https://opendev.org/x/whitebox-tempest-plugin/src/branch/master/whitebox_tempest_plugin/api/compute/test_cpu_pinning.py#L419
[2] https://review.opendev.org/#/c/672595/

Created attachment 1642181: NUMALiveMigrationTest Whitebox Tempest results
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHEA-2020:0283