Bug 2189423
Summary: | Failed to migrate VM from rhel 9.3 to rhel 9.2 | ||
---|---|---|---|
Product: | Red Hat Enterprise Linux 9 | Reporter: | Min Deng <mdeng> |
Component: | qemu-kvm | Assignee: | Leonardo Bras <leobras> |
qemu-kvm sub component: | Live Migration | QA Contact: | Min Deng <mdeng> |
Status: | CLOSED ERRATA | Docs Contact: | |
Severity: | high | ||
Priority: | high | CC: | coli, fjin, jinzhao, juzhang, leobras, lijin, meili, nilal, peterx, virt-maint |
Version: | 9.3 | Keywords: | Triaged |
Target Milestone: | rc | ||
Target Release: | --- | ||
Hardware: | x86_64 | ||
OS: | Linux | ||
Whiteboard: | |||
Fixed In Version: | qemu-kvm-8.0.0-5.el9 | Doc Type: | If docs needed, set a value |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2023-11-07 08:27:12 UTC | Type: | --- |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Description
Min Deng
2023-04-25 07:39:23 UTC
Hmm, when I was looking the upstream backward migration issue over 8.0->7.2 which broke too, I found the recent PCI AER change might be the culprit. I've raised this issue here: https://lore.kernel.org/qemu-devel/ZEhzaWpNM+NvZCUw@x1n It's very possible it's the same issue here for downstream. The bug also reproduced between rhel 9.2 and rhel 9.0 RHEL 9.0 host kernel-5.14.0-303.el9.x86_64 qemu-kvm-6.2.0-11.el9_0.7.x86_64 RHEL 9.2 kernel-5.14.0-302.el9.x86_64 qemu-kvm-8.0.0-1.el9.x86_64 (In reply to Min Deng from comment #5) > The bug also reproduced between rhel 9.2 and rhel 9.0 The bug also reproduced between rhel 9.3 and rhel 9.0 > RHEL 9.0 host > kernel-5.14.0-303.el9.x86_64 > qemu-kvm-6.2.0-11.el9_0.7.x86_64 > RHEL 9.2 RHEL 9.3 > kernel-5.14.0-302.el9.x86_64 > qemu-kvm-8.0.0-1.el9.x86_64 Should be from rhel9.3 to rhel 9.0. Thanks. (In reply to Min Deng from comment #6) > (In reply to Min Deng from comment #5) > > The bug also reproduced between rhel 9.2 and rhel 9.0 > The bug also reproduced between rhel 9.3 and rhel 9.0 > > RHEL 9.0 host > > kernel-5.14.0-303.el9.x86_64 > > qemu-kvm-6.2.0-11.el9_0.7.x86_64 > > RHEL 9.2 > RHEL 9.3 > > kernel-5.14.0-302.el9.x86_64 > > qemu-kvm-8.0.0-1.el9.x86_64 > > Should be from rhel9.3 to rhel 9.0. Thanks. Oh, that makes sense: something introduced in 9.2->9.3 have broken migration Thanks for the testing! I will start debugging this soon. Upstream patch sent: https://patchwork.kernel.org/project/qemu-devel/list/?series=744531 Hi Leonardo Could you please help to set DTM/ITM for this bug ? Thanks Min Hi All, The issue has been reproduced between rhel8.6 to rhel9.3 RHEL 8.6 host 4.18.0-372.58.1.el8_6.x86_64 qemu-kvm-6.2.0-11.module+el8.6.0+18167+43cf40f3.8.x86_64 edk2-ovmf-20220126gitbb1bba3d77-2.el8_6.1.noarch RHEL 9.3 host 5.14.0-316.el9.x86_64 qemu-kvm-8.0.0-4.el9.x86_64 edk2-ovmf-20230301gitf80f052277c8-4.el9.noarch Test results qemu-kvm: warning: Machine type 'pc-q35-rhel8.5/4/3/2/....0' is deprecated: machine types for previous major releases are deprecated QEMU 8.0.0 monitor - type 'help' for more information (qemu) migrate_incoming tcp:[::]:4000 (qemu) qemu-kvm: get_pci_config_device: Bad config data: i=0x6e read: 0 device: 40 cmask: ff wmask: 0 w1cmask:19 qemu-kvm: Failed to load PCIDevice:config qemu-kvm: Failed to load pcie-root-port:parent_obj.parent_obj.parent_obj qemu-kvm: error while loading state for instance 0x0 of device '0000:00:12.0/pcie-root-port' qemu-kvm: load of migration failed: Invalid argument Notes, Except for rhel 8.6.0, the rest tests failed with other machine types. I have to say,the issue blocks almost all tests betwween RHEL8.x and RHEL9.x from QE side. Thanks The MR for this bz: https://gitlab.com/redhat/rhel/src/qemu-kvm/qemu-kvm/-/merge_requests/283 QE bot(pre verify): Set 'Verified:Tested,SanityOnly' as gating/tier1 test pass. (In reply to Min Deng from comment #10) > Hi All, > The issue has been reproduced between rhel8.6 to rhel9.3 > RHEL 8.6 host > 4.18.0-372.58.1.el8_6.x86_64 > qemu-kvm-6.2.0-11.module+el8.6.0+18167+43cf40f3.8.x86_64 > edk2-ovmf-20220126gitbb1bba3d77-2.el8_6.1.noarch > RHEL 9.3 host > 5.14.0-316.el9.x86_64 > qemu-kvm-8.0.0-4.el9.x86_64 > edk2-ovmf-20230301gitf80f052277c8-4.el9.noarch > > Test results > qemu-kvm: warning: Machine type 'pc-q35-rhel8.5/4/3/2/....0' is deprecated: > machine types for previous major releases are deprecated > QEMU 8.0.0 monitor - type 'help' for more information > (qemu) migrate_incoming tcp:[::]:4000 > (qemu) qemu-kvm: get_pci_config_device: Bad config data: i=0x6e read: 0 > device: 40 cmask: ff wmask: 0 w1cmask:19 > qemu-kvm: Failed to load PCIDevice:config > qemu-kvm: Failed to load pcie-root-port:parent_obj.parent_obj.parent_obj > qemu-kvm: error while loading state for instance 0x0 of device > '0000:00:12.0/pcie-root-port' > qemu-kvm: load of migration failed: Invalid argument > > Notes, > Except for rhel 8.6.0, the rest tests failed with other machine types. I > have to say,the issue blocks almost all tests betwween RHEL8.x and RHEL9.x > from QE side. Thanks Hi Min, Thanks for highlighting the importance. Apologies for the delay; I recently came back from my PTOs. It looks like Leonardo already took care of the fix (thanks), so we should be good. However, if something is still missing, please let us know. QE tried the same steps to comment0 on the following builds SRC: RHEL 9.2 kernel-5.14.0-284.18.1.el9_2.x86_64 qemu-kvm-7.2.0-14.el9_2.1.x86_64 RHEL 9.3 kernel-5.14.0-325.el9.x86_64 qemu-kvm-8.0.0-5.el9.x86_64 The original issue has been fixed, thank you ! New bug Bug 2215819 - Stable guest abi test failed while guest is with machine type lower than rhel 8.6.0 (not including 8. Per Leonardo, again, verified the bug on following build (qemu-kvm-8.0.0-10.el9.x86_64), the original issue has gone. SRC:RHEL 9.3 kernel-5.14.0-348.el9.x86_64 qemu-kvm-8.0.0-10.el9.x86_64 DST:RHEL 9.2 5.14.0-284.26.1.el9_2.x86_64 qemu-kvm-7.2.0-14.el9_2.3.x86_64 Steps, please refer to Description Actual results Migration passed Expected results Migration pass Thank you ! Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Moderate: qemu-kvm security, bug fix, and enhancement update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHSA-2023:6368 |