Bug 1520294
Summary: | Hot-unplug the second pf cause qemu promote " Failed to remove group $iommu_group_num from KVM VFIO device:" | ||
---|---|---|---|
Product: | Red Hat Enterprise Linux 7 | Reporter: | Yanan Fu <yfu> |
Component: | qemu-kvm-rhev | Assignee: | Alex Williamson <alex.williamson> |
Status: | CLOSED ERRATA | QA Contact: | Yanan Fu <yfu> |
Severity: | high | Docs Contact: | |
Priority: | high | ||
Version: | 7.5 | CC: | alex.williamson, chayang, eric.auger, jinzhao, juzhang, knoel, lmiksik, mrezanin, virt-maint, yfu |
Target Milestone: | rc | Keywords: | Regression |
Target Release: | --- | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | |||
Fixed In Version: | qemu-kvm-rhev-2.10.0-13.el7 | Doc Type: | If docs needed, set a value |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2018-04-11 00:52:14 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Description
Yanan Fu
2017-12-04 08:25:46 UTC
This is indeed a qemu-kvm-rhev bug, a regression was introduced in QEMU 2.10 where additional groups attached to the same container do not get registered with the kvm-vfio device, thus when those devices are removed we can't find the original registration. More worrisome is that if the first device gets removed, KVM might revert to instruction emulation that isn't compatible with IOMMUs with non-coherent DMA. KVMGT could also be affected if the vGPU is not the first device registered as the "set KVM" notifier would not be triggered. Posted upstream and will try to get in for QEMU 2.11. Eric, I'd appreciate a timely upstream review given how close we are to 2.11 upstream. Adding regression tag since the result here could be more serious than a spurious warning message. https://lists.gnu.org/archive/html/qemu-devel/2017-12/msg00774.html Reproduced the issue as well on ARM64 Gigabyte machine with assignment of the 2 X540T2 PFs. Fix included in qemu-kvm-rhev-2.10.0-13.el7 Test with new version: qemu: qemu-kvm-rhev-2.10.0-13.el7 kernel: kernel-3.10.0-820.el7.x86_64 Same test steps, it is ok. This issue already be fixed. Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHSA-2018:1104 |