Bug 1979276 - SVM: non atomic memslot updates cause boot failure with seabios and cpu-pm=on
Summary: SVM: non atomic memslot updates cause boot failure with seabios and cpu-pm=on
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Enterprise Linux 9
Classification: Red Hat
Component: qemu-kvm
Version: 9.0
Hardware: x86_64
OS: Linux
Priority: low
Severity: unspecified
Target Milestone: beta
Target Release: ---
Assignee: Emanuele Giuseppe Esposito
QA Contact: liunana
URL:
Whiteboard:
Duplicates: 2041007 (view as bug list)
Depends On:
Blocks: 2161188
 
Reported: 2021-07-05 12:52 UTC by Maxim Levitsky
Modified: 2023-05-09 07:42 UTC (History)
CC List: 13 users (show)

Fixed In Version: qemu-kvm-7.2.0-5.el9
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Clones: 2161188 (view as bug list)
Environment:
Last Closed: 2023-05-09 07:19:27 UTC
Type: Bug
Target Upstream Version:
Embargoed:




Links
System ID Private Priority Status Summary Last Updated
Gitlab redhat/centos-stream/src qemu-kvm merge_requests 138 0 None opened accel: introduce accelerator blocker API 2023-01-16 12:54:24 UTC
Red Hat Product Errata RHSA-2023:2162 0 None None None 2023-05-09 07:19:56 UTC

Description Maxim Levitsky 2021-07-05 12:52:04 UTC
When booting a VM with '-overcommit cpu-pm=on' and SeaBIOS, the VM crashes with a KVM emulation error.

I traced the root cause of this issue to the non-atomic update of the KVM memory slots.

In a nutshell, this is what is happening:

1. The BIOS ROM area 0xF0000-0xFFFFF is mapped by default to bios-256k.bin, and the area is marked as read-only ROM via the Q35 'PAM' registers.
  Later SeaBIOS uses these PAM registers to disable the ROM and write the RAM content back there (also known as shadowing the ROM into RAM).
  Finally SeaBIOS uses the PAM registers again to mark the area as read-only RAM.

  The relevant SeaBIOS functions are make_bios_writable and make_bios_readonly.

  The relevant QEMU function is mch_update_pam.
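
The PAM decoding that this step relies on can be sketched as follows. This is a minimal model, not QEMU's actual code; the enum and function names are made up here, but the bit semantics (bit 0 routes reads to DRAM, bit 1 routes writes to DRAM, otherwise the ROM is used) follow the Intel PAM register definition:

```c
/* Minimal model of one PAM segment attribute (2 bits per segment):
 * bit 0 set -> reads are serviced by DRAM, otherwise by the ROM;
 * bit 1 set -> writes go to DRAM, otherwise they target the ROM.
 * Illustrative names only, not QEMU identifiers. */
enum pam_target { TARGET_ROM, TARGET_RAM };

static enum pam_target pam_read_target(unsigned attr)
{
    return (attr & 1) ? TARGET_RAM : TARGET_ROM;
}

static enum pam_target pam_write_target(unsigned attr)
{
    return (attr & 2) ? TARGET_RAM : TARGET_ROM;
}
```

In this model, the shadowing sequence is roughly: program "writes to DRAM" (attribute 2) so the ROM content can be copied in place while reads still hit the ROM, then program "reads from DRAM" only (attribute 1) to end up with read-only RAM.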

2. The SeaBIOS BSP code wakes up all other CPUs to detect which are usable (relevant function: smp_scan)
and parks them on a HLT instruction inside the 0xF0000-0xFFFFF area.


3. When QEMU updates the PAM registers for the first time (when SeaBIOS calls make_bios_writable), it disables the 0xF0000-0xFFFFF memslot and extends one of the RAM memslots to cover it.

4. Due to the non-atomic memslot update, first the ROM memslot is disabled and only then is the RAM slot enabled.
   While the ROM slot is disabled, the AP that is parked on HLT wakes up (probably due to a TLB flush request) and hits an NPT fault.

   Since commit e72436bc3a5206f95bb384e741154166ddb3202e, there is code in svm_can_emulate_instruction which detects that we have a code
   fetch problem and that we are now executing code that doesn't belong to any memslot; it therefore fails the emulation and exits to userspace
   with a KVM internal error.

   The commit itself is mostly fine (except that it should translate RIP to a GPA, which in this case happens to be the same; I'll send
   a patch fixing this soon), but other than that, the root cause is the non-atomic memslot update.
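
The window can be shown with a toy memslot table (a deliberately simplified sketch; nothing here is KVM code, and all names are made up): the delete and the re-create are two separate updates, and any translation performed between them finds no slot at all, which is exactly the state the parked AP observes.

```c
#include <stdbool.h>
#include <stddef.h>

/* Toy model of the non-atomic update: one slot covering the BIOS area
 * is deleted, then the adjacent RAM slot is widened.  A lookup between
 * the two steps fails. */
struct toy_slot { unsigned long base_gfn, npages; bool in_use; };

static struct toy_slot slots[8];

static struct toy_slot *toy_gfn_to_slot(unsigned long gfn)
{
    for (size_t i = 0; i < sizeof(slots) / sizeof(slots[0]); i++)
        if (slots[i].in_use &&
            gfn >= slots[i].base_gfn &&
            gfn < slots[i].base_gfn + slots[i].npages)
            return &slots[i];
    return NULL;    /* no memslot covers this gfn: the error path above */
}

static void toy_set_slot(size_t id, unsigned long base_gfn, unsigned long npages)
{
    slots[id] = (struct toy_slot){ base_gfn, npages, npages != 0 };
}
```

Replaying the trace below against this model: between the deletion of the slot covering gfn 0xfc (RIP 0xfc787) and the re-creation of the wider RAM slot, toy_gfn_to_slot(0xfc) returns NULL.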

This is KVM trace with several extra trace points added:





       CPU 0/KVM-378185 [000] 30568.387065: kvm_entry:            vcpu 0, rip 0xe723f
       CPU 0/KVM-378185 [000] 30568.387066: kvm_exit:             reason EXIT_NPF rip 0xe7099 info 100000006 b0000090
       CPU 0/KVM-378185 [000] 30568.387066: kvm_page_fault:       address b0000090 error_code 6
       CPU 0/KVM-378185 [000] 30568.387066: kvm_emulate_insn:     0:e7099: 89 08
       CPU 0/KVM-378185 [000] 30568.387066: vcpu_match_mmio:      gva 0xb0000090 gpa 0xb0000090 Write GPA
       CPU 0/KVM-378185 [000] 30568.387066: kvm_mmio:             mmio write len 4 gpa 0xb0000090 val 0x11111110
       CPU 0/KVM-378185 [000] 30568.387067: kvm_fpu:              unload
       CPU 0/KVM-378185 [000] 30568.387067: kvm_userspace_exit:   reason KVM_EXIT_MMIO (6)
       CPU 1/KVM-378186 [032] 30568.387245: kvm_exit:             reason EXIT_INTR rip 0xfc787 info 0 0
       CPU 1/KVM-378186 [032] 30568.387245: kvm_entry:            vcpu 1, rip 0xfc787
       CPU 0/KVM-378185 [000] 30568.387350: bprint:               __kvm_set_memory_region: memslot delete. base gfn = 0xc0, npages=0x7ff40
       CPU 1/KVM-378186 [032] 30568.387370: kvm_exit:             reason EXIT_INTR rip 0xfc787 info 0 0
       CPU 1/KVM-378186 [032] 30568.387387: kvm_entry:            vcpu 1, rip 0xfc787
       CPU 1/KVM-378186 [032] 30568.387389: kvm_exit:             reason EXIT_NPF rip 0xfc787 info 100000014 fc000
       CPU 1/KVM-378186 [032] 30568.387389: kvm_page_fault:       address fc000 error_code 14
       CPU 1/KVM-378186 [032] 30568.387390: kvm_entry:            vcpu 1, rip 0xfc787
       CPU 1/KVM-378186 [032] 30568.387390: kvm_exit:             reason EXIT_NPF rip 0xfc787 info 100000014 fc000
       CPU 1/KVM-378186 [032] 30568.387390: kvm_page_fault:       address fc000 error_code 14
       CPU 1/KVM-378186 [032] 30568.387391: bprint:               svm_can_emulate_instruction: GVA is 0xfc787
       CPU 1/KVM-378186 [032] 30568.387392: bprint:               svm_can_emulate_instruction: GPA is 0xfc787
       CPU 1/KVM-378186 [032] 30568.387392: bprint:               svm_can_emulate_instruction: GFN is 0xfc
       CPU 1/KVM-378186 [032] 30568.387392: bprint:               svm_can_emulate_instruction: slot 0: base gfn: 0x100000, npages = 0x200000
       CPU 1/KVM-378186 [032] 30568.387392: bprint:               svm_can_emulate_instruction: slot 1: base gfn: 0xfffc0, npages = 0x40
       CPU 1/KVM-378186 [032] 30568.387393: bprint:               svm_can_emulate_instruction: slot 2: base gfn: 0xfb000, npages = 0x800
       CPU 1/KVM-378186 [032] 30568.387393: bprint:               svm_can_emulate_instruction: slot 3: base gfn: 0x0, npages = 0xa0
       CPU 1/KVM-378186 [032] 30568.387393: bputs:                svm_can_emulate_instruction: emulation of an invalid RIP
       CPU 1/KVM-378186 [032] 30568.387394: kvm_emulate_insn:     0:fc787: 89 08
       CPU 1/KVM-378186 [032] 30568.387394: kvm_emulate_insn:     0:fc787: 89 08 FAIL
       CPU 1/KVM-378186 [032] 30568.387394: kvm_fpu:              unload
       CPU 1/KVM-378186 [032] 30568.387395: kvm_userspace_exit:   reason KVM_EXIT_INTERNAL_ERROR (17)
       CPU 0/KVM-378185 [000] 30568.387433: bprint:               __kvm_set_memory_region: memslot update. base gfn = 0xc0, npages=0xb
       CPU 0/KVM-378185 [000] 30568.387460: bprint:               __kvm_set_memory_region: memslot update. base gfn = 0xcb, npages=0x3
       CPU 0/KVM-378185 [000] 30568.387485: bprint:               __kvm_set_memory_region: memslot update. base gfn = 0xce, npages=0xa
       CPU 0/KVM-378185 [000] 30568.387508: bprint:               __kvm_set_memory_region: memslot update. base gfn = 0xd8, npages=0x18
       CPU 0/KVM-378185 [000] 30568.387531: bprint:               __kvm_set_memory_region: memslot update. base gfn = 0xf0, npages=0x10
       CPU 0/KVM-378185 [000] 30568.387555: bprint:               __kvm_set_memory_region: memslot update. base gfn = 0x100, npages=0x7ff00
       CPU 0/KVM-378185 [000] 30568.387828: kvm_fpu:              load
       CPU 0/KVM-378185 [000] 30568.387832: kvm_entry:            vcpu 0, rip 0xe709b
       CPU 0/KVM-378185 [000] 30568.387833: kvm_exit:             reason EXIT_NPF rip 0xe709b info 100000014 e7000
       CPU 0/KVM-378185 [000] 30568.387834: kvm_page_fault:       address e7000 error_code 14
       CPU 0/KVM-378185 [000] 30568.387837: kvm_entry:            vcpu 0, rip 0xe709b
       CPU 0/KVM-378185 [000] 30568.387839: kvm_exit:             reason EXIT_NPF rip 0xe70bb info 100000004 6e14


This is the debug patch that was used:


diff --git a/arch/x86/kvm/svm/svm.c b/arch/x86/kvm/svm/svm.c
index 689fdc62eea8..e274bdb173f7 100644
--- a/arch/x86/kvm/svm/svm.c
+++ b/arch/x86/kvm/svm/svm.c
@@ -4496,13 +4496,37 @@ static bool svm_can_emulate_instruction(struct kvm_vcpu *vcpu, void *insn, int i
 	 */
 	if (likely(!insn || insn_len))
 		return true;
-
 	/*
 	 * If RIP is invalid, go ahead with emulation which will cause an
 	 * internal error exit.
 	 */
-	if (!kvm_vcpu_gfn_to_memslot(vcpu, kvm_rip_read(vcpu) >> PAGE_SHIFT))
+	u64 gva = kvm_get_linear_rip(vcpu);
+
+	trace_printk("GVA is 0x%llx", gva);
+
+	u64 gpa = kvm_mmu_gva_to_gpa_read(vcpu, gva, NULL);
+
+	trace_printk("GPA is 0x%llx", gpa);
+
+	u64 gfn = gpa >> PAGE_SHIFT;
+
+	trace_printk("GFN is 0x%llx", gfn);
+
+	if (gpa == INVALID_GPA || !kvm_vcpu_gfn_to_memslot(vcpu, gfn)) {
+
+
+		struct kvm_memslots* slots = kvm_vcpu_memslots(vcpu);
+
+		int i;
+
+		for (i = 0 ; i < slots->used_slots ; i++) {
+			struct kvm_memory_slot *slot = &slots->memslots[i];
+			trace_printk("slot %d: base gfn: 0x%llx, npages = 0x%lx", i, slot->base_gfn, slot->npages);
+		}
+
+		trace_printk("emulation of an invalid RIP");
 		return true;
+	}
 
 	cr4 = kvm_read_cr4(vcpu);
 	smep = cr4 & X86_CR4_SMEP;
diff --git a/virt/kvm/kvm_main.c b/virt/kvm/kvm_main.c
index f7445c3bcd90..64827065c5ab 100644
--- a/virt/kvm/kvm_main.c
+++ b/virt/kvm/kvm_main.c
@@ -1593,8 +1593,10 @@ int __kvm_set_memory_region(struct kvm *kvm,
 		old.id = id;
 	}
 
-	if (!mem->memory_size)
+	if (!mem->memory_size) {
+		trace_printk("memslot delete. base gfn = 0x%llx, npages=0x%lx", old.base_gfn, old.npages);
 		return kvm_delete_memslot(kvm, mem, &old, as_id);
+	}
 
 	new.as_id = as_id;
 	new.id = id;
@@ -1603,6 +1605,8 @@ int __kvm_set_memory_region(struct kvm *kvm,
 	new.flags = mem->flags;
 	new.userspace_addr = mem->userspace_addr;
 
+	trace_printk("memslot update. base gfn = 0x%llx, npages=0x%lx", new.base_gfn, new.npages);
+
 	if (new.npages > KVM_MEM_MAX_NR_PAGES)
 		return -EINVAL;

Comment 1 Maxim Levitsky 2021-07-05 17:24:19 UTC
Note that I do consider commit e72436bc3a5206f95bb384e741154166ddb3202e to be correct in principle but I wonder if
we need to do the same thing on VMX as well.

Comment 2 Maxim Levitsky 2021-07-05 17:34:12 UTC
Also note that the same VM boots fine on VMX (although the same issue is likely present there), since VMX lacks the
code that was added in commit e72436bc3a5206f95bb384e741154166ddb3202e.

Best regards,
    Maxim Levitsky

Comment 3 Yumei Huang 2021-07-06 07:14:05 UTC
Hi Maxim,

I can't reproduce the issue. The guest boots up normally with '-overcommit cpu-pm=on' and SeaBIOS. The test details are below; would you please help check if anything is missing? Thanks!

qemu-kvm-6.0.0-21.module+el8.5.0+11555+e0ab0d09
kernel-4.18.0-318.el8.x86_64
seabios-1.14.0-1.module+el8.5.0+11676+e3d1fd29.x86_64

QEMU cli:
# /usr/libexec/qemu-kvm \
    -name 'avocado-vt-vm1' \
    -machine q35  \
    -nodefaults \
    -device VGA,bus=pcie.0,addr=0x1  \
    -device pvpanic,ioport=0x505,id=id5SK4co  \
    -device pcie-root-port,id=pcie.0-root-port-2,slot=2,chassis=2,addr=0x2,bus=pcie.0 \
    -device virtio-scsi-pci,id=virtio_scsi_pci0,bus=pcie.0,addr=0x3 \
    -drive id=drive_image1,if=none,snapshot=off,aio=threads,cache=none,format=qcow2,file=/home/kvm_autotest_root/images/rhel850-64-virtio-scsi.qcow2 \
    -device scsi-hd,id=image1,drive=drive_image1 \
    -device pcie-root-port,id=pcie.0-root-port-4,slot=4,chassis=4,addr=0x4,bus=pcie.0 \
    -device virtio-net-pci,mac=9a:39:3a:3b:3c:3d,id=idzyzw7g,vectors=4,netdev=idhia6GM,bus=pcie.0-root-port-4,addr=0x0  \
    -netdev tap,id=idhia6GM \
    -m 8192,slots=20,maxmem=80G  \
    -smp 64 \
    -cpu 'Cascadelake-Server-noTSX',+kvm_pv_unhalt \
    -vnc :0  \
    -rtc base=utc,clock=host,driftfix=slew  \
    -boot order=cdn,once=c,menu=off,strict=off \
    -enable-kvm \
    -device pcie-root-port,id=pcie.0-root-port-5,slot=5,chassis=5,addr=0x5,bus=pcie.0 \
    -monitor stdio \
    -serial tcp:0:1234,server,nowait \
    -monitor unix:/tmp/monitor3,server,nowait \
    -qmp tcp:0:3333,server,nowait \
    -overcommit cpu-pm=on

Comment 4 Maxim Levitsky 2021-07-06 08:10:30 UTC
As I mentioned, this is only broken on SVM, because commit e72436bc3a5206f95bb384e741154166ddb3202e
only added the code for SVM, although similar code is probably needed for VMX as well.

On VMX the same issue exists, but it doesn't cause the guest to fail to boot: during the time
the memslot is disabled, the affected vCPUs are in an endless loop of nested page faults, which
ends when the memslot is re-installed.

Comment 5 Yumei Huang 2021-07-06 08:35:40 UTC
Thanks Maxim. 

I was able to reproduce on EPYC host.

# /usr/libexec/qemu-kvm -overcommit cpu-pm=on -m 8G -smp 8 /home/kvm_autotest_root/images/rhel850-64-virtio-scsi.qcow2 -vnc :0 
KVM internal error. Suberror: 1
emulation failure
KVM internal error. Suberror: 1
emulation failure
KVM internal error. Suberror: 1
emulation failure
KVM internal error. Suberror: 1
emulation failure
KVM internal error. Suberror: 1
emulation failure
KVM internal error. Suberror: 1
emulation failure
KVM internal error. Suberror: 1
emulation failure
EAX=000f35fe EBX=00000000 ECX=000002ff EDX=00000006
ESI=00000000 EDI=00000000 EBP=00000000 ESP=00006d94
EIP=000fd08d EFL=00000002 [-------] CPL=0 II=0 A20=1 SMM=0 HLT=0
...

Comment 6 Paolo Bonzini 2021-07-07 13:16:39 UTC
We have discussed adding atomic memslot updates with Peter several times.  This seems to be a good reason to add it!

Comment 7 Maxim Levitsky 2021-07-07 15:47:55 UTC
Hi Paolo!

I agree with that!

However, I would like to hear your opinion about reverting the commit I found, until we have atomic memslot updates,
on the grounds that the same thing isn't done on VMX anyway.

If you agree to revert it, then I'll send a patch.

If you think we should keep this commit, then to be consistent we should do the
same on VMX (or move that check to common code), and then we need to fix the code in this commit
to translate RIP to a GPA, since otherwise the check it does can be wrong.
I can send patches for both things.

What do you think?

Best regards,
    Maxim Levitsky

Comment 8 Vitaly Kuznetsov 2021-07-19 09:51:21 UTC
Note: we already have https://bugzilla.redhat.com/show_bug.cgi?id=1855298 which seems
to describe an issue with the same root cause.

Comment 9 Maxim Levitsky 2021-07-19 09:57:18 UTC
@Vitaly Kuznetsov.

Yep, exactly, and it is yet another reason to have atomic memslot updates.

Comment 12 John Ferlan 2021-07-22 18:31:05 UTC
Assigned to Rick for next level triage per bz process and age of bug created or assigned to virt-maint without triage.

Comment 17 John Ferlan 2021-09-14 23:57:37 UTC
Bulk update: Move RHEL8 bugs to RHEL9. If necessary to resolve in RHEL8, then clone to the current RHEL8 release.

Comment 18 liunana 2021-12-23 04:00:36 UTC
Can reproduce this bug on both RHEL 8.6.0 and RHEL 9.0.0 on a Milan host.


Test Env:
    qemu-kvm-6.2.0-1.module+el8.6.0+13725+61ae1949
    4.18.0-356.el8.x86_64

    qemu-kvm-6.2.0-1.el9.x86_64
    5.14.0-32.el9.x86_64

Comment 19 Peter Xu 2021-12-23 06:33:41 UTC
Hi, all,

I assume we need the new ioctl() that Paolo mentioned, say KVM_SET_USER_MEMORY_REGIONS (one more "S"), and it should simply allow an array of kvm_userspace_memory_region structs?

(I used to think that allowing memslots to contain holes would work, but it seems not if we simply mark memory regions RW/RO and switch between them.. and that idea would probably add even more complexity without major gain..)

The major (but IMHO not-so-big) challenge, afaict, is that we should allow __kvm_set_memory_region() to apply things to a temporary kvm_memslots; then, in the new ioctl(), we'd need to hold off install_new_memslots() until we have applied all the memslot changes to the temporary copy, so we do the RCU assignment only once to replace the working set with the temp one.

QEMU needs to teach the memory core to batch updates in memory_region_transaction_commit() when it detects the new kernel capability.

But I haven't looked into any real details of it, so I can't tell whether there are other hidden challenges.

Maxim/Vitaly, have you started looking into the details of it yet?  Let me know if I can help in any form; I do see that we're piling up use cases for it, so we'll probably need to tackle it eventually, in some form..

Thanks,
Peter
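
The batched ioctl sketched in comment 19 might take an array argument shaped like the following. This is a hypothetical layout (the ioctl was never merged in this exact form); the per-slot struct mirrors kvm_userspace_memory_region from <linux/kvm.h>, reproduced with stdint types so the sketch is self-contained:

```c
#include <stdint.h>

/* Existing per-slot descriptor, as in <linux/kvm.h>.
 * memory_size == 0 means "delete this slot". */
struct kvm_userspace_memory_region {
    uint32_t slot;
    uint32_t flags;
    uint64_t guest_phys_addr;
    uint64_t memory_size;
    uint64_t userspace_addr;
};

/* Hypothetical batched variant: the kernel would stage every entry on a
 * temporary kvm_memslots and publish the result with a single RCU
 * assignment, so vCPUs never observe an intermediate layout. */
struct kvm_userspace_memory_regions {
    uint32_t nregions;
    uint32_t pad;
    struct kvm_userspace_memory_region regions[];
};
```

With such an interface, QEMU could translate one memory_region_transaction_commit() into one ioctl instead of a delete followed by several creates.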

Comment 20 Vitaly Kuznetsov 2021-12-23 10:06:22 UTC
(In reply to Peter Xu from comment #19)

> Maxime/Vitaly, have you started looking into the details of it, yet?  Let me
> know if I can help in any form; as I do see that we're piling up use cases
> for it so we'll probably need to tackle it ultimately, in any form..

I didn't do any real work yet, I have a BZ:
https://bugzilla.redhat.com/show_bug.cgi?id=1855298
which also needs this. Feel free to pick this up if you have some spare cycles!

Comment 21 Maxim Levitsky 2021-12-23 13:41:27 UTC
@Peter Xu, I agree 100% with everything you said; this is pretty much exactly what I know about this issue.

I am not yet aware of any hidden challenges but as usual there might be.

Best regards,
       Maxim Levitsky

Comment 22 Peter Xu 2021-12-24 00:53:37 UTC
Thanks, Vitaly, Maxim.

After I tried to remember more.. the initial issue I encountered was that syncing the dirty bitmap during memslot removal is racy if we cannot do it atomically (sync+remove), because the guest can still dirty data after the sync but before the removal.

The thing to be investigated is whether the above sync+remove issue can be a real problem of guest data loss during migration; say, whether there is a real use case of the guest writing valid data during memory layout changes.  Since I don't know all the reasons for memory layout changes, I cannot say.. I think we need to decide whether we need to fix that along the way, which the comment 19 proposal may not be able to solve, afaiu, because the sync is not a set-user-mem operation.

Any further thoughts on the above from anyone would still be welcome.  In the meantime, I also hope I didn't miss any other relevant problems..
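
The sync+remove race described here can be shown with a toy dirty bitmap (again a simplified model with made-up names, not KVM code): a write that lands after the sync but before the slot removal is never reported to userspace.

```c
#include <stdbool.h>
#include <string.h>

#define TOY_PAGES 8

/* Toy model: slot-local dirty bits, plus the bits userspace has
 * collected so far.  Removing the slot discards the slot-local bits. */
static bool slot_dirty[TOY_PAGES];
static bool collected[TOY_PAGES];
static bool slot_present = true;

static void guest_write(int page)
{
    if (slot_present)
        slot_dirty[page] = true;
}

/* Rough analogue of a KVM_GET_DIRTY_LOG pass. */
static void sync_dirty_log(void)
{
    for (int i = 0; i < TOY_PAGES; i++) {
        collected[i] |= slot_dirty[i];
        slot_dirty[i] = false;
    }
}

/* Removing the slot throws away any bits not yet synced. */
static void remove_slot(void)
{
    memset(slot_dirty, 0, sizeof(slot_dirty));
    slot_present = false;
}
```

An atomic sync+remove would close this window; a batched set-user-mem ioctl alone would not, which is the caveat raised above.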

Comment 24 liunana 2022-03-31 12:29:23 UTC
*** Bug 2041007 has been marked as a duplicate of this bug. ***

Comment 26 liunana 2022-07-13 04:14:38 UTC
Still can reproduce this issue on 9.1.

Test Env:
    5.14.0-127.el9.x86_64
    qemu-kvm-7.0.0-8.el9.x86_64
    libvirt-client-8.5.0-1.el9.x86_64
    dell-per6515-04.lab.eng.pek2.redhat.com
    Model name:            AMD EPYC 7302P 16-Core Processor

Comment 27 Maxim Levitsky 2022-08-23 08:51:08 UTC
@Emanuele:

Since you are currently working on this bug, I have assigned it to you.

I also did some investigation into why we were not able to reproduce it with SeaBIOS and cpu-pm=on.

So:

- First of all as I said in Comment 2 (and we didn't notice this, I forgot about it), the KVM internal error happens in this case only on AMD
because the patch that started it was AMD only.

- Second, indeed the patch was reverted and somehow accepted without generating too much noise:

commit 31c25585695abdf03d6160aa6d829e855b256329
Author: Sean Christopherson <seanjc>
Date:   Thu Jan 20 01:07:12 2022 +0000

    Revert "KVM: SVM: avoid infinite loop on NPF from bad address"
    
    Revert a completely broken check on an "invalid" RIP in SVM's workaround
    for the DecodeAssists SMAP errata.  kvm_vcpu_gfn_to_memslot() obviously
    expects a gfn, i.e. operates in the guest physical address space, whereas
    RIP is a virtual (not even linear) address.  The "fix" worked for the
    problematic KVM selftest because the test identity mapped RIP.
    
    Fully revert the hack instead of trying to translate RIP to a GPA, as the
    non-SEV case is now handled earlier, and KVM cannot access guest page
    tables to translate RIP.
    
    This reverts commit e72436bc3a5206f95bb384e741154166ddb3202e.
    
    Fixes: e72436bc3a52 ("KVM: SVM: avoid infinite loop on NPF from bad address")
    Reported-by: Liam Merwick <liam.merwick>
    Cc: stable.org
    Signed-off-by: Sean Christopherson <seanjc>
    Reviewed-by: Liam Merwick <liam.merwick>
    Message-Id: <20220120010719.711476-3-seanjc>
    Signed-off-by: Paolo Bonzini <pbonzini>


The underlying issue, of course, as we both know, is still there.

You might have luck reproducing it with this bug:

https://bugzilla.redhat.com/show_bug.cgi?id=1855298

But for me it looks like it is 'working' as well, so you might have
to write a unit test to trigger the issue.

Comment 28 Emanuele Giuseppe Esposito 2022-11-17 13:34:50 UTC
The fix was done in userspace, rather than in KVM.

Upstream patches: https://patchew.org/QEMU/20221111154758.1372674-1-eesposit@redhat.com/

It will probably be merged after the feature freeze, in QEMU 8.0.

While we wait, @liunana or anyone else, if you want you can change the component/subcomponent and maybe the BZ title (I don't know what the correct category is, so I'll leave it to you).

Thank you,
Emanuele
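
Judging from the linked merge request title ("accel: introduce accelerator blocker API"), the userspace approach keeps vCPU threads out of KVM ioctls while the batch of memslot changes is applied. The state machine behind that idea can be modeled as below; this is a schematic sketch only (a real implementation, QEMU's included, needs locking and condition variables to actually block threads):

```c
#include <stdbool.h>

/* Schematic "ioctl inhibit" gate: while memslots are being rewritten,
 * vCPU threads may not start new KVM ioctls; the updater may not start
 * while a vCPU ioctl is in flight.  Only the state machine is modeled;
 * callers getting 'false' are expected to wait and retry. */
static int  ioctls_in_flight;
static bool inhibited;

static bool vcpu_ioctl_begin(void)
{
    if (inhibited)
        return false;            /* vCPU must wait for the update */
    ioctls_in_flight++;
    return true;
}

static void vcpu_ioctl_end(void)
{
    ioctls_in_flight--;
}

static bool memslot_update_begin(void)
{
    if (ioctls_in_flight > 0)
        return false;            /* drain running ioctls first */
    inhibited = true;
    return true;
}

static void memslot_update_end(void)
{
    inhibited = false;
}
```

Between memslot_update_begin() and memslot_update_end(), the whole delete+create sequence runs with no vCPU able to fault on the intermediate layout, which removes the window described in comment 0 without kernel changes.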

Comment 29 liunana 2022-12-06 06:54:16 UTC
(In reply to Emanuele Giuseppe Esposito from comment #28)
> The fix was done in userspace, rather than on KVM. 
> 
> Upstream patches:
> https://patchew.org/QEMU/20221111154758.1372674-1-eesposit@redhat.com/
> 
> It will be probably merged after the feature freeze, in qemu 8.0.
> 
> While we wait, @liunana or anyone else if you want you can change the
> component/subcomponent and maybe BZ title (I don't know what is the correct
> category so I'll leave it to you).
> 
> Thank you,
> Emanuele

Thanks Emanuele.

I can reproduce this issue with upstream qemu on EPYC Milan host:
# ./qemu-system-x86_64 -overcommit cpu-pm=on -m 8G -smp 8 -monitor stdio -cpu host -machine accel=kvm
QEMU 7.1.93 monitor - type 'help' for more information
(qemu) VNC server running on ::1:5900
KVM internal error. Suberror: 1
KVM internal error. Suberror: 1
extra data[0]: 0x0000000000000000
extra data[1]: 0x0000000000000400
extra data[2]: 0x0000000100000014

......



And I can't reproduce this issue with upstream qemu with your patches on EPYC Milan host:
# ./qemu-system-x86_64 -overcommit cpu-pm=on -m 8G -smp 8 -monitor stdio -cpu host -machine accel=kvm
QEMU 7.1.93 monitor - type 'help' for more information
(qemu) VNC server running on ::1:5900
(qemu)


I will change the component of this bug to qemu now.
Feel free to change it back if you have different advice, thanks.


Best regards
Nana Liu

Comment 31 Emanuele Giuseppe Esposito 2022-12-14 15:19:14 UTC
Still waiting for the patches to land upstream.

Comment 33 RHEL Program Management 2023-01-05 07:27:57 UTC
After evaluating this issue, there are no plans to address it further or fix it in an upcoming release.  Therefore, it is being closed.  If plans change such that this issue will be fixed in an upcoming release, then the bug can be reopened.

Comment 37 Yanan Fu 2023-01-19 05:24:08 UTC
QE bot(pre verify): Set 'Verified:Tested,SanityOnly' as gating/tier1 test pass.

Comment 40 liunana 2023-01-31 02:26:14 UTC
Can reproduce this issue with qemu-kvm-7.2.0-4.el9.x86_64.

Can't reproduce this issue with qemu-kvm-7.2.0-5.el9.x86_64.

# /usr/libexec/qemu-kvm -overcommit cpu-pm=on -m 8G -smp 8 -monitor stdio -cpu host -machine accel=kvm -M q35
QEMU 7.2.0 monitor - type 'help' for more information
(qemu) VNC server running on ::1:5900


Move this bug to verified.

Comment 44 errata-xmlrpc 2023-05-09 07:19:27 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Moderate: qemu-kvm security, bug fix, and enhancement update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2023:2162

