1386239 – emulation failure for some of our machines

RHEL Engineering is moving the tracking of its product development work on RHEL 6 through RHEL 9 to Red Hat Jira (issues.redhat.com). If you're a Red Hat customer, please continue to file support cases via the Red Hat customer portal. If you're not, please head to the "RHEL project" in Red Hat Jira and file new tickets here. Individual Bugzilla bugs in the statuses "NEW", "ASSIGNED", and "POST" are being migrated throughout September 2023. Bugs of Red Hat partners with an assigned Engineering Partner Manager (EPM) are migrated in late September as per pre-agreed dates. Bugs against components "kernel", "kernel-rt", and "kpatch" are only migrated if still in "NEW" or "ASSIGNED". If you cannot log in to RH Jira, please consult article #7032570. That failing, please send an e-mail to the RH Jira admins at rh-issues@redhat.com to troubleshoot your issue as a user management inquiry. The email creates a ServiceNow ticket with Red Hat. Individual Bugzilla bugs that are migrated will be moved to status "CLOSED", resolution "MIGRATED", and set with "MigratedToJIRA" in "Keywords". The link to the successor Jira issue will be found under "Links", have a little "two-footprint" icon next to it, and direct you to the "RHEL project" in Red Hat Jira (issue links are of type "https://issues.redhat.com/browse/RHEL-XXXX", where "X" is a digit). This same link will be available in a blue banner at the top of the page informing you that that bug has been migrated.

Bug 1386239 - emulation failure for some of our machines

Summary: emulation failure for some of our machines

Keywords:
Status:	CLOSED DUPLICATE of bug 1428347
Alias:	None
Product:	Red Hat Enterprise Linux 7
Classification:	Red Hat
Component:	seabios
Sub Component:
Version:	7.3
Hardware:	Unspecified
OS:	Unspecified
Priority:	medium
Severity:	high
Target Milestone:	rc
Target Release:	7.4
Assignee:	Virtualization Maintenance
QA Contact:	Virtualization Bugs
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:	1395265 1401400
TreeView+	depends on / blocked

Reported:	2016-10-18 13:19 UTC by Tomas Pelka
Modified:	2017-06-06 14:41 UTC (History)
CC List:	10 users (show)
Fixed In Version:
Doc Type:	If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed:	2017-06-06 14:41:41 UTC
Target Upstream Version:
Embargoed:

Attachments	(Terms of Use)
vm.xml (2.17 KB, text/plain) 2016-10-19 09:08 UTC, Tomas Pelka	no flags	Details
View All

Description Tomas Pelka 2016-10-18 13:19:22 UTC

Description of problem:
2016-10-18 10:31:51.146+0000: starting up libvirt version: 2.0.0, package: 10.el7 (Red Hat, Inc. <http://bugzilla.redhat.com/bugzilla>, 2016-09-21-10:15:26, x86-038.build.eng.bos.redhat.com), qemu version: 1.5.3 (qemu-kvm-1.5.3-126.el7), hostname: qe-dell-ovs5.dqe.lab.eng.bos.redhat.com
LC_ALL=C PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin QEMU_AUDIO_DRV=none /usr/libexec/qemu-kvm -name qe-dell-ovs5-vm-24 -S -machine rhel6.3.0,accel=kvm,usb=off -m 2048 -realtime mlock=off -smp 1,sockets=1,cores=1,threads=1 -uuid 0ebfba3a-83a1-144f-4e5d-9f7731677231 -no-user-config -nodefaults -chardev socket,id=charmonitor,path=/var/lib/libvirt/qemu/domain-239-qe-dell-ovs5-vm-24/monitor.sock,server,nowait -mon chardev=charmonitor,id=monitor,mode=control -rtc base=utc -no-shutdown -boot menu=off,strict=on -device piix3-usb-uhci,id=usb,bus=pci.0,addr=0x1.0x2 -drive file=/dev/vg_virt_01/qe-dell-ovs5-vm-24.img,format=raw,if=none,id=drive-virtio-disk0,cache=none,aio=native -device virtio-blk-pci,scsi=off,bus=pci.0,addr=0x4,drive=drive-virtio-disk0,id=virtio-disk0,bootindex=2 -netdev tap,fd=58,id=hostnet0,vhost=on,vhostfd=82 -device virtio-net-pci,netdev=hostnet0,id=net0,mac=52:54:02:2c:e1:40,bus=pci.0,addr=0x3,bootindex=1 -chardev socket,id=charserial0,path=/tmp/qe-dell-ovs5-vm-24.console,server,nowait -device isa-serial,chardev=charserial0,id=serial0 -vnc 0.0.0.0:28 -vga cirrus -device intel-hda,id=sound0,bus=pci.0,addr=0x6 -device hda-duplex,id=sound0-codec0,bus=sound0.0,cad=0 -device virtio-balloon-pci,id=balloon0,bus=pci.0,addr=0x5 -msg timestamp=on
KVM internal error. Suberror: 1
emulation failure
EAX=63e866d8 EBX=ffd15d73 ECX=63e866d8 EDX=00000052
ESI=00000003 EDI=7ffb13c8 EBP=000f7000 ESP=ffffffd3
EIP=bee133f6 EFL=00010016 [----AP-] CPL=0 II=0 A20=1 SMM=0 HLT=0
ES =0010 00000000 ffffffff 00c09300 DPL=0 DS   [-WA]
CS =0008 00000000 ffffffff 00c09b00 DPL=0 CS32 [-RA]
SS =0010 00000000 ffffffff 00c09300 DPL=0 DS   [-WA]
DS =0010 00000000 ffffffff 00c09300 DPL=0 DS   [-WA]
FS =0010 00000000 ffffffff 00c09300 DPL=0 DS   [-WA]
GS =0010 00000000 ffffffff 00c09300 DPL=0 DS   [-WA]
LDT=0000 00000000 0000ffff 00008200 DPL=0 LDT
TR =0000 00000000 0000ffff 00008b00 DPL=0 TSS32-busy
GDT=     000f7cb0 00000037
IDT=     000f7cee 00000000
CR0=00000011 CR2=00000000 CR3=00000000 CR4=00000000
DR0=0000000000000000 DR1=0000000000000000 DR2=0000000000000000 DR3=0000000000000000 
DR6=00000000ffff0ff0 DR7=0000000000000400
EFER=0000000000000000

Version-Release number of selected component (if applicable):
kernel-3.10.0-512.el7.x86_64
qemu-kvm-1.5.3-126.el7.x86_64
libvirt-daemon-2.0.0-10.el7.x86_64

How reproducible:
?

Steps to Reproduce:
1.
2.
3.

Actual results:


Expected results:


Additional info:
Please advice other logs that I can provide. No oops in dmesg

Comment 1 Tomas Pelka 2016-10-19 09:08:07 UTC

Created attachment 1212078 [details]
vm.xml

This issue is not critical yet, but it will be soon.

We have hypervisor with about 60 VM's that are connected to beaker, when majority of them (90%) are scheduled via beaker, which is easy when we run our mass run, than I start observing mentioned issue.

XML definition for these machines are the same and attached.

One other symptom, besides on auto pausing (only way how to resume from pause state is force reset) machines with mantion trace in c0, I can also see that some machines are stucked with 100% CPU load right after beaker complete provisioning. It need to be restarted in order to continue with scheduled tasks.

Maybe we are just overloading hypervisor (150G RAM, RAID5 SATA, 32 CPU cores)?

Comment 2 Radim Krčmář 2016-10-19 20:47:22 UTC

(In reply to Tomas Pelka from comment #1)
> Created attachment 1212078 [details]
> One other symptom, besides on auto pausing (only way how to resume from
> pause state is force reset) machines with mantion trace in c0,

Btw. the trace doesn't have a line that starts with code "Code=" and follows with a list of bytes at the bottom?
Something like "Code=00 00 [...] 12 <34> 56 [...] ff ff"

>                                                                I can also
> see that some machines are stucked with 100% CPU load right after beaker
> complete provisioning. It need to be restarted in order to continue with
> scheduled tasks.

The emulation error happens when trying to emulate an invalid instruction, but the trace doesn't say what the instruction is, so please run these commands on the host for misbehaving domains:

  virsh qemu-monitor-command $domain --hmp info status
  virsh qemu-monitor-command $domain --hmp info registers
  virsh qemu-monitor-command $domain --hmp 'x /i $pc'
  virsh qemu-monitor-command $domain --hmp 'x /64xb $pc-48'

and share their results.

Thanks.

> Maybe we are just overloading hypervisor (150G RAM, RAID5 SATA, 32 CPU
> cores)?

No, overloading the hypervisor shouldn't cause an emulation error.
The guests might misbehave because things don't execute as fast as they'd expect, but we'd like to fix everything else (and expectations of RHEL guests too).

Comment 3 Tomas Pelka 2016-10-20 07:46:52 UTC

virsh # qemu-monitor-command qe-dell-ovs5-vm-12 --hmp info status
VM status: paused (internal-error)

virsh # qemu-monitor-command qe-dell-ovs5-vm-12 --hmp info registers
EAX=0000008a EBX=ffd18f73 ECX=0000008a EDX=00000000
ESI=00000003 EDI=7ffb13c8 EBP=00000066 ESP=00006e98
EIP=7ffbcf88 EFL=00010046 [---Z-P-] CPL=0 II=0 A20=1 SMM=0 HLT=0
ES =0010 00000000 ffffffff 00c09300 DPL=0 DS   [-WA]
CS =0008 00000000 ffffffff 00c09b00 DPL=0 CS32 [-RA]
SS =0010 00000000 ffffffff 00c09300 DPL=0 DS   [-WA]
DS =0010 00000000 ffffffff 00c09300 DPL=0 DS   [-WA]
FS =0010 00000000 ffffffff 00c09300 DPL=0 DS   [-WA]
GS =0010 00000000 ffffffff 00c09300 DPL=0 DS   [-WA]
LDT=0000 00000000 0000ffff 00008200 DPL=0 LDT
TR =0000 00000000 0000ffff 00008b00 DPL=0 TSS32-busy
GDT=     000f7cb0 00000037
IDT=     000f7cee 00000000
CR0=00000011 CR2=00000000 CR3=00000000 CR4=00000000
DR0=0000000000000000 DR1=0000000000000000 DR2=0000000000000000 DR3=0000000000000000 
DR6=00000000ffff0ff0 DR7=0000000000000400
EFER=0000000000000000
FCW=037f FSW=0000 [ST=0] FTW=00 MXCSR=00001f80
FPR0=0000000000000000 0000 FPR1=0000000000000000 0000
FPR2=0000000000000000 0000 FPR3=0000000000000000 0000
FPR4=0000000000000000 0000 FPR5=0000000000000000 0000
FPR6=0000000000000000 0000 FPR7=0000000000000000 0000
XMM00=00000000000000000000000000000000 XMM01=00000000000000000000000000000000
XMM02=00000000000000000000000000000000 XMM03=00000000000000000000000000000000
XMM04=00000000000000000000000000000000 XMM05=00000000000000000000000000000000
XMM06=00000000000000000000000000000000 XMM07=00000000000000000000000000000000

virsh # qemu-monitor-command qe-dell-ovs5-vm-12 --hmp 'x /i $pc'
0x000000007ffbcf88:  enter  $0xe7c6,$0x71

virsh # qemu-monitor-command qe-dell-ovs5-vm-12 --hmp 'x /64xb $pc-48'
000000007ffbcf58: 0x00 0x00 0x00 0x88 0xd9 0xd3 0xe0 0xb9
000000007ffbcf60: 0xe8 0x03 0x00 0x00 0x31 0xd2 0xf7 0xf1
000000007ffbcf68: 0x89 0x44 0x24 0x04 0xc7 0x04 0x24 0x77
000000007ffbcf70: 0x70 0x0f 0x00 0xe8 0xd7 0x68 0x13 0x80
000000007ffbcf78: 0xb0 0x34 0xe6 0x43 0x31 0xc0 0xe6 0x40
000000007ffbcf80: 0xe6 0x40 0xb1 0x8a 0x88 0xc8 0xe6 0x70
000000007ffbcf88: 0xc8 0xc6 0xe7 0x71 0xb0 0x8b 0xe6 0x70
000000007ffbcf90: 0xe4 0x71 0x83 0xe0 0x01 0x83 0xc8 0x02

Comment 4 Tomas Pelka 2016-10-20 07:48:37 UTC

One more thing CPU is SandyBridge one, I tried to set it manually to Nahalem to if that helps. Following outputs are with Nehalem set.

Comment 5 Tomas Pelka 2016-10-20 07:51:54 UTC

(In reply to Radim Krčmář from comment #2)
> (In reply to Tomas Pelka from comment #1)
> > Created attachment 1212078 [details]
> > One other symptom, besides on auto pausing (only way how to resume from
> > pause state is force reset) machines with mantion trace in c0,
> 
> Btw. the trace doesn't have a line that starts with code "Code=" and follows
> with a list of bytes at the bottom?
> Something like "Code=00 00 [...] 12 <34> 56 [...] ff ff"

I can't see that line in /var/log/libvirt/qemu/domain.log, where can I find it?

Comment 6 Tomas Pelka 2016-10-20 07:56:04 UTC

Also adding qemu-monitor-command outputs for running machine but stuck on reboot eating 100% CPU

virsh # qemu-monitor-command qe-dell-ovs5-vm-18 --hmp info status
VM status: running

virsh # qemu-monitor-command qe-dell-ovs5-vm-18 --hmp info registers
EAX=00000000 EBX=ffd18f73 ECX=0000008a EDX=00000000
ESI=00000003 EDI=7ffb13c8 EBP=00000066 ESP=00000003
EIP=7ffbcf16 EFL=00010046 [---Z-P-] CPL=0 II=0 A20=1 SMM=0 HLT=0
ES =0010 00000000 ffffffff 00c09300 DPL=0 DS   [-WA]
CS =0008 00000000 ffffffff 00c09b00 DPL=0 CS32 [-RA]
SS =0010 00000000 ffffffff 00c09300 DPL=0 DS   [-WA]
DS =0010 00000000 ffffffff 00c09300 DPL=0 DS   [-WA]
FS =0010 00000000 ffffffff 00c09300 DPL=0 DS   [-WA]
GS =0010 00000000 ffffffff 00c09300 DPL=0 DS   [-WA]
LDT=0000 00000000 0000ffff 00008200 DPL=0 LDT
TR =0000 00000000 0000ffff 00008b00 DPL=0 TSS32-busy
GDT=     000f7cb0 00000037
IDT=     000f7cee 00000000
CR0=00000011 CR2=00000000 CR3=00000000 CR4=00000000
DR0=0000000000000000 DR1=0000000000000000 DR2=0000000000000000 DR3=0000000000000000 
DR6=00000000ffff0ff0 DR7=0000000000000400
EFER=0000000000000000
FCW=037f FSW=0000 [ST=0] FTW=00 MXCSR=00001f80
FPR0=0000000000000000 0000 FPR1=0000000000000000 0000
FPR2=0000000000000000 0000 FPR3=0000000000000000 0000
FPR4=0000000000000000 0000 FPR5=0000000000000000 0000
FPR6=0000000000000000 0000 FPR7=0000000000000000 0000
XMM00=00000000000000000000000000000000 XMM01=00000000000000000000000000000000
XMM02=00000000000000000000000000000000 XMM03=00000000000000000000000000000000
XMM04=00000000000000000000000000000000 XMM05=00000000000000000000000000000000
XMM06=00000000000000000000000000000000 XMM07=00000000000000000000000000000000

virsh # qemu-monitor-command qe-dell-ovs5-vm-18 --hmp 'x /i $pc'
0x000000007ffbcf16:  str    -0x7cfebd(%ebp)

virsh # qemu-monitor-command qe-dell-ovs5-vm-18 --hmp 'x /64xb $pc-48'
000000007ffbcee6: 0x2b 0x7c 0x24 0x44 0x1b 0x6c 0x24 0x48
000000007ffbceee: 0x69 0xdd 0x99 0x9e 0x36 0x00 0xb9 0x99
000000007ffbcef6: 0x9e 0x36 0x00 0x89 0xf8 0xf7 0xe1 0x01
000000007ffbcefe: 0xda 0x05 0xff 0x07 0x00 0x00 0x83 0xd2
000000007ffbcf06: 0x00 0x89 0xc6 0x89 0xd7 0x0f 0xac 0xfe
000000007ffbcf0e: 0x0b 0xc1 0xef 0x0b 0x8a 0x1d 0x6e 0x7b
000000007ffbcf16: 0x0f 0x00 0x8d 0x43 0x01 0x83 0xff 0x00
000000007ffbcf1e: 0x76 0x10 0x83 0xc6 0x01 0x83 0xd7 0x00

Comment 7 Radim Krčmář 2016-10-20 14:26:54 UTC

(In reply to Tomas Pelka from comment #3)
> virsh # qemu-monitor-command qe-dell-ovs5-vm-12 --hmp 'x /i $pc'
> 0x000000007ffbcf88:  enter  $0xe7c6,$0x71

This domain tried to execute an instruction that is emulated by KVM only if the second argument is 0.

KVM can be improved to accept all values, but I'm starting to have doubts about the system -- there is no reason to pass a number bigger than 31 as the second paramater, because the CPU does a modulo 32 on it.

Is the instruction from the original program? i.e. isn't there some memory corruption going on?  (and if not, what program is doing that?)

Are internal emulation errors from other guests also on stuck on this instruction?


(In reply to Tomas Pelka from comment #6)
> virsh # qemu-monitor-command qe-dell-ovs5-vm-18 --hmp 'x /i $pc'
> 0x000000007ffbcf16:  str    -0x7cfebd(%ebp)
> 
> virsh # qemu-monitor-command qe-dell-ovs5-vm-18 --hmp 'x /64xb $pc-48'
> 000000007ffbcee6: 0x2b 0x7c 0x24 0x44 0x1b 0x6c 0x24 0x48
> 000000007ffbceee: 0x69 0xdd 0x99 0x9e 0x36 0x00 0xb9 0x99
> 000000007ffbcef6: 0x9e 0x36 0x00 0x89 0xf8 0xf7 0xe1 0x01
> 000000007ffbcefe: 0xda 0x05 0xff 0x07 0x00 0x00 0x83 0xd2
> 000000007ffbcf06: 0x00 0x89 0xc6 0x89 0xd7 0x0f 0xac 0xfe
> 000000007ffbcf0e: 0x0b 0xc1 0xef 0x0b 0x8a 0x1d 0x6e 0x7b
> 000000007ffbcf16: 0x0f 0x00 0x8d 0x43 0x01 0x83 0xff 0x00
> 000000007ffbcf1e: 0x76 0x10 0x83 0xc6 0x01 0x83 0xd7 0x00

This also doesn't make much sense ...
"0f 00 8d 43 01 83 ff" is "str -0x7cfebd(%ebp)", but if the CPU was executing the code in the block above, then the instruction boundary would be elsewhere, likely forming "8a 1d 6e 7b 0f 00" and "8d 43 01", but never "0f 00 8d 43 01 83 ff" -- instruction length decoder is a very exposed part in KVM, so a bug there is less likely that corruption or a buggy jump.

Please upload trace.dat after using

  trace-cmd record -e kvm:\* -P $thread_id_of_the_vcpu

to see what the vcpu does while burning CPU.
$thread_id_of_the_vcpu can learned with

  virsh qemu-monitor-command $domain --hmp info cpus


(In reply to Tomas Pelka from comment #4)
> One more thing CPU is SandyBridge one, I tried to set it manually to Nahalem
> to if that helps. Following outputs are with Nehalem set.

So the host CPU is Sandy Bridge? (I assumed Nehalem, because it was in vm.xml.)

Please run this on the host, it will allow us to tell under which conditions is KVM expected to emulate instructions:
  grep . /sys/module/kvm{,_intel}/parameters/*


(In reply to Tomas Pelka from comment #5)
> (In reply to Radim Krčmář from comment #2)
> > (In reply to Tomas Pelka from comment #1)
> > > Created attachment 1212078 [details]
> > > One other symptom, besides on auto pausing (only way how to resume from
> > > pause state is force reset) machines with mantion trace in c0,
> > 
> > Btw. the trace doesn't have a line that starts with code "Code=" and follows
> > with a list of bytes at the bottom?
> > Something like "Code=00 00 [...] 12 <34> 56 [...] ff ff"
> 
> I can't see that line in /var/log/libvirt/qemu/domain.log, where can I find
> it?

If it's not right under EFER, then QEMU likely didn't print it ... not that important now, it just complicates debugging.

Comment 10 juzhang 2017-01-19 01:23:23 UTC

Hi Zhiyi,

Could you handle this needinfo?

Best Regards,
Junyi

Comment 11 Radim Krčmář 2017-03-14 13:24:53 UTC

Any luck with reproducing this on other machine?

Thanks.

Comment 20 Radim Krčmář 2017-06-06 14:41:41 UTC

The bug was caused by seabios memory corruption.

*** This bug has been marked as a duplicate of bug 1428347 ***

Note You need to log in before you can comment on or make changes to this bug.