Bug 1178655

Summary: qemu core dumped when do stop/cont VM after suspend/resume under netperf
Product: Red Hat Enterprise Linux 7 Reporter: Sibiao Luo <sluo>
Component: qemu-kvm-rhevAssignee: Marcelo Tosatti <mtosatti>
Status: CLOSED CURRENTRELEASE QA Contact: FuXiangChun <xfu>
Severity: medium Docs Contact:
Priority: medium    
Version: 7.1CC: amit.shah, chayang, juzhang, knoel, michen, mtosatti, pbonzini, sluo, virt-maint, xfu
Target Milestone: rc   
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2018-05-25 20:51:34 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 1158779, 1158780    
Attachments:
Description Flags
qemu-core-dumped-bt-log.txt none

Description Sibiao Luo 2015-01-05 09:21:03 UTC
Description of problem:
tried to run the autotest case virt.qemu.smp_2.8192m.repeat1.run_test.Host_RHEL.7.1.qcow2.virtio_blk.up.virtio_net.RHEL.7.1.x86_64.io-github-autotest-qemu.suspend_under_netperf.guest_s3, met qemu core dumped when do stop/cont VM after suspend/resume under netperf, both qemu-kvm and qemu-kvm-rhev can hit it.
BTW, if tried the s4 with the same testing(virt.qemu.smp_2.8192m.repeat1.run_test.Host_RHEL.7.1.qcow2.virtio_blk.up.virtio_net.RHEL.7.1.x86_64.io-github-autotest-qemu.suspend_under_netperf.guest_s4) which did not hit this core dumped.

Version-Release number of selected component (if applicable):
host info:
# uname -r && rpm -q qemu-kvm-rhev
3.10.0-217.el7.x86_64
qemu-kvm-rhev-2.1.2-17.el7.x86_64
guest info:
3.10.0-217.el7.x86_64

How reproducible:
100%

Steps to Reproduce:
1, Boot guest with command:
/usr/bin/qemu-kvm \
    -S  \
    -name 'virt-tests-vm1'  \
    -sandbox on  \
    -M pc  \
    -nodefaults  \
    -vga std  \
    -chardev socket,id=hmp_id_hmp1,path=/tmp/monitor-hmp1-20150105-112252-ymEGEBEh,server,nowait \
    -mon chardev=hmp_id_hmp1,mode=readline  \
    -chardev socket,id=serial_id_serial0,path=/tmp/serial-serial0-20150105-112252-ymEGEBEh,server,nowait \
    -device isa-serial,chardev=serial_id_serial0  \
    -chardev socket,id=seabioslog_id_20150105-112252-ymEGEBEh,path=/tmp/seabios-20150105-112252-ymEGEBEh,server,nowait \
    -device isa-debugcon,chardev=seabioslog_id_20150105-112252-ymEGEBEh,iobase=0x402 \
    -device ich9-usb-uhci1,id=usb1,bus=pci.0,addr=03 \
    -drive id=drive_image1,if=none,file=/usr/code/autotest/client/tests/virt/shared/data/images/rhel71-64.qcow2 \
    -device virtio-blk-pci,id=image1,drive=drive_image1,bootindex=0,bus=pci.0,addr=04 \
    -device virtio-net-pci,mac=9a:8b:8c:8d:8e:8f,id=id16Gr6X,vectors=4,netdev=idrmF6gb,bus=pci.0,addr=05  \
    -netdev tap,id=idrmF6gb,vhost=on,vhostfd=23,fd=22  \
    -m 1024  \
    -smp 2,maxcpus=2,cores=1,threads=1,sockets=2  \
    -cpu 'Nehalem' \
    -device usb-tablet,id=usb-tablet1,bus=usb1.0,port=1  \
    -vnc :0  \
    -rtc base=utc,clock=host,driftfix=none  \
    -boot order=cdn,once=c,menu=off,strict=off  \
    -global PIIX4_PM.disable_s3=0 \
    -enable-kvm

2, Sending monitor command 'cont'.

3, Stop iptables in guest and host.

4, Start netperf server in host.

5, Start 6 sessions netperf background with cmd: ' /var/tmp/netperf-2.6.0/src/netperf -H $host_ip -- -b 10 -D -t TCP_STREAM -l 60 ' 

6, Start suspend [echo mem > /sys/power/state]

7, Sleep a while before resuming guest

8, Sending monitor command 'system_wakeup'

9, Verify guest system log by command 'dmesg |grep -E "ACPI: Waking up from system sleep state S3"\|"ACPI: PCI interrupt for device .* disabled"'

10, Sending monitor command 'stop'

11, Check image info by qemu-img.

12, Sending monitor command 'cont'.

Actual results:
after step 12, qemu core dumped.
[qemu output] qemu-kvm: /builddir/build/BUILD/qemu-2.1.2/hw/i386/kvm/clock.c:69: kvmclock_current_nsec: Assertion `time.tsc_timestamp <= migration_tsc' failed.
[qemu output] /tmp/aexpect/jWFBxUUd/aexpect-OFsvMo.sh: line 1:  2118 Aborted                 (core dumped) /usr/bin/qemu-kvm -S -name 'virt-tests-vm1' -sandbox on -M pc -nodefaults -vga std -chardev socket,id=hmp_id_hmp1,path=/tmp/monitor-hmp1-20150105-112252-ymEGEBEh,server,nowait -mon chardev=hmp_id_hmp1,mode=readline -chardev socket,id=serial_id_serial0,path=/tmp/serial-serial0-20150105-112252-ymEGEBEh,server,nowait -device isa-serial,chardev=serial_id_serial0 -chardev socket,id=seabioslog_id_20150105-112252-ymEGEBEh,path=/tmp/seabios-20150105-112252-ymEGEBEh,server,nowait -device isa-debugcon,chardev=seabioslog_id_20150105-112252-ymEGEBEh,iobase=0x402 -device ich9-usb-uhci1,id=usb1,bus=pci.0,addr=03 -drive id=drive_image1,if=none,file=/usr/code/autotest/client/tests/virt/shared/data/images/rhel70-64.qcow2 -device virtio-blk-pci,id=image1,drive=drive_image1,bootindex=0,bus=pci.0,addr=04 -device virtio-net-pci,mac=9a:8b:8c:8d:8e:8f,id=id16Gr6X,vectors=4,netdev=idrmF6gb,bus=pci.0,addr=05 -netdev tap,id=idrmF6gb,vhost=on,vhostfd=23,fd=22 -m 1024 -smp 2,maxcpus=2,cores=1,threads=1,sockets=2 -cpu 'Nehalem' -device usb-tablet,id=usb-tablet1,bus=usb1.0,port=1 -vnc :0 -rtc base=utc,clock=host,driftfix=none -boot order=cdn,once=c,menu=off,strict=off -global PIIX4_PM.disable_s3=0 -enable-kvm
[qemu output] (Process terminated with status 134)

Expected results:
it should no any core dumped at all.

Additional info:
host cpu info,
# cat /proc/cpuinfo
...
processor	: 3
vendor_id	: GenuineIntel
cpu family	: 6
model		: 26
model name	: Intel(R) Xeon(R) CPU           X5550  @ 2.67GHz
stepping	: 5
microcode	: 0x19
cpu MHz		: 2661.000
cache size	: 8192 KB
physical id	: 0
siblings	: 4
core id		: 3
cpu cores	: 4
apicid		: 6
initial apicid	: 6
fpu		: yes
fpu_exception	: yes
cpuid level	: 11
wp		: yes
flags		: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx rdtscp lm constant_tsc arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc aperfmperf pni dtes64 monitor ds_cpl vmx est tm2 ssse3 cx16 xtpr pdcm dca sse4_1 sse4_2 popcnt lahf_lm ida dtherm tpr_shadow vnmi flexpriority ept vpid
bogomips	: 5333.52
clflush size	: 64
cache_alignment	: 64
address sizes	: 40 bits physical, 48 bits virtual
power management:

Comment 1 Sibiao Luo 2015-01-05 09:24:57 UTC
Created attachment 976269 [details]
qemu-core-dumped-bt-log.txt

Comment 3 Qian Guo 2015-01-05 09:52:11 UTC
Seams dupl from bug 1158780

Just FYI

Comment 4 Paolo Bonzini 2015-01-09 11:20:58 UTC
CCing Marcelo, who worked upstream on the patches that introduced the bug.

Comment 14 juzhang 2018-01-18 22:44:12 UTC
Hi Xiangchun,

Could you have a test and reply comment13?

Best Regards,
Junyi

Comment 15 FuXiangChun 2018-01-23 03:22:05 UTC
According to comment0, I re-tested this bug with the latest RHEL7.4.z.

qemu-kvm-rhev-2.9.0-16.el7_4.14.x86_64
3.10.0-693.17.1.el7.x86_64

I cann't reproduce this bug. I did twice.  Guest works well.