Bug 756915

Summary: qemu-kvm core dumps and disk corrupt during win7_64 installation
Product: Red Hat Enterprise Linux 6 Reporter: Xiaoqing Wei <xwei>
Component: qemu-kvmAssignee: Asias He <asias>
Status: CLOSED CURRENTRELEASE QA Contact: Virtualization Bugs <virt-bugs>
Severity: high Docs Contact:
Priority: high    
Version: 6.3CC: acathrow, areis, bsarathy, juzhang, michen, mkenneth, rhod, shuang, shu, tburke, virt-maint, vrozenfe
Target Milestone: rc   
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2012-04-20 02:44:11 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Attachments:
Description Flags
gdb bt full none

Description Xiaoqing Wei 2011-11-25 05:33:30 UTC
Description of problem:

qemu-kvm core dumps and disk corrupt during win7_64 installation
Version-Release number of selected component (if applicable):

qemu-kvm-0.12.1.2-2.210.el6.x86_64
How reproducible:

5 / 50
Steps to Reproduce:
1. start a win7_64 installation by:
/home/staf-kvm-devel/autotest-devel/client/tests/kvm/qemu -name 'vm1' -chardev
socket,id=qmp_monitor_id_qmpmonitor1,path=/tmp/monitor-qmpmonitor1-20111122-165744-STUN,server,nowait
-mon chardev=qmp_monitor_id_qmpmonitor1,mode=control \
-chardev
socket,id=serial_id_20111122-165744-STUN,path=/tmp/serial-N,server,nowait \
-device isa-serial,chardev=serial_id_20111122-165744-STUN \
-drive
file='win7-64.qcow2',index=0,if=none,id=drive-ide0-0-0,media=disk,cache=none,format=qcow2,aio=native
\
-device ide-drive,bus=ide.0,unit=0,drive=drive-ide0-0-0,id=ide0-0-0 \
-device
rtl8139,netdev=idvdg4cO,mac=9a:fe:9d:9b:b9:96,id=ndev00idvdg4cO,bus=pci.0,addr=0x3
\
-netdev tap,id=idvdg4cO,fd=21 \
-m 4G -smp 4,cores=2,threads=1,sockets=2 \
-drive
file='en_windows_7_ultimate_with_sp1_x64_dvd_618240.iso',index=1,if=none,id=drive-ide0-0-1,media=cdrom,readonly=on,format=raw
\
-device ide-drive,bus=ide.0,unit=1,drive=drive-ide0-0-1,id=ide0-0-1 \
-drive
file='winutils.iso',index=2,if=none,id=drive-ide0-1-0,media=cdrom,readonly=on,format=raw
\
-device ide-drive,bus=ide.1,unit=0,drive=drive-ide0-1-0,id=ide0-1-0 \
-drive
file='virtio-win.iso',index=3,if=none,id=drive-ide0-1-1,media=cdrom,readonly=on,format=raw
\
-device ide-drive,bus=ide.1,unit=1,drive=drive-ide0-1-1,id=ide0-1-1 \
-cpu cpu64-rhel6,+sse2,+x2apic \
-fda
'/home/staf-kvm-devel/autotest-devel/client/tests/kvm/images/win7-64/answer.vfd'
\
-spice port=8000,disable-ticketing -vga qxl \
-rtc base=localtime,clock=host,driftfix=slew \
-boot order=cdn,once=d,menu=off     -M rhel6.2.0 -usb -device usb-tablet
-enable-kvm 
2.
3.

Actual results:

1) qemu-kvm core dumps during installation.

2) qemu-img check image.qcow2
 22114 internal errors have occurred during the check.

Expected results:

installation finish, both guest and host works well.



Additional info:
   NOTE: bt_full is attached.
#0  0x0000000000496d6a in alloc_refcount_block (bs=0x2e1e010,
offset=4584051076082565690, length=<value optimized out>, addend=-1) at
block/qcow2-refcount.c:334
(gdb) #0  0x0000000000496d6a in alloc_refcount_block (bs=0x2e1e010,
offset=4584051076082565690, length=<value optimized out>, addend=-1) at
block/qcow2-refcount.c:334
#1  update_refcount (bs=0x2e1e010, offset=4584051076082565690, length=<value
optimized out>, addend=-1) at block/qcow2-refcount.c:459
#2  0x00000000004975e0 in qcow2_free_clusters (bs=0x2e1e010,
offset=4584051076082565690, size=65536) at block/qcow2-refcount.c:639
#3  0x0000000000498cee in qcow2_alloc_cluster_link_l2 (bs=0x2e1e010, m=<value
optimized out>) at block/qcow2-cluster.c:672
#4  0x0000000000493ea8 in qcow2_aio_write_cb (opaque=0x2ffff10, ret=0) at
block/qcow2.c:642
#5  0x0000000000485d6a in qemu_laio_process_completion (s=<value optimized
out>, laiocb=0x7f63e0000950) at linux-aio.c:68
#6  0x0000000000485f7f in qemu_laio_enqueue_completed (opaque=0x2e1be80) at
linux-aio.c:107
#7  qemu_laio_completion_cb (opaque=0x2e1be80) at linux-aio.c:144
#8  0x000000000040c46f in main_loop_wait (timeout=1000) at
/usr/src/debug/qemu-kvm-0.12.1.2/vl.c:4024
#9  0x000000000042af2a in kvm_main_loop () at
/usr/src/debug/qemu-kvm-0.12.1.2/qemu-kvm.c:2225
#10 0x000000000040deb5 in main_loop (argc=<value optimized out>, argv=<value
optimized out>, envp=<value optimized out>) at
/usr/src/debug/qemu-kvm-0.12.1.2/vl.c:4234
#11 main (argc=<value optimized out>, argv=<value optimized out>, envp=<value
optimized out>) at /usr/src/debug/qemu-kvm-0.12.1.2/vl.c:6470

Comment 1 Xiaoqing Wei 2011-11-25 05:34:15 UTC
Created attachment 536151 [details]
gdb bt full

Comment 6 Kevin Wolf 2011-12-07 16:35:07 UTC
If you think this is a regression, can you please test with some older versions and track down which version introduced the bug?

Comment 7 Xiaoqing Wei 2011-12-08 03:27:11 UTC
(In reply to comment #6)
> If you think this is a regression, can you please test with some older versions
> and track down which version introduced the bug?

Hi Kevin,

I am start do it now, will update bz then :)

Thanks and Best Regards,
Xiaoqing Wei.

Comment 9 Kevin Wolf 2012-02-17 14:29:12 UTC
Any new information on this?

Comment 10 Xiaoqing Wei 2012-02-19 09:18:39 UTC
(In reply to comment #9)
> Any new information on this?

Hi Kevin,

Not news on this bug, but there's a new bz794691.
it's qemu core dump and qcow2 corrupt when qemu being killed.

Best Regards,
Xiaoqing Wei.

Comment 11 Dor Laor 2012-03-07 11:01:23 UTC
(In reply to comment #10)
> (In reply to comment #9)
> > Any new information on this?
> 
> Hi Kevin,
> 
> Not news on this bug, but there's a new bz794691.
> it's qemu core dump and qcow2 corrupt when qemu being killed.
> 
> Best Regards,
> Xiaoqing Wei.

That's unrelated since no one killed qemu here.
QE should test the latest 6.3 qcow2 code w/ coroutines changes.

Comment 12 Xiaoqing Wei 2012-03-09 11:58:06 UTC
(In reply to comment #11)
> (In reply to comment #10)
> > (In reply to comment #9)
> > > Any new information on this?
> > 
> > Hi Kevin,
> > 
> > Not news on this bug, but there's a new bz794691.
> > it's qemu core dump and qcow2 corrupt when qemu being killed.
> > 
> > Best Regards,
> > Xiaoqing Wei.
> 
> That's unrelated since no one killed qemu here.
> QE should test the latest 6.3 qcow2 code w/ coroutines changes.

I am letting autotest to run 10 rounds, will paste reuslt when finish.

Comment 13 Xiaoqing Wei 2012-03-12 02:00:04 UTC
20 rounds installation[using HOST_MAX_MEM/SMP], without error.

https://virtlab.englab.nay.redhat.com/job/46201/details/

Comment 14 Xiaoqing Wei 2012-03-12 02:03:49 UTC
(In reply to comment #13)
> 20 rounds installation[using HOST_MAX_MEM/SMP], without error.
> 
> https://virtlab.englab.nay.redhat.com/job/46201/details/

using qemu-kvm-0.12.1.2-2.241.el6.x86_64

Comment 15 Xiaoqing Wei 2012-04-20 02:36:06 UTC
100 rounds of installation, unable to reproduce this bug:

https://virtlab.englab.nay.redhat.com/job/48372/details
versions used:
kernel-2.6.32-262.el6.x86_64
qemu-kvm-0.12.1.2-2.275.el6.x86_64
seabios-0.6.1.2-18.el6.x86_64
sgabios-0-0.3.20110621svn.el6.x86_64
spice-server-0.10.1-5.el6.x86_64
vgabios-0.6b-3.6.el6.noarch
virtio-win-1.4.0

Comment 16 Asias He 2012-04-20 02:44:11 UTC
According to Comment 15, I'm closing this bug since it is not reproducible in current release.