Bug 537077
Summary: | error codes aren't always propagated up through the block layer (e.g. -ENOSPC) | |||
---|---|---|---|---|
Product: | Red Hat Enterprise Linux 5 | Reporter: | Eduardo Habkost <ehabkost> | |
Component: | kvm | Assignee: | Kevin Wolf <kwolf> | |
Status: | CLOSED ERRATA | QA Contact: | Virtualization Bugs <virt-bugs> | |
Severity: | urgent | Docs Contact: | ||
Priority: | urgent | |||
Version: | 5.5 | CC: | bkahn, bstein, ehabkost, fyang, jplans, lihuang, markmc, pm-eus, riek, rsibley, tburke, virt-maint, ykaul | |
Target Milestone: | rc | |||
Target Release: | --- | |||
Hardware: | All | |||
OS: | Linux | |||
Whiteboard: | ||||
Fixed In Version: | kvm-83-153.el5 | Doc Type: | Bug Fix | |
Doc Text: | Story Points: | --- | ||
Clone Of: | 520693 | |||
: | 537083 560623 (view as bug list) | Environment: | ||
Last Closed: | 2010-03-30 07:52:37 UTC | Type: | --- | |
Regression: | --- | Mount Type: | --- | |
Documentation: | --- | CRM: | ||
Verified Versions: | Category: | --- | ||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | ||
Cloudforms Team: | --- | Target Upstream Version: | ||
Embargoed: | ||||
Bug Depends On: | ||||
Bug Blocks: | 537083, 560623 |
Description
Eduardo Habkost
2009-11-12 12:09:20 UTC
Hi Eduardo, I am a tester in Beijing team. I fail to reproduce bug 537077 on kvm-83-137.el5. I think I should misunderstand your steps. Could you tell me the detailed steps and iometer's config you used. Following are the steps I used: 1. Start a VM with command: #qemu-kvm -drive file=/usr/kvm-autotest/client/tests/kvm/images/win2003-32-virtio.qcow2,if=virtio,boot=on -net nic,vlan=0,model=e1000,macaddr=00:AE:A9:13:4E:00 -net tap,vlan=0,ifname=e1000_0_6001,script=/usr/kvm-autotest/client/tests/kvm/scripts/qemu-ifup-switch,downscript=no -m 2048 -smp 1 -usbdevice tablet -rtc-td-hack -no-hpet -cpu qemu64,+sse2 -no-kvm-pit-reinjection -vnc :0 2. Login VM by vncviewer. 3. Install IOmeter got from http://sourceforge.net/projects/iometer/files/iometer-stable/2006-07-27/iometer-2006.07.27.win32.i386-setup.exe/download. 4. Run IOmeter with write only 2K, 4K, 8K, 16K, 32K, 64K transfer request on C: for almost one hour. But I did not see any error message in console and IOmeter. Could you help me verify the test steps and show me iometer config you used. Thanks very much! I didn't do any testing, I just filed this bug for the error found on bug #520693 comment 8. The problem was found by Bob Sibley. You'll probably need RHEV-M and VDSM to reproduce this bug, as it depends on getting -ENOSPC errors being reported during normal operation. Tou won't get any -ENOSPC errors if you use a pre-existing image. RHEV-M, on the other hand, uses on-demand resizing of LVM volumes, and depend on proper -ENOSPC error reporting. Hi all Thanks for your help! Here i just want to confirm with you about how to verify this bug? 1: I used to try this way on kvm-83-105.el5_4.13 and kvm-83-153.el5. 1. Greate a qcow2 image in LVM [root@localhost ~]# pvcreate /dev/sda1 Physical volume "/dev/sda1" successfully created [root@localhost ~]# vgcreate vgtest /dev/sda1 Volume group "vgtest" successfully created [root@localhost ~]# lvcreate -n lvtest -L 1G vgtest Logical volume "lvtest" created [root@localhost ~]# qemu-img create -f qcow2 /dev/vgtest/lvtest 5G Formatting '/dev/vgtest/lvtest', fmt=qcow2, size=5242880 kB 2. Start a VM with command: [root@localhost ~]# /usr/libexec/qemu-kvm -rtc-td-hack -no-hpet -usbdevice tablet -cpu qemu64,+sse2 -drive file=win2003-64-virtio.qcow2,if=virtio,boot=on,format=qcow2,cache=off,werror=stop -smp 2 -m 2G -vnc :1 -net nic,macaddr=20:20:20:11:12:56,model=virtio,vlan=0 -net tap,script=/etc/qemu-ifup,vlan=0 -monitor stdio -drive file=/dev/vgtest/lvtest,if=virtio,format=qcow2,cache=off,werror=stop 3. Run "notify all on" in monitor (qemu) notify all on 4. Using IOmeter Sequential Writes 32K transfer request size in guest. 5. In monitor i see following message: (qemu) # RTC: new time is UTC-28802 # RTC: new time is UTC-28802 # RTC: new time is UTC-28802 # RTC: new time is UTC-28802 # RTC: new time is UTC-28802 # RTC: new time is UTC-28802 # RTC: new time is UTC-28802 # VM is stopped due to disk write error: virtio1: No space left on device info status VM status: paused (qemu) c (qemu) # VM is stopped due to disk write error: virtio1: No space left on device So i fail to reproduce this bug on kvm-83-105.el5_4.13 and kvm-83-153.el5. Could i use this way to verify this bug? 2: Kevin Wolf used to refer to following in his email. What you could do to manually provoke an I/O error is to use an image on NFS, set a breakpoint in one of the functions that didn't return the right error code, stop NFS when the function is called and look if the right results is returned. Uli Obergfell has described this kind of reproducing such bugs in Bugzilla, you can refer to https://bugzilla.redhat.com/show_bug.cgi?id=531827#c17 for example. Could i follow this way to verify this bug? If yes, I just want to know how many function need i set breakpoint to check? How can we make sure all the error code have been correct for this bug? 3: We have a qcow2 auto test suit base on autotest, it contains some function test such as - prepare: only image_copy - raw_function: only qemuio_test - normal: only boot reboot linux_s4 mig2.load.dbench - function: only fillup_disk format_disk multi_disk.default ioquit hdparm - endurance: only autotest.bonnie autotest.parallel_dd autotest.dbench autotest.ctcs2 autotest.iozone autotest.disktest - end: only shutdown Could i run this suite on IDE/VIRTIO Windows/Linux qcow2 image to verify this bug? I think this way should be more reasonable than others. Any suggestion on how to verify this bug? Run qcow2 test suite with IDE/VIRTIO, Windows/Linux, qcow2 on kvm-83-153.el5. No KVM bug is found. Few case fail for script/testing environment issue, so i waived them in list. job link in our virtlab: https://virtlab.englab.nay.redhat.com/job/6505/details/ https://virtlab.englab.nay.redhat.com/job/6527/details/ https://virtlab.englab.nay.redhat.com/job/6506/details/ https://virtlab.englab.nay.redhat.com/job/6467/details/ https://virtlab.englab.nay.redhat.com/job/6407/details/ https://virtlab.englab.nay.redhat.com/job/6389/details/ An advisory has been issued which should help the problem described in this bug report. This report is therefore being closed with a resolution of ERRATA. For more information on therefore solution and/or where to find the updated files, please follow the link below. You may reopen this bug report if the solution does not work for you. http://rhn.redhat.com/errata/RHSA-2010-0271.html |