Note: This bug is displayed in read-only format because the product is no longer active in Red Hat Bugzilla.
RHEL Engineering is moving the tracking of its product development work on RHEL 6 through RHEL 9 to Red Hat Jira (issues.redhat.com). If you're a Red Hat customer, please continue to file support cases via the Red Hat customer portal. If you're not, please head to the "RHEL project" in Red Hat Jira and file new tickets here. Individual Bugzilla bugs in the statuses "NEW", "ASSIGNED", and "POST" are being migrated throughout September 2023. Bugs of Red Hat partners with an assigned Engineering Partner Manager (EPM) are migrated in late September as per pre-agreed dates. Bugs against components "kernel", "kernel-rt", and "kpatch" are only migrated if still in "NEW" or "ASSIGNED". If you cannot log in to RH Jira, please consult article #7032570. That failing, please send an e-mail to the RH Jira admins at rh-issues@redhat.com to troubleshoot your issue as a user management inquiry. The email creates a ServiceNow ticket with Red Hat. Individual Bugzilla bugs that are migrated will be moved to status "CLOSED", resolution "MIGRATED", and set with "MigratedToJIRA" in "Keywords". The link to the successor Jira issue will be found under "Links", have a little "two-footprint" icon next to it, and direct you to the "RHEL project" in Red Hat Jira (issue links are of type "https://issues.redhat.com/browse/RHEL-XXXX", where "X" is a digit). This same link will be available in a blue banner at the top of the page informing you that that bug has been migrated.

Bug 896358

Summary: qemu fail to general EIO and guest kernel call trace if enable data-plane when disconnect the NFS server
Product: Red Hat Enterprise Linux 6 Reporter: Sibiao Luo <sluo>
Component: qemu-kvmAssignee: Virtualization Maintenance <virt-maint>
Status: CLOSED DEFERRED QA Contact: Virtualization Bugs <virt-bugs>
Severity: medium Docs Contact:
Priority: high    
Version: 6.4CC: acathrow, bsarathy, chayang, dyasny, juzhang, kwolf, michen, mkenneth, pbonzini, qzhang, shu, sluo, stefanha, virt-maint
Target Milestone: rc   
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of:
: 896504 (view as bug list) Environment:
Last Closed: 2013-01-17 13:13:13 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 896504    
Attachments:
Description Flags
log of guest kernel call trace. none

Description Sibiao Luo 2013-01-17 06:03:36 UTC
Description of problem:
Configure the NFS and mount it, boot a guest with a data disk enable data-plane located on NFS storage, disconnect the NFS server during some I/O to the data disk in guest, then resume the paused guest by running "cont" in monitor after reconnect NFS server. But qemu fail to general EIO and guest kernel call trace if enable data-plane when disconnect the NFS server. btw, this is a regression issue.

Version-Release number of selected component (if applicable):
host info:
# uname -r && rpm -q qemu-kvm
2.6.32-355.el6.x86_64
qemu-kvm-0.12.1.2-2.352.el6.x86_64

How reproducible:
100%

Steps to Reproduce:
1.configure the NFS and mount it.
# cat /etc/exports 
/home *(rw,no_root_squash,sync)
# mount -o soft,timeo=15,retrans=3,nosharecache $nfs_server_ip:/home/ /mnt/
2.boot a guest with a data disk(werror=stop,rerror=stop) located on NFS storage.
e.g:...-drive file=/mnt/my-data-disk.raw,if=none,id=drive-virtio-disk,format=raw,cache=none,aio=native,werror=stop,rerror=stop -device virtio-blk-pci,serial="QEMU-DISK2",bus=pci.0,addr=0x9,scsi=off,x-data-plane=on,drive=drive-virtio-disk,id=virtio-disk,serial="QEMU-DISK2"
3.disconnect the NFS server during some I/O to the data disk in guest.
# service nfs stop
4.reconnect NFS server.
# service nfs restart
5.resume the paused guest by running "cont" in monitor.
  
Actual results:
after step 3, qemu fail to general EIO and guest kernel output many 'I/O error' and call trace.

Expected results:
qemu general EIO successfully, the VM was in paused status without any problem, and then guest can be resumed successfully after the nfs re-connect.

Additional info:

Comment 1 Sibiao Luo 2013-01-17 06:04:36 UTC
Created attachment 680015 [details]
log of guest kernel call trace.

Comment 2 Sibiao Luo 2013-01-17 06:06:27 UTC
since this issue was regression, so add Regression keywords.

Comment 3 Sibiao Luo 2013-01-17 06:09:21 UTC
My gest info:
kernel-2.6.32-355.el6.x86_64

The qemu fail to general EIO and guest kernel call trace if enable data-plane when disconnect the NFS server, and the VM tatus is still running, like:
(qemu) info status 
VM status: running
(qemu) info status 
VM status: running

The expected result like:
(qemu) block I/O error in device 'drive-virtio-disk': Input/output error (5)
block I/O error in device 'drive-virtio-disk': Input/output error (5)
block I/O error in device 'drive-virtio-disk': Input/output error (5)
block I/O error in device 'drive-virtio-disk': Input/output error (5)
...
(qemu) info status
VM status: paused (io-error)

Comment 5 Paolo Bonzini 2013-01-17 12:24:25 UTC
Not a regression since data-plane is not in 6.3.  But I'm leaving the blocker? anyway.

Comment 6 Stefan Hajnoczi 2013-01-17 13:01:07 UTC
(In reply to comment #3)
> My gest info:
> kernel-2.6.32-355.el6.x86_64
> 
> The qemu fail to general EIO and guest kernel call trace if enable
> data-plane when disconnect the NFS server, and the VM tatus is still
> running, like:
> (qemu) info status 
> VM status: running
> (qemu) info status 
> VM status: running
> 
> The expected result like:
> (qemu) block I/O error in device 'drive-virtio-disk': Input/output error (5)
> block I/O error in device 'drive-virtio-disk': Input/output error (5)
> block I/O error in device 'drive-virtio-disk': Input/output error (5)
> block I/O error in device 'drive-virtio-disk': Input/output error (5)
> ...
> (qemu) info status
> VM status: paused (io-error)

This is expected behavior because x-data-plane=on does not implement -drive werror=...,rerror=....  The only supported error policy is to report host I/O errors to the guest.  The guest is not paused when an I/O error occurs.

Other error handling policies will be supported by virtio-blk-data-plane in the future.  There is no immediate fix for this, it will be supported when hw/virtio-blk.c is refactored to run in the data-plane thread.