Bug 669777

Summary: libvirtd blocks on stale NFS handle
Product: Red Hat Enterprise Linux 6 Reporter: Bowe Strickland <bowe>
Component: libvirtAssignee: Michal Privoznik <mprivozn>
Status: CLOSED DUPLICATE QA Contact: Virtualization Bugs <virt-bugs>
Severity: medium Docs Contact:
Priority: low    
Version: 6.0CC: berrange, dallan, dyuan, eblake, jyang, kxiong, vbian, xen-maint
Target Milestone: rc   
Target Release: ---   
Hardware: x86_64   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2011-06-21 03:35:29 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Bowe Strickland 2011-01-14 18:24:14 UTC
Description of problem
----------------------------------

When managing images on NFS share, libvirtd blocks for long times before returning

"createLinux failed: virDomainCreateLinux() failed monitor socket did not show up.: No such file or directory"

and logging 

Jan 14 13:18:18 lal06 libvirtd: 13:18:18.013: error : qemuSetupDiskPathAllow:3551 : Unable to allow access for disk path /shared/storage/image_pool/835ba933-7679-467a-92df-d48545070c44: Stale NFS file handle

(amongst others).

I'm filing under advisement of danpb:


(12:40:56 PM) bowe: looks like NFS is getting flaky.. that would match the symptoms.
(12:41:32 PM) danpb: bowe: please file a BZ,  we should treat stale NFS handle as fatal
(12:41:57 PM) bowe: will do...


Version-Release number of selected component (if applicable):
------------------------------------------------------------------

[root@lal06 image_pool]# rpm -qa | grep -e libvirt -e kvm -e qemu -e kernel
libvirt-client-0.8.1-27.el6.x86_64
libvirt-0.8.1-27.el6.x86_64
kernel-2.6.32-71.el6.x86_64
kernel-firmware-2.6.32-71.el6.noarch
qemu-img-0.12.1.2-2.113.el6.x86_64
dracut-kernel-004-32.el6.noarch
abrt-addon-kerneloops-1.1.13-4.el6.x86_64
gpxe-roms-qemu-0.9.7-6.3.el6.noarch
qemu-kvm-0.12.1.2-2.113.el6.x86_64

[root@lal06 image_pool]# uname -a
Linux lal06.gls.prod.phx2.redhat.com 2.6.32-71.el6.x86_64 #1 SMP Wed Sep 1 01:33:01 EDT 2010 x86_64 x86_64 x86_64 GNU/Linux

How reproducible:
Only presents under load

Steps to Reproduce:
------------------------------------------------------------------
1. get 20 students in a virtual classroom
2. have them each fire up their guests
3. wait 2 hours ;).

Comment 2 RHEL Program Management 2011-04-04 01:43:45 UTC
Since RHEL 6.1 External Beta has begun, and this bug remains
unresolved, it has been rejected as it is not proposed as
exception or blocker.

Red Hat invites you to ask your support representative to
propose this request, if appropriate and relevant, in the
next release of Red Hat Enterprise Linux.

Comment 3 Michal Privoznik 2011-06-03 13:48:25 UTC
Code inspection shows we check for return value of every function on chain of function calls from that error till virDomainCreateXML(). So logs would be really helpful here. Can you please provide them?

Comment 4 Bowe Strickland 2011-06-03 18:19:04 UTC
unfortunately in the meantime we've returned to RHEL5 for our hosts to avoid the issue, so would have a hard time reproducing. I'll keep this in mind if we see it again at somepoint RHEL6...

Comment 5 Dave Allan 2011-06-17 18:07:25 UTC
*** Bug 694422 has been marked as a duplicate of this bug. ***

Comment 6 Dave Allan 2011-06-21 03:35:29 UTC

*** This bug has been marked as a duplicate of bug 692663 ***