Bug 808745

Summary: VDS::handleNetworkException Server failed to respond,
Product: Red Hat Enterprise Linux 6
Reporter: Chao Yang <chayang>
Component: vdsm
Assignee: Dan Kenigsberg <danken>
Status: CLOSED NOTABUG
QA Contact: yeylon <yeylon>
Severity: unspecified
Docs Contact:
Priority: unspecified
Version: 6.2
CC: abaron, bazulay, chayang, iheim, juzhang, michen, shuang, srevivo, ykaul
Target Milestone: rc
Target Release: ---
Hardware: Unspecified
OS: Unspecified
Whiteboard:
Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of:
Environment:
Last Closed: 2012-04-18 19:39:05 UTC
Type: ---
Regression: ---
Mount Type: ---
Documentation: ---
CRM:
Verified Versions:
Category: ---
oVirt Team: ---
RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: ---
Target Upstream Version:
Attachments:
the related log under /var/log/jbossas/rhevm-slimmed (flags: none)

Description Chao Yang 2012-03-31 12:44:33 UTC
Description of problem:
I have 3 hosts running in the same cluster:
dhcp-72-66-intel,  SPM
intel-e5620-16-6.englab.nay.redhat.com
intel-e5620-16-5.englab.nay.redhat.com

dhcp-72-66-intel was the SPM. I found that all VMs running on intel-e5620-16-6.englab.nay.redhat.com had gone into Unknown status, and the Events tab showed "Error: Network error during communication with the Host." I logged in to the host, and a ping to the remote host was answered, so I am not sure why this error was reported. After a while, dhcp-72-66-intel and intel-e5620-16-6.englab.nay.redhat.com were both set to Up in the "Hosts" tab and everything seemed fine, except that:
1. The SPM role had moved to intel-e5620-16-6.englab.nay.redhat.com.
2. The VMs running on intel-e5620-16-6.englab.nay.redhat.com had been shut down, and I had to start them again.

Version-Release number of selected component (if applicable):
# uname -r;rpm -q qemu-kvm libvirt vdsm
2.6.32-220.el6.x86_64
qemu-kvm-0.12.1.2-2.236.el6.x86_64
libvirt-0.9.10-2.el6.x86_64
vdsm-4.9-112.7.el6_2.x86_64


How reproducible:
1/1 so far.

Steps to Reproduce:
1.
2.
3.
  
Actual results:


Expected results:


Additional info:

Comment 1 Chao Yang 2012-03-31 12:46:05 UTC
Created attachment 574202
the related log under /var/log/jbossas/rhevm-slimmed

Comment 3 Dan Kenigsberg 2012-04-18 19:39:05 UTC
What you describe fits a temporary loss of network connectivity to the SPM host. I do not see a shred of a vdsm bug. Please reopen if this reproduces, but try to understand when it happens.
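
A ping run after the event only shows that ICMP works at that moment; it cannot show whether the management connection dropped earlier. One way to narrow down when it happens is to log timestamped TCP probes from the machine running the engine and compare them with the Events tab. The following is only a minimal sketch, assuming Python 3 is available there and that vdsm listens on TCP port 54321 (its usual default; adjust if your setup differs). The function names are hypothetical and not part of vdsm or the engine.

```python
import socket
import time
from datetime import datetime, timezone

# Assumption: vdsm's management port; change it if the hosts are configured differently.
VDSM_PORT = 54321


def tcp_reachable(host, port=VDSM_PORT, timeout=3.0):
    """Return True if a TCP connection to host:port opens within `timeout` seconds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name-resolution failures.
        return False


def probe_line(host, port=VDSM_PORT, timeout=3.0):
    """Build one UTC-timestamped log line describing the current reachability of host:port."""
    stamp = datetime.now(timezone.utc).strftime("%Y-%m-%d %H:%M:%S UTC")
    state = "reachable" if tcp_reachable(host, port, timeout) else "UNREACHABLE"
    return f"{stamp} {host}:{port} {state}"


def monitor(hosts, port=VDSM_PORT, interval=10.0, rounds=None):
    """Print one probe line per host every `interval` seconds.

    With rounds=None the loop runs until interrupted (Ctrl-C).
    """
    done = 0
    while rounds is None or done < rounds:
        for host in hosts:
            print(probe_line(host, port), flush=True)
        done += 1
        if rounds is None or done < rounds:
            time.sleep(interval)
```

For example, calling `monitor(["intel-e5620-16-6.englab.nay.redhat.com", "intel-e5620-16-5.englab.nay.redhat.com"])` with its output redirected to a file gives a timeline in UTC that can be lined up against the timestamps of the "Network error during communication with the Host" events. A successful TCP connect only shows the port is open, not that vdsm answered a request, so treat the result as a hint rather than proof.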