Bug 1148663 - After host reboot engine gets VDSNetworkException: Message timeout which can be caused by communication issues
Keywords:
Status: CLOSED DUPLICATE of bug 1148688
Alias: None
Product: Red Hat Enterprise Virtualization Manager
Classification: Red Hat
Component: ovirt-engine
Version: 3.5.0
Hardware: Unspecified
OS: Unspecified
Priority: unspecified
Severity: urgent
Target Milestone: ---
Target Release: 3.5.0
Assignee: Piotr Kliczewski
QA Contact: Pavel Stehlik
URL:
Whiteboard: infra
Depends On:
Blocks:
 
Reported: 2014-10-02 04:45 UTC by Gal Amado
Modified: 2016-02-10 19:00 UTC (History)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed: 2014-10-06 08:46:14 UTC
oVirt Team: Infra
Target Upstream Version:
Embargoed:



Description Gal Amado 2014-10-02 04:45:01 UTC
Description of problem:
Rebooting a host causes a VDSNetworkException on the engine.
As a result, the host stays in an erroneous "not responding" status indefinitely (tested for more than 15 hours) in the engine's admin GUI.

Version-Release number of selected component (if applicable):
Red Hat Enterprise Virtualization Manager Version: 3.5.0-0.13.beta.el6ev
vdsm: vdsm-4.16.5-2.el6ev.x86_64


How reproducible:
Happens all the time.

Steps to Reproduce:
Setup: an up-and-running engine with 1 host.
1. On the host, run "reboot".
2. Wait for the vdsm service to come up, plus a reasonable idle time for the status update (about 5 minutes).
3. Check the engine log; it should be clean.
4. Check the host status in the engine's admin GUI; it should be OK (green).
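
For automation that reboots hosts, the wait-then-check part of the steps above could be sketched roughly as follows. This is only an illustrative sketch: `wait_for_host_status`, the `get_status` callable, and the status strings "up" and "non_responsive" are hypothetical placeholders, not part of the engine or vdsm API; the 300-second default mirrors the "about 5 minutes" idle time from step 2.

```python
import time
from typing import Callable


def wait_for_host_status(
    get_status: Callable[[], str],
    expected: str = "up",          # hypothetical status string
    timeout: float = 300.0,        # roughly the 5 minutes from step 2
    interval: float = 10.0,
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> bool:
    """Poll get_status() until it returns `expected` or `timeout` seconds pass.

    get_status is supplied by the caller (for example, a wrapper around
    whatever query the test framework uses to read the host status).
    Returns True if the expected status was seen, False on timeout.
    """
    deadline = clock() + timeout
    while True:
        if get_status() == expected:
            return True
        if clock() >= deadline:
            return False
        sleep(interval)
```

With this bug present, such a check would be expected to return False after the timeout, since the host never leaves the not-responding state until the engine service is restarted.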

Actual results:
Exception in the engine's log:
VDSNetworkException: Message timeout which can be caused by communication issues
In the engine's admin GUI:
- Host status is red
- Repeated message in the engine's event log: "Host1 is not responding ..."
 
Expected results:
- Some time after the host is up and running, the host status should be OK in the admin GUI, with no exceptions in the engine's log.


Additional info:
- Restarting the engine service (with "service ovirt-engine restart") makes the host appear OK again.
This bug blocks automation tests that reboot the hosts.

Comment 1 Oved Ourfali 2014-10-05 11:11:44 UTC
Seems like a duplicate of Bug 1148688 (the description is different, but same symptoms).
Piotr - can you verify, and if so close it as duplicate?

Comment 2 Piotr Kliczewski 2014-10-06 08:46:14 UTC
Yes. This issue was already fixed for Bug 1148688.

*** This bug has been marked as a duplicate of bug 1148688 ***

