Bug 1092667

Summary: vdsm doesn't finish reconstruction when vdsm's connection to the master storage domain is blocked(single host in a cluster)
Product: Red Hat Enterprise Virtualization Manager Reporter: Ori Gofen <ogofen>
Component: vdsmAssignee: Liron Aravot <laravot>
Status: CLOSED DUPLICATE QA Contact: Aharon Canan <acanan>
Severity: high Docs Contact:
Priority: unspecified    
Version: 3.4.0CC: acanan, bazulay, gklein, iheim, laravot, lpeer, scohen, tnisan, yeylon
Target Milestone: ---   
Target Release: 3.4.0   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard: storage
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2014-05-01 15:05:17 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: Storage RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Attachments:
Description Flags
vdsm+engine logs none

Description Ori Gofen 2014-04-29 17:06:53 UTC
Created attachment 890868 [details]
vdsm+engine logs

Description of problem:

note:This bug appears only when a cluster has a single host.
When blocking the connection(via iptables) from vdsm to its master domain,vdsm doesn't finish the reconstruction and engine is stuck on an infinite loop of requests.

The message in ui dialogue box:

"Invalid status on Data Center $name. Setting status to Non Responsive"

Version-Release number of selected component (if applicable):

host:vdsm-4.14.7-0.1.beta3.el6ev.x86_64

engine:rhevm-cli-3.4.0.6-3.el6ev.noarch ;
rhevm-backend-3.4.0-0.15.beta3.el6ev.noarch

How reproducible:
100%
ran twice (first session 6 hours,second time 18:00:54 - 19:41:51 <-in logs)

Steps to Reproduce:
1.create a shared DC
2.add one host
3.create two storage domains (block and file)
4.block connectivity to the master domain

Actual results:
vdsm fail to reconstruct,engine is stuck

Expected results:
vdsm should succeed reconstructing,set his none blocked storage domain as master
and get spm status

Additional info:

Comment 2 Liron Aravot 2014-05-01 15:05:17 UTC
Reconstruct completes succesfully, the spm can't be started because of the issue solved in 1072900.

closing as duplicate.

*** This bug has been marked as a duplicate of bug 1072900 ***