Description of problem: I came across this by accendent I shutdown one of my host by mistake. It turns out it was also the SPM host and it took down the entire network. I was able to recover the datacenter / cluster but it was a lot of downtime. This could be an issue in a production network. Version-Release number of selected component (if applicable): oVirt Engine Version: 3.1.0-3.9.el6 vdsm-cli: 4.10.0-0.58.gita6f4929.el6 How reproducible: I have done it 3 times in my test network. Steps to Reproduce: 1. Build a 3 node network not sure if it makes a diff I am using glusterfs 2. manually shutdown the Current SPM node. 3. Watch as everything crashes. Actual results: The primarary data store goes down taking the entire data center down at the same time. Expected results: All the VM's move and the data store continues to runs in degraded state Additional info: Not sure if this an engine ore vdsm issue and what logs are needed. Since it is very repeatable were do you want me to do to help debug this very problematic issue. Thanks Robert
type of data center (if posixfs/gluster, i assume duplicate of the can't elect spm bug)? if not, logs...
It is NFS. Later today I will route out the logs and then force the event to happen again to generate logs. Thanks Robert
Closing old bugs. If this issue is still relevant/important in current version, please re-open the bug.