Red Hat Bugzilla – Bug 838206
oVirt manually shuting down the spm host will take the entire data center down.
Last modified: 2016-02-10 11:38:01 EST
Description of problem:
I came across this by accendent I shutdown one of my host by mistake. It turns out it was also the SPM host and it took down the entire network. I was able to recover the datacenter / cluster but it was a lot of downtime. This could be an issue in a production network.
Version-Release number of selected component (if applicable):
oVirt Engine Version: 3.1.0-3.9.el6
I have done it 3 times in my test network.
Steps to Reproduce:
1. Build a 3 node network not sure if it makes a diff I am using glusterfs
2. manually shutdown the Current SPM node.
3. Watch as everything crashes.
The primarary data store goes down taking the entire data center down at the same time.
All the VM's move and the data store continues to runs in degraded state
Not sure if this an engine ore vdsm issue and what logs are needed. Since it is very repeatable were do you want me to do to help debug this very problematic issue.
type of data center (if posixfs/gluster, i assume duplicate of the can't elect spm bug)?
if not, logs...
It is NFS. Later today I will route out the logs and then force the event to happen again to generate logs.
Closing old bugs. If this issue is still relevant/important in current version, please re-open the bug.