Bug 1368202

Summary: HA VMs are not restarted on different host if NonResponsive host is off and start action failed
Product: Red Hat Enterprise Virtualization Manager Reporter: Martin Tessun <mtessun>
Component: ovirt-engineAssignee: Martin Perina <mperina>
Status: CLOSED ERRATA QA Contact: Israel Pinto <ipinto>
Severity: urgent Docs Contact:
Priority: high    
Version: 4.0.0CC: aperotti, bugs, dmoessne, flo_bugzilla, jentrena, lsurette, mgoldboi, mkalinin, mperina, mtessun, pkubica, rbalakri, Rhev-m-bugs, srevivo, tspeetje, ykaul
Target Milestone: ovirt-4.0.4Keywords: ZStream
Target Release: ---   
Hardware: All   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: 1367557
: 1372812 (view as bug list) Environment:
Last Closed: 2016-09-28 22:16:09 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: Infra RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On: 1367557    
Bug Blocks: 1338732, 1372812    

Description Martin Tessun 2016-08-18 16:10:00 UTC
+++ This bug was initially created as a clone of Bug #1367557 +++

Description of problem:

HA VMs are not restarted on different host(s) if NonResponsive host is off (detected by successful power management status action) and start power management action failed

Version-Release number of selected component (if applicable):

3.0 and later

How reproducible:

100%

Steps to Reproduce:
1. Create 2 or more hosts cluster with power management properly configured
2. Run HA VMs on host1 and make this host non responsive by turning the power off
3. Make sure that power management status of host1 can be properly detected, but power management start action failed
4. Check that fence of host1 failed and HA VMs are not restarted on different hosts although we detect that host1 is turned off using power management

Actual results:

HA VMs are not restarted on different host even though host1 is powered off

Expected results:

HA VMs are restarted on different hosts

Additional info:

Comment 1 Martin Tessun 2016-08-18 16:11:20 UTC
We would also need this fix to be backported to RHEV 3.6 as well.

Comment 2 Martin Perina 2016-08-18 18:04:05 UTC
Moving to 4.0.4 to sync with upstream bug. Once it's merged to 4.0.z, we can clone this into 3.6.z

Comment 4 Martin Perina 2016-09-04 13:00:03 UTC
Moving to MODIFIED to sync with upstream bug

Comment 6 Israel Pinto 2016-09-11 13:14:20 UTC
Verify with:
rhevm-4.0.4.2-0.1.el7ev.noarch
Hosts with PM:
OS Version:RHEL - 7.2 - 9.el7
OS Description:Red Hat Enterprise Linux Server 7.2 (Maipo)
Kernel Version:3.10.0 - 327.28.3.el7.x86_64
KVM Version:2.3.0 - 31.el7_2.21
LIBVIRT Version:libvirt-1.2.17-13.el7_2.5
VDSM Version:vdsm-4.18.13-1.el7ev

Steps:
1. Create 2 hosts in cluster with power management properly configured
2. Run HA 2 VMs and 1 none HA VM  on host1 
3. Turn power off host_1 from power management and setup host not to power up(from remote PM iLO4 in this case)
4. Check HA VMs are started on host_2

Results:
HA VMs run on host_2 - PASS

Comment 9 errata-xmlrpc 2016-09-28 22:16:09 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://rhn.redhat.com/errata/RHSA-2016-1967.html