
Bug 1085889

Summary: [power management] RESTART fails if two agents are configured concurrent and one is corrupt
Product: Red Hat Enterprise Virtualization Manager
Reporter: sefi litmanovich <slitmano>
Component: ovirt-engine
Assignee: Eli Mesika <emesika>
Status: CLOSED NOTABUG
Severity: unspecified
Priority: unspecified
Version: 3.3.0
CC: acathrow, gklein, iheim, lpeer, oourfali, Rhev-m-bugs, yeylon
Hardware: Unspecified
OS: Unspecified
Whiteboard: infra
Doc Type: Bug Fix
Last Closed: 2014-04-10 18:16:46 UTC
Type: Bug
oVirt Team: Infra
Attachments: engine.log (flags: none)

Description sefi litmanovich 2014-04-09 15:08:39 UTC
Created attachment 884544: engine.log

Description of problem:

If power management is configured with two agents in concurrent mode and one of them is configured with wrong values, the fence operation fails.

Version-Release number of selected component (if applicable):

3.3.2, is35.1

How reproducible:

always

Steps to Reproduce:
1. Set up an environment with two hosts in the same cluster, connected to storage.
2. Configure two power management agents for a host in concurrent mode: one with a working agent's values (address, username, password, type), the other (tried both as primary and as secondary) with corrupt values, e.g. a wrong address.
3. Try to restart the host with power management.

Actual results:

The host fails to restart.

Expected results:

The host should restart using the working PM agent.

Additional info:

Please see the flow in engine.log starting at 2014-04-09 17:58

Comment 1 Oved Ourfali 2014-04-10 18:16:46 UTC
As far as I understand, a restart operation consists of shutdown + start. Concurrent mode means that the shutdown operation should pass with both PM agents before succeeding in shutting the host down. However, only one is needed in order to start the host up.
So, in your case it will fail, as designed.
That's also what I understand from:
http://www.ovirt.org/Features/HostPMMultipleAgents

Closing.
If I'm wrong here then please reopen.
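
To illustrate the reasoning above, here is a minimal sketch of the concurrent-mode rule as described in this comment: stop must succeed on every agent, start needs only one. It is not the ovirt-engine implementation; the `Agent` class, the `reachable` flag, and all function names are hypothetical and exist only for this illustration.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Agent:
    """Hypothetical stand-in for one power management agent."""
    name: str
    reachable: bool  # False models an agent configured with wrong values

    def run(self, action: str) -> bool:
        # Assumption: a real agent would invoke a fence script here.
        # In this sketch, success depends only on whether the agent is reachable.
        return self.reachable


def concurrent_stop(agents: List[Agent]) -> bool:
    # Shutdown passes only if every agent reports success.
    return all([agent.run("stop") for agent in agents])


def concurrent_start(agents: List[Agent]) -> bool:
    # Start passes if at least one agent reports success.
    return any([agent.run("start") for agent in agents])


def restart(agents: List[Agent]) -> bool:
    # Restart = stop + start; a failed stop aborts the whole operation.
    if not concurrent_stop(agents):
        return False
    return concurrent_start(agents)


working = Agent("primary", reachable=True)
corrupt = Agent("secondary", reachable=False)

print(restart([working, corrupt]))  # False: stop cannot pass on the corrupt agent
print(concurrent_start([working, corrupt]))  # True: one working agent is enough to start
```

Under these assumptions, the scenario from the report (one working agent, one corrupt agent) fails at the stop step, which matches the "fails, as designed" outcome.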

Comment 2 Eli Mesika 2014-04-13 08:16:25 UTC
(In reply to Oved Ourfali from comment #1)
> As far as I understand, a restart operation consists of shutdown + start.
> Concurrent mode means that the shutdown operation should pass with both PM
> agents before succeeding in shutting the host down. However, only one is
> needed in order to start the host up.
> So, in your case it will fail, as designed.
> That's also what I understand from:
> http://www.ovirt.org/Features/HostPMMultipleAgents
> 
> Closing.
> If I'm wrong here then please reopen.

Approved, this is certainly working as designed. If you want, you can check the same scenario with two sequential PM cards, one of which is corrupt.