Bug 987446 - After enable concurrent option under host power management fencing begin failed
Summary: After enable concurrent option under host power management fencing begin failed
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Enterprise Virtualization Manager
Classification: Red Hat
Component: ovirt-engine
Version: 3.2.0
Hardware: Unspecified
OS: Linux
unspecified
unspecified
Target Milestone: ---
: 3.2.2
Assignee: Eli Mesika
QA Contact: Artyom
URL:
Whiteboard: infra
Depends On: 977689
Blocks:
TreeView+ depends on / blocked
 
Reported: 2013-07-23 11:52 UTC by Idith Tal-Kohen
Modified: 2016-02-10 19:24 UTC (History)
14 users (show)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Due to a race condition which attempted to use a null proxy host for fencing, the concurrent option for host power management could not be used. Now, the engine checks for a valid proxy when performing fencing operations, so the concurrent option can be used.
Clone Of: 977689
Environment:
Last Closed: 2013-08-13 13:09:10 UTC
oVirt Team: Infra
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)
vdsm + engine logs, local env (397.52 KB, application/x-compressed-tar)
2013-08-05 14:55 UTC, Barak Dagan
no flags Details
BZ is meregd only into sf19.1 (19.20 KB, image/png)
2013-08-08 09:11 UTC, Eli Mesika
no flags Details


Links
System ID Private Priority Status Summary Last Updated
Red Hat Product Errata RHBA-2013:1149 0 normal SHIPPED_LIVE rhevm bug fix update 2013-08-13 17:06:36 UTC
oVirt gerrit 17206 0 None None None Never

Comment 2 Barak Dagan 2013-08-05 14:55:48 UTC
Created attachment 782866 [details]
vdsm + engine logs, local env

Verification failed on sf19:

1) secure = no, nothing happens


Thread-1826::DEBUG::2013-08-05 17:36:41,579::API::1094::vds::(fenceNode) rc 0 in agent=fence_apc_snmp
ipaddr=10.35.23.24
login=talayan
option=status
passwd=XXXX
port=4
secure=no out Status: ON
 err Parse error: Ignoring unknown option 'secure=no'

Thread-1827::DEBUG::2013-08-05 17:36:41,584::API::1068::vds::(fenceNode) fenceNode(addr=10.35.23.24,port=,agent=apc_snmp,user=talayan,passwd=XXXX,action=status,secure=,options=port=4
secure=no)
Thread-1827::DEBUG::2013-08-05 17:36:41,957::API::1094::vds::(fenceNode) rc 0 in agent=fence_apc_snmp
ipaddr=10.35.23.24
login=talayan
option=status
passwd=XXXX
port=4
secure=no out Status: ON
 err Parse error: Ignoring unknown option 'secure=no'

2) secure = yes, icon change to rebooting for a few seconds - host is not rebooted

Thread-1916::DEBUG::2013-08-05 17:38:16,752::API::1094::vds::(fenceNode) rc 0 in agent=fence_apc_snmp
ipaddr=10.35.23.24
login=talayan
option=status
passwd=XXXX
port=4
secure=yes out Status: ON
 err Parse error: Ignoring unknown option 'secure=yes'

Thread-1917::DEBUG::2013-08-05 17:38:16,757::API::1068::vds::(fenceNode) fenceNode(addr=10.35.23.24,port=,agent=apc_snmp,user=talayan,passwd=XXXX,action=status,secure=,options=port=4
secure=yes)
Thread-1917::DEBUG::2013-08-05 17:38:17,098::API::1094::vds::(fenceNode) rc 0 in agent=fence_apc_snmp
ipaddr=10.35.23.24
login=talayan
option=status
passwd=XXXX
port=4
secure=yes out Status: ON
 err Parse error: Ignoring unknown option 'secure=yes'

Comment 4 Eli Mesika 2013-08-06 06:45:53 UTC
(In reply to Barak Dagan from comment #2)

Can you please check that you succeeded to push the TEST button and get the status OK
I don't understand why guessing the secure flag , you should know it and it totally dependant in the specific engine you are testing 
Please try to verify again doing the exect scenarion as in the bug description :

Add host, add primary and secondary agents of power management, check via test button if agents defined correct. Check concurrent option and try to restart host.

If TEST is not working, there is no point in going further, please fix that 
Also , you will probably have to test on a diffrent agent rather than APC due to 
BZ 982266

Please let me know if you think this BZ should be dependant on BZ 982266

Comment 5 Artyom 2013-08-06 08:03:01 UTC
(In reply to Eli Mesika from comment #4)
> (In reply to Barak Dagan from comment #2)
> 
> Can you please check that you succeeded to push the TEST button and get the
> status OK
> I don't understand why guessing the secure flag , you should know it and it
> totally dependant in the specific engine you are testing 
> Please try to verify again doing the exect scenarion as in the bug
> description :
> 
> Add host, add primary and secondary agents of power management, check via
> test button if agents defined correct. Check concurrent option and try to
> restart host.
> 
> If TEST is not working, there is no point in going further, please fix that 
> Also , you will probably have to test on a diffrent agent rather than APC
> due to 
> BZ 982266
> 
> Please let me know if you think this BZ should be dependant on BZ 982266
rhevm sf19
hosts: vdsm-4.10.2-24.0.el6ev
one of host with two power managements apc_snmp and ipmilan
click on test button show "Test Succeeded, on"
Trying fence host:
Receive messages: Failed to stop Host aqua-vds7.qa.lab.tlv.redhat.com, (User: admin@internal).
Failed to restart Host aqua-vds7.qa.lab.tlv.redhat.com, (User: admin@internal).
In engine still null where proxie:
Executing <Stop> Power Management command, Proxy Host:null, Agent:ipmilan, Target Host:aqua-vds7.qa.lab.tlv.redhat.com, Management IP:aqua-vds7-mgmt.qa.lab.tlv.redhat.com, User:root, Options:

Comment 6 Eli Mesika 2013-08-08 09:09:57 UTC
Please test with sf 19.1 , see attached image showing that this fix was not inclused in sf19 , only in sf19.1

Comment 7 Eli Mesika 2013-08-08 09:11:07 UTC
Created attachment 784264 [details]
BZ is meregd only into sf19.1

Comment 9 Barak Dagan 2013-08-08 14:01:23 UTC
Verified on SF 19.1

Comment 11 errata-xmlrpc 2013-08-13 13:09:10 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

http://rhn.redhat.com/errata/RHBA-2013-1149.html


Note You need to log in before you can comment on or make changes to this bug.