Bug 1013654

Summary: Fencing flow fails to get to secondary PM (primary is deliberately configured with wrong params)
Product: Red Hat Enterprise Virtualization Manager Reporter: sefi litmanovich <slitmano>
Component: ovirt-engineAssignee: Eli Mesika <emesika>
Status: CLOSED CURRENTRELEASE QA Contact: sefi litmanovich <slitmano>
Severity: unspecified Docs Contact:
Priority: unspecified    
Version: 3.3.0CC: acathrow, bazulay, eedri, emesika, iheim, lpeer, pstehlik, Rhev-m-bugs, yeylon
Target Milestone: ---Keywords: Triaged
Target Release: 3.3.0   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard: infra
Fixed In Version: is20.2 Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: Infra RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 982604    
Attachments:
Description Flags
engine log
none
new engine.log with proxy host problem none

Description sefi litmanovich 2013-09-30 14:20:51 UTC
Created attachment 805232 [details]
engine log

Description of problem:

Fencing flow fails to get to secondery PM (primary is deliberatly configured with wrong params) 

Version-Release number of selected component (if applicable):
rhevm 3.3 IS16

How reproducible:



Steps to Reproduce:
1. block network communication between engine and host to provoke fencing:
iptables -A OUTPUT -s {engine_IP} -d {host_IP} -j REJECT

Actual results:
host goes down, fails to reboot, soft fencing fails as well, fence is provoked, fails with the corrupt primary PM, then fails to start fence with secondery PM.
host remains non-responsive


Expected results:
host goes down, fails to reboot, soft fencing fails as well, fence is provoked, fails with the corrupt primary PM, then fence starts with the secondery PM and should restart host succefully.

Additional info:

relevant error from engine.log:

2013-09-30 16:21:47,900 ERROR [org.ovirt.engine.core.bll.StopVdsCommand] (pool-5-thread-47) Command org.ovirt.engine.core.bll.StopVdsCommand throw exception: java.lang.IllegalArgumentException: Unmatched braces

Comment 1 sefi litmanovich 2013-10-30 10:04:01 UTC
Bug still occures on is20 with the exact same exception.

Comment 2 Eli Mesika 2013-10-30 10:21:41 UTC
(In reply to sefi litmanovich from comment #1)
> Bug still occures on is20 with the exact same exception.

BZ fix is not a part of is20 , please check in the next comming build ...

Comment 5 sefi litmanovich 2013-11-04 10:19:21 UTC
Error with braces solved but now there is another error.
proxyhost parameter gets null during second PM fencing attempt:

2013-11-04 11:05:10,927 INFO  [org.ovirt.engine.core.bll.FenceExecutor] (pool-5-thread-48) [3e262576] Executing <Stop> Power Management command, Proxy Host:null, Agent:ipmilan, Target Host:rose07.qa.lab.tlv.redhat.com, Management IP:rose07-mgmt.qa.lab.tlv.redhat.com, User:root, Options:
2013-11-04 11:05:10,928 ERROR [org.ovirt.engine.core.vdsbroker.ResourceManager] (pool-5-thread-48) [3e262576] CreateCommand failed: java.lang.NullPointerException

Will attach the current engine.log

Comment 6 sefi litmanovich 2013-11-04 10:20:36 UTC
Created attachment 819045 [details]
new engine.log with proxy host problem

Comment 10 sefi litmanovich 2013-12-18 11:34:13 UTC
Verified on RHEVM3.3 IS27

Comment 11 Itamar Heim 2014-01-21 22:30:30 UTC
Closing - RHEV 3.3 Released

Comment 12 Itamar Heim 2014-01-21 22:30:32 UTC
Closing - RHEV 3.3 Released

Comment 13 Itamar Heim 2014-01-21 22:33:31 UTC
Closing - RHEV 3.3 Released