Bugzilla will be upgraded to version 5.0. The upgrade date is tentatively scheduled for 2 December 2018, pending final testing and feedback.
Bug 1044089 - Allow manual fence in connecting state
Allow manual fence in connecting state
Status: CLOSED ERRATA
Product: Red Hat Enterprise Virtualization Manager
Classification: Red Hat
Component: ovirt-engine (Show other bugs)
3.2.0
x86_64 Linux
high Severity high
: ---
: 3.4.0
Assigned To: Eli Mesika
Tareq Alayan
infra
: Reopened, ZStream
Depends On:
Blocks: 1044088 1066400 rhev3.4beta 1142926
  Show dependency treegraph
 
Reported: 2013-12-17 14:15 EST by Lee Yarwood
Modified: 2016-02-10 14:30 EST (History)
15 users (show)

See Also:
Fixed In Version: ovirt-3.4.0-beta3
Doc Type: Bug Fix
Doc Text:
Previously, a full host power outage resulted in a nine minute reconnection time before manual SPM relocation could be performed. Now, a host in connecting state can be manually fenced.
Story Points: ---
Clone Of:
: 1066400 (view as bug list)
Environment:
Last Closed: 2014-06-09 11:07:39 EDT
Type: Bug
Regression: ---
Mount Type: ---
Documentation: ---
CRM:
Verified Versions:
Category: ---
oVirt Team: Infra
RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: ---


Attachments (Terms of Use)


External Trackers
Tracker ID Priority Status Summary Last Updated
oVirt gerrit 24241 None None None Never
oVirt gerrit 24528 None None None Never
Red Hat Product Errata RHSA-2014:0506 normal SHIPPED_LIVE Moderate: Red Hat Enterprise Virtualization Manager 3.4.0 update 2014-06-09 14:55:38 EDT

  None (edit)
Description Lee Yarwood 2013-12-17 14:15:58 EST
Description of problem:
In the event of a full host power outage (including fence devices) a user must wait 9 mins (3 x 3 minute timeouts) until they can manually fence a host to relocate the SPM.

Version-Release number of selected component (if applicable):
rhevm-3.2.3-0.43.el6ev.noarch

How reproducible:
Always

Steps to Reproduce:
1. Remove all power to an active SPM, including any fence agents that are configured.
2. Attempt to manual fence the SPM to relocate the role.

Actual results:
The role can only be relocated once the host has moved from a state of 'connecting'.

Expected results:
The role can be relocated while the host is still marked as 'connecting' if the user confirms the host is down.

Additional info:
Comment 3 Itamar Heim 2013-12-17 16:44:19 EST
lee - is this still true with bug 863211 fixed for 3.3?
ayal - why do we need to wait for fencing with spm being based on sanlock?
Comment 4 Ayal Baron 2013-12-17 16:48:35 EST
(In reply to Itamar Heim from comment #3)
> lee - is this still true with bug 863211 fixed for 3.3?
> ayal - why do we need to wait for fencing with spm being based on sanlock?

I don't see what sanlock has to do with it.  It should be just as safe with the old locking mechanism.

I have no idea why we wait
Comment 5 Lee Yarwood 2013-12-17 18:13:04 EST
(In reply to Itamar Heim from comment #3)
> lee - is this still true with bug 863211 fixed for 3.3?

Thanks Itamar, that looks promising but I'll need to verify. Setting needinfo as a reminder for the morning.
Comment 6 Lee Yarwood 2014-01-07 11:47:23 EST
(In reply to Lee Yarwood from comment #5)
> (In reply to Itamar Heim from comment #3)
> > lee - is this still true with bug 863211 fixed for 3.3?
> 
> Thanks Itamar, that looks promising but I'll need to verify. Setting
> needinfo as a reminder for the morning.

Testing this shows a drastically reduced time for the SPM to failover in the event of a complete power outage. I'm going to close this out as a dup of 863211.

Thanks,

Lee

*** This bug has been marked as a duplicate of bug 863211 ***
Comment 8 Eli Mesika 2014-02-09 08:00:58 EST
Changing the BZ title according to comment 7 and re-assigning the BZ 
We will support manual fence  in connecting state
Comment 9 Eli Mesika 2014-02-09 08:01:48 EST
Arthur please approve
Comment 10 Arthur Berezin 2014-02-09 08:46:54 EST
If this doesn't break any existing flows - ACK
Comment 13 Sandro Bonazzola 2014-02-19 07:27:55 EST
This bug is referenced in ovirt-engine-3.4.0-beta3 logs. Moving to ON_QA
Comment 15 Tareq Alayan 2014-02-20 07:23:27 EST
tested on ovirt-engine-3.4.0-0.11.beta3.el6.noarch


1. Put host in connecting state by iptables -D INPUT -p tcp --dport 54321 -j ACCEPT
2. Host have unreachable PM
3. Host state is now connecting and there is attempts to check host status 
4. /etc/init.d/iptables restart
5. right-click and confirm host is rebooted
Result host came up immediately 


Can i move this to verify?
Comment 16 Tareq Alayan 2014-02-20 07:24:14 EST
see comment 15
Comment 17 Eli Mesika 2014-02-20 07:27:56 EST
(In reply to Tareq Alayan from comment #15)
> tested on ovirt-engine-3.4.0-0.11.beta3.el6.noarch
> 
> 
> 1. Put host in connecting state by iptables -D INPUT -p tcp --dport 54321 -j
> ACCEPT
> 2. Host have unreachable PM
> 3. Host state is now connecting and there is attempts to check host status 
> 4. /etc/init.d/iptables restart
> 5. right-click and confirm host is rebooted
> Result host came up immediately 
> 
> 
> Can i move this to verify?

Yes
Comment 18 Tareq Alayan 2014-02-20 07:29:00 EST
verified per comment 17
Comment 19 errata-xmlrpc 2014-06-09 11:07:39 EDT
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

http://rhn.redhat.com/errata/RHSA-2014-0506.html

Note You need to log in before you can comment on or make changes to this bug.