Note: This bug is displayed in read-only format because the product is no longer active in Red Hat Bugzilla.

Bug 1572148

Summary: Fencing takes too long when first agent is unreachable
Product: [oVirt] ovirt-engine Reporter: Petr Matyáš <pmatyas>
Component: BLL.InfraAssignee: Eli Mesika <emesika>
Status: CLOSED CURRENTRELEASE QA Contact: Pavol Brilla <pbrilla>
Severity: medium Docs Contact:
Priority: unspecified    
Version: 4.2.3.2CC: bugs, lsvaty, lveyde, mperina, talayan
Target Milestone: ovirt-4.2.4Keywords: Automation
Target Release: ---Flags: rule-engine: ovirt-4.2+
lsvaty: testing_ack+
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: ovirt-engine-4.2.4.1 Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2018-06-26 08:36:50 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: Infra RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Attachments:
Description Flags
engine log none

Description Petr Matyáš 2018-04-26 09:37:50 UTC
Created attachment 1427103 [details]
engine log

Description of problem:
When trying to fence a host with unreachable first agent (not concurrent) the fencing now takes 3 times as long as with the first agent reachable.
Before this took about the same as it tried to use the second agent right after 3 attempts between which were no timeouts, now there is a 3 minute timeout after failed fence action after which it tries the same agent again (4 times), thus fencing takes 20 minutes.

Version-Release number of selected component (if applicable):
ovirt-engine-4.2.3.2-0.1.el7.noarch

How reproducible:
always

Steps to Reproduce:
1. add fake fencing agent to a host
2. add second fencing agent to a host (this one should work)
3. execute fence action restart

Actual results:
host takes 20 minutes to be fenced

Expected results:
this action should not try the same fencing agent so often and with so large timeouts between retries

Additional info:

Comment 1 Pavol Brilla 2018-06-08 11:45:10 UTC
Verified on: ovirt-engine-4.2.4.1-0.1.el7.noarch

2nd fencing device used if 1st one was faulty

Comment 2 Sandro Bonazzola 2018-06-26 08:36:50 UTC
This bugzilla is included in oVirt 4.2.4 release, published on June 26th 2018.

Since the problem described in this bug report should be
resolved in oVirt 4.2.4 release, it has been closed with a resolution of CURRENT RELEASE.

If the solution does not work for you, please open a new bug report.