Bug 1670339

Summary: HA VMs are started too early before the lease expire time and fails with error "Lease is held by another host"
Product: Red Hat Enterprise Virtualization Manager Reporter: nijin ashok <nashok>
Component: vdsmAssignee: Shmuel Melamud <smelamud>
Status: CLOSED ERRATA QA Contact: Polina <pagranat>
Severity: high Docs Contact:
Priority: high    
Version: 4.2.6CC: dfediuck, dwhitley, gscott, lsurette, michal.skrivanek, mjankula, mkalinin, mtessun, schandle, smelamud, srevivo, ycui
Target Milestone: ovirt-4.4.2Keywords: TestOnly, ZStream
Target Release: 4.4.2Flags: lsvaty: testing_plan_complete-
Hardware: All   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2020-09-23 16:16:06 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: Virt RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On: 912723, 1325468    
Bug Blocks:    

Description nijin ashok 2019-01-29 10:42:28 UTC
Description of problem:

The HA VMs after a network failure is started immediately after fencing the host. While the other host tries to acquire the lease, the lease will be still valid from the host which was just fenced as it's not expired yet. So acquiring the lease will fail with error "Lease is held by another host". Then it will add the VM in re-run list and will try to start it in other hosts as well but this will also fail since it also happens when the old host lease is still valid.  If there are less number of hosts in the cluster, this will prevent the startup of HA VMs.

Version-Release number of selected component (if applicable):

rhvm-4.2.6.4-0.1.el7ev.noarch

How reproducible:

Reproducible in the customer environment.

Steps to Reproduce:

1. Disconnect the network between manager and the host with HA VMs having storage domain lease.
2. Check if the HA VMs are getting started automatically.


Actual results:

HA VMs are started too early before the lease expire time and fails with error "Lease is held by another host"

Expected results:

The HA VMs should start fine.

Additional info:

Comment 6 Daniel Gur 2019-08-28 13:15:21 UTC
sync2jira

Comment 7 Daniel Gur 2019-08-28 13:20:23 UTC
sync2jira

Comment 9 Michal Skrivanek 2019-11-25 08:24:38 UTC
should be solved by bug 1325468

Comment 10 Ryan Barry 2019-12-06 13:54:28 UTC
Please retest. The changes in https://bugzilla.redhat.com/show_bug.cgi?id=912723 are likely to resolve, as there is not longer a limited number of retries

Comment 11 RHV bug bot 2019-12-13 13:17:38 UTC
WARN: Bug status (ON_QA) wasn't changed but the folowing should be fixed:

[Found non-acked flags: '{}', ]

For more info please contact: rhv-devops: Bug status (ON_QA) wasn't changed but the folowing should be fixed:

[Found non-acked flags: '{}', ]

For more info please contact: rhv-devops

Comment 12 RHV bug bot 2019-12-20 17:46:46 UTC
WARN: Bug status (ON_QA) wasn't changed but the folowing should be fixed:

[Found non-acked flags: '{}', ]

For more info please contact: rhv-devops: Bug status (ON_QA) wasn't changed but the folowing should be fixed:

[Found non-acked flags: '{}', ]

For more info please contact: rhv-devops

Comment 13 RHV bug bot 2020-01-08 14:50:16 UTC
WARN: Bug status (ON_QA) wasn't changed but the folowing should be fixed:

[Found non-acked flags: '{}', ]

For more info please contact: rhv-devops: Bug status (ON_QA) wasn't changed but the folowing should be fixed:

[Found non-acked flags: '{}', ]

For more info please contact: rhv-devops

Comment 14 RHV bug bot 2020-01-08 15:20:48 UTC
WARN: Bug status (ON_QA) wasn't changed but the folowing should be fixed:

[Found non-acked flags: '{}', ]

For more info please contact: rhv-devops: Bug status (ON_QA) wasn't changed but the folowing should be fixed:

[Found non-acked flags: '{}', ]

For more info please contact: rhv-devops

Comment 18 RHV bug bot 2020-01-24 19:51:59 UTC
WARN: Bug status (ON_QA) wasn't changed but the folowing should be fixed:

[Found non-acked flags: '{}', ]

For more info please contact: rhv-devops: Bug status (ON_QA) wasn't changed but the folowing should be fixed:

[Found non-acked flags: '{}', ]

For more info please contact: rhv-devops

Comment 27 errata-xmlrpc 2020-09-23 16:16:06 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (RHV RHEL Host (ovirt-host) 4.4.z [ovirt-4.4.2]), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2020:3822