Description of problem: The HA VMs after a network failure is started immediately after fencing the host. While the other host tries to acquire the lease, the lease will be still valid from the host which was just fenced as it's not expired yet. So acquiring the lease will fail with error "Lease is held by another host". Then it will add the VM in re-run list and will try to start it in other hosts as well but this will also fail since it also happens when the old host lease is still valid. If there are less number of hosts in the cluster, this will prevent the startup of HA VMs. Version-Release number of selected component (if applicable): rhvm-4.2.6.4-0.1.el7ev.noarch How reproducible: Reproducible in the customer environment. Steps to Reproduce: 1. Disconnect the network between manager and the host with HA VMs having storage domain lease. 2. Check if the HA VMs are getting started automatically. Actual results: HA VMs are started too early before the lease expire time and fails with error "Lease is held by another host" Expected results: The HA VMs should start fine. Additional info:
sync2jira
should be solved by bug 1325468
Please retest. The changes in https://bugzilla.redhat.com/show_bug.cgi?id=912723 are likely to resolve, as there is not longer a limited number of retries
WARN: Bug status (ON_QA) wasn't changed but the folowing should be fixed: [Found non-acked flags: '{}', ] For more info please contact: rhv-devops: Bug status (ON_QA) wasn't changed but the folowing should be fixed: [Found non-acked flags: '{}', ] For more info please contact: rhv-devops
verified on http://bob-dr.lab.eng.brq.redhat.com/builds/4.4/rhv-4.4.0-18 relevant test cases are https://polarion.engineering.redhat.com/polarion/redirect/project/RHEVM3/workitem?id=RHEVM-26910 https://polarion.engineering.redhat.com/polarion/redirect/project/RHEVM3/workitem?id=RHEVM-26923 https://polarion.engineering.redhat.com/polarion/redirect/project/RHEVM3/workitem?id=RHEVM-26924 https://polarion.engineering.redhat.com/polarion/redirect/project/RHEVM3/workitem?id=RHEVM-26925 https://polarion.engineering.redhat.com/polarion/redirect/project/RHEVM3/workitem?id=RHEVM-26926 https://polarion.engineering.redhat.com/polarion/redirect/project/RHEVM3/workitem?id=RHEVM-26953 https://polarion.engineering.redhat.com/polarion/redirect/project/RHEVM3/workitem?id=RHEVM-26957
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (RHV RHEL Host (ovirt-host) 4.4.z [ovirt-4.4.2]), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2020:3822