Bug 1346270

Summary: Pool VM loses its disk during reinitialisation after shutdown.
Product: Red Hat Enterprise Virtualization Manager Reporter: Roman Hodain <rhodain>
Component: ovirt-engine Assignee: Arik <ahadas>
Status: CLOSED ERRATA QA Contact: sefi litmanovich <slitmano>
Severity: high Docs Contact:
Priority: high    
Version: 3.6.6 CC: ahadas, amureini, bgraveno, gklein, lsurette, michal.skrivanek, rbalakri, Rhev-m-bugs, sbonazzo, srevivo, ykaul
Target Milestone: ovirt-4.0.1 Keywords: ZStream
Target Release: 4.0.1   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
This update fixes a race condition between the automatic startup of prestarted virtual machines in a virtual machine pool and manual operations on virtual machines in that pool. The race caused virtual machines to lose their disks while being returned to the virtual machine pool.
Story Points: ---
Clone Of:
: 1346848 1348950 Environment:
Last Closed: 2016-08-23 20:42:36 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: Virt RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 1348950    

Description Roman Hodain 2016-06-14 12:11:04 UTC
Description of problem:
     When a pool VM is returned to the pool, it sometimes loses its disk and can no longer be prestarted because it has no disk.

Version-Release number of selected component (if applicable):
    rhevm-3.6.5.3-0.1.el6.noarch

How reproducible:
    Intermittent, as this is a race condition.

Steps to Reproduce:
    1. Return a pool VM to the pool at exactly the same time as the VMs are being automatically prestarted (every VmPoolMonitorIntervalInMinutes).

Actual results:
     The VM loses all disk volumes and is not able to start any more.

Expected results:
     The VM does not lose its volumes (DestroyImageVDSCommand is not triggered twice).
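
Illustration of the race (a minimal sketch, not oVirt engine code): the toy model below assumes that both the pool monitor and the manual return-to-pool path check for a stateless snapshot and then run a destroy-and-restore step. All names here (PoolVm, restore_unguarded, restore_guarded, run_two_callers) are hypothetical, and the per-VM lock is only one possible way to avoid the double destroy; it is not claimed to be the fix that was shipped.

```python
import threading


class PoolVm:
    """Toy stand-in for a stateless pool VM; not oVirt engine code."""

    def __init__(self, disks):
        self.disks = list(disks)       # volumes currently attached
        self.snapshot = list(disks)    # volumes to restore on return to pool
        self.destroy_calls = 0         # how many times the destroy step ran
        self.lock = threading.Lock()

    def _destroy_and_restore(self):
        # Stand-in for the destroy step (DestroyImageVDSCommand in the report).
        self.destroy_calls += 1
        self.disks = []
        # Restore whatever snapshot is left; a second run finds none.
        self.disks, self.snapshot = self.snapshot, []


def restore_unguarded(vm, both_checked):
    # Check and act are separate steps, so two callers can both pass the check.
    if vm.snapshot:
        both_checked.wait()            # force the bad interleaving for the demo
        with vm.lock:                  # serialises the toy mutation only
            vm._destroy_and_restore()


def restore_guarded(vm):
    # Check and act happen under one lock, so the second caller sees no snapshot.
    with vm.lock:
        if vm.snapshot:
            vm._destroy_and_restore()


def run_two_callers(target, *args):
    # One thread plays the pool monitor, the other the manual return to pool.
    threads = [threading.Thread(target=target, args=args) for _ in range(2)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()


broken = PoolVm(["disk-1"])
run_two_callers(restore_unguarded, broken, threading.Barrier(2))

fixed = PoolVm(["disk-1"])
run_two_callers(restore_guarded, fixed)

print("unguarded:", broken.disks, "destroy calls:", broken.destroy_calls)
print("guarded:  ", fixed.disks, "destroy calls:", fixed.destroy_calls)
```

In the unguarded run the destroy step executes twice and the VM ends up with no disks, which matches the actual results above. In the guarded run it executes once and the disk survives.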

Comment 9 sefi litmanovich 2016-08-01 13:58:02 UTC
Ran the whole VM pools test plan in automation several times. In addition, wrote another automated test with a VM pool in which all VMs are set as prestarted: a user allocates VMs and then detaches them, allocates and stops VMs, and mixes both, all for around 1 hour.
The bug did not reproduce at any point.

Verified with rhevm-4.0.2.3-0.1.el7ev.noarch; all tests passed.

Comment 11 errata-xmlrpc 2016-08-23 20:42:36 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://rhn.redhat.com/errata/RHEA-2016-1743.html