Bug 1585039

Summary: [downstream clone - 4.2.4] Live Storage Migration continued on after snapshot creation hung and timed out
Product: Red Hat Enterprise Virtualization Manager Reporter: RHV bug bot <rhv-bugzilla-bot>
Component: ovirt-engineAssignee: Benny Zlotnik <bzlotnik>
Status: CLOSED ERRATA QA Contact: Elad <ebenahar>
Severity: high Docs Contact:
Priority: medium    
Version: 4.1.6CC: bzlotnik, ebenahar, jcoscia, kgoldbla, kshukla, lsurette, lveyde, mkalinin, rabraham, ratamir, rbalakri, Rhev-m-bugs, srevivo, tnisan, ykaul, ylavi
Target Milestone: ovirt-4.2.4Keywords: ZStream
Target Release: ---   
Hardware: Unspecified   
OS: Linux   
Whiteboard:
Fixed In Version: ovirt-engine-4.2.4.1 Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: 1497355 Environment:
Last Closed: 2018-06-27 10:02:42 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: Storage RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On: 1497355    
Bug Blocks:    

Description RHV bug bot 2018-06-01 08:14:09 UTC
+++ This bug is a downstream clone. The original bug is: +++
+++   bug 1497355 +++
======================================================================

Description of problem:

The snapshot creation of a Live Storage Migration hung and timed out on the engine side. However, the engine then continued on with CloneImageGroupStructureVDSCommand and VmReplicateDiskStartVDSCommand, etc.

The result was that the LSM effectively failed, with the disk still residing in the source storage domain. However, volumes were created in the target storage domain, which caused a subsequent LSM to fail.


Version-Release number of selected component (if applicable):

RHV 4.1.6
RHVH 4.1.6
  vdsm-4.19.31-1.el7ev
  

How reproducible:

Not.

Steps to Reproduce:
1.
2.
3.

Actual results:


Expected results:


Additional info:

(Originally by Gordon Watson)

Comment 5 RHV bug bot 2018-06-01 08:14:33 UTC
Benny, can you take a look please?

Tentitively targetting for 4.2.
If there's something safe enough to backport to 4.1.z, we should do that, but I'm not commiting on such a fix unless we see what the upstream fix contains.

(Originally by amureini)

Comment 15 Elad 2018-06-19 13:36:58 UTC
Live storage migration is aborted in case of a snapshot creation failure:


2018-06-19 16:34:28,149+03 ERROR [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (EE-ManagedThreadFactory-engineScheduled-Thread-62) [] EVENT_ID: USER_CREATE_SNAPSHOT_FINISHED_FAILURE(69),
 Failed to complete snapshot 'test1_Disk1 Auto-generated for Live Storage Migration' creation for VM 'test1'.



2018-06-19 16:34:30,249+03 ERROR [org.ovirt.engine.core.bll.storage.lsm.LiveMigrateDiskCommand] (EE-ManagedThreadFactory-engineScheduled-Thread-42) [56a5561c-14dd-4b09-ad90-5f23f20c40ca] Ending command 'org.ovir
t.engine.core.bll.storage.lsm.LiveMigrateDiskCommand' with failure.



Used:
rhvm-4.2.4.2-0.1.el7_3.noarch
vdsm-4.20.30-1.el7ev.x86_64

Comment 16 Tal Nisan 2018-06-20 06:15:25 UTC
*** Bug 1591514 has been marked as a duplicate of this bug. ***

Comment 18 errata-xmlrpc 2018-06-27 10:02:42 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2018:2071

Comment 19 Franta Kust 2019-05-16 13:08:31 UTC
BZ<2>Jira Resync