Bug 1585039 - [downstream clone - 4.2.4] Live Storage Migration continued on after snapshot creation hung and timed out
Summary: [downstream clone - 4.2.4] Live Storage Migration continued on after snapshot...
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Enterprise Virtualization Manager
Classification: Red Hat
Component: ovirt-engine
Version: 4.1.6
Hardware: Unspecified
OS: Linux
medium
high
Target Milestone: ovirt-4.2.4
: ---
Assignee: Benny Zlotnik
QA Contact: Elad
URL:
Whiteboard:
: 1591514 (view as bug list)
Depends On: 1497355
Blocks:
TreeView+ depends on / blocked
 
Reported: 2018-06-01 08:14 UTC by RHV bug bot
Modified: 2022-07-09 09:56 UTC (History)
16 users (show)

Fixed In Version: ovirt-engine-4.2.4.1
Doc Type: If docs needed, set a value
Doc Text:
Clone Of: 1497355
Environment:
Last Closed: 2018-06-27 10:02:42 UTC
oVirt Team: Storage
Target Upstream Version:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Red Hat Knowledge Base (Solution) 3194802 0 None None None 2018-06-01 08:15:12 UTC
Red Hat Product Errata RHSA-2018:2071 0 None None None 2018-06-27 10:03:32 UTC
oVirt gerrit 83836 0 master ABANDONED WIP core: introduce CreateAllSnapshotsFromVmCommandCallback 2018-06-01 08:15:12 UTC
oVirt gerrit 87671 0 master MERGED core: introduce CreateSnapshotForVm 2018-06-01 08:15:12 UTC
oVirt gerrit 87805 0 master MERGED core: when snapshot creation fails, do not cleanup target 2018-06-01 08:15:12 UTC
oVirt gerrit 89309 0 ovirt-engine-4.2 MERGED core: when snapshot creation fails, do not cleanup target 2018-06-01 08:15:12 UTC
oVirt gerrit 89773 0 master MERGED core: fix error handling in CreateSnapshotForVmCommand 2018-06-01 08:15:12 UTC
oVirt gerrit 90670 0 ovirt-engine-4.2 MERGED core: introduce CreateSnapshotForVm 2018-06-01 08:15:12 UTC
oVirt gerrit 90835 0 ovirt-engine-4.2 MERGED core: fix error handling in CreateSnapshotForVmCommand 2018-06-01 08:15:12 UTC
oVirt gerrit 91359 0 master MERGED core: remove image from storage after failed snapshot 2018-06-01 08:15:12 UTC
oVirt gerrit 91658 0 ovirt-engine-4.2 MERGED core: remove image from storage after failed snapshot 2018-06-01 08:15:12 UTC

Description RHV bug bot 2018-06-01 08:14:09 UTC
+++ This bug is a downstream clone. The original bug is: +++
+++   bug 1497355 +++
======================================================================

Description of problem:

The snapshot creation of a Live Storage Migration hung and timed out on the engine side. However, the engine then continued on with CloneImageGroupStructureVDSCommand and VmReplicateDiskStartVDSCommand, etc.

The result was that the LSM effectively failed, with the disk still residing in the source storage domain. However, volumes were created in the target storage domain, which caused a subsequent LSM to fail.


Version-Release number of selected component (if applicable):

RHV 4.1.6
RHVH 4.1.6
  vdsm-4.19.31-1.el7ev
  

How reproducible:

Not.

Steps to Reproduce:
1.
2.
3.

Actual results:


Expected results:


Additional info:

(Originally by Gordon Watson)

Comment 5 RHV bug bot 2018-06-01 08:14:33 UTC
Benny, can you take a look please?

Tentitively targetting for 4.2.
If there's something safe enough to backport to 4.1.z, we should do that, but I'm not commiting on such a fix unless we see what the upstream fix contains.

(Originally by amureini)

Comment 15 Elad 2018-06-19 13:36:58 UTC
Live storage migration is aborted in case of a snapshot creation failure:


2018-06-19 16:34:28,149+03 ERROR [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (EE-ManagedThreadFactory-engineScheduled-Thread-62) [] EVENT_ID: USER_CREATE_SNAPSHOT_FINISHED_FAILURE(69),
 Failed to complete snapshot 'test1_Disk1 Auto-generated for Live Storage Migration' creation for VM 'test1'.



2018-06-19 16:34:30,249+03 ERROR [org.ovirt.engine.core.bll.storage.lsm.LiveMigrateDiskCommand] (EE-ManagedThreadFactory-engineScheduled-Thread-42) [56a5561c-14dd-4b09-ad90-5f23f20c40ca] Ending command 'org.ovir
t.engine.core.bll.storage.lsm.LiveMigrateDiskCommand' with failure.



Used:
rhvm-4.2.4.2-0.1.el7_3.noarch
vdsm-4.20.30-1.el7ev.x86_64

Comment 16 Tal Nisan 2018-06-20 06:15:25 UTC
*** Bug 1591514 has been marked as a duplicate of this bug. ***

Comment 18 errata-xmlrpc 2018-06-27 10:02:42 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2018:2071

Comment 19 Franta Kust 2019-05-16 13:08:31 UTC
BZ<2>Jira Resync


Note You need to log in before you can comment on or make changes to this bug.