Bug 1246114 - [RFE][scale] Snapshot deletion of poweredoff VM takes longer time.
Summary: [RFE][scale] Snapshot deletion of poweredoff VM takes longer time.
Keywords:
Status: CLOSED CURRENTRELEASE
Alias: None
Product: ovirt-engine
Classification: oVirt
Component: General
Version: ---
Hardware: All
OS: Unspecified
high
high
Target Milestone: ovirt-4.1.0-beta
: 4.1.0.2
Assignee: Ala Hino
QA Contact: Raz Tamir
URL: https://github.com/alhino/ovirt-site/...
Whiteboard:
: 1206328 1275652 (view as bug list)
Depends On: 1408583
Blocks: 1256500 1275655 1369942 1395146
TreeView+ depends on / blocked
 
Reported: 2015-07-23 13:17 UTC by dev-unix-virtualization
Modified: 2017-02-01 14:54 UTC (History)
17 users (show)

Fixed In Version:
Clone Of:
Environment:
Last Closed: 2017-02-01 14:54:35 UTC
oVirt Team: Storage
Embargoed:
rule-engine: ovirt-4.1+
gklein: testing_plan_complete-
ylavi: planning_ack+
amureini: devel_ack+
eberman: testing_ack+


Attachments (Terms of Use)
snaphost (13.83 KB, image/png)
2015-07-23 13:17 UTC, dev-unix-virtualization
no flags Details
VDSM Logs (8.66 MB, text/plain)
2015-07-23 13:19 UTC, dev-unix-virtualization
no flags Details
Engine Logs (9.70 MB, text/plain)
2015-07-23 13:21 UTC, dev-unix-virtualization
no flags Details


Links
System ID Private Priority Status Summary Last Updated
oVirt gerrit 66426 0 master MERGED Cold Merge: Implement SDM.merge verb 2016-12-15 15:18:06 UTC
oVirt gerrit 66994 0 master MERGED Cold Merge: Merge step 2016-12-15 13:01:04 UTC
oVirt gerrit 66997 0 master MERGED Cold Merge: Destroy image step 2016-12-15 13:02:40 UTC
oVirt gerrit 67064 0 master POST api: Introduce SubchainInfo type 2016-11-22 20:45:47 UTC
oVirt gerrit 67079 0 master MERGED core: Add API to verify that qemuimg commit is supported 2016-12-15 12:59:28 UTC
oVirt gerrit 67184 0 master MERGED Cold Merge: Introduce cold merge related verbs 2016-11-22 21:54:13 UTC
oVirt gerrit 67249 0 master MERGED Cold Merge: Prepare merge step 2016-12-15 13:00:30 UTC
oVirt gerrit 67250 0 master MERGED Cold Merge: Finalize merge step 2016-12-15 13:01:45 UTC
oVirt gerrit 67393 0 master MERGED Cold Merge: Change prepare and finalize merge verbs definition 2016-12-12 15:31:05 UTC
oVirt gerrit 68247 0 master MERGED Cold Merge: Implement StoragePool.prepareMerge 2016-12-15 15:17:50 UTC
oVirt gerrit 68248 0 master MERGED Cold Merge: Implement StoragePool.finalizeMerge 2016-12-15 15:17:57 UTC
oVirt gerrit 68542 0 master MERGED Cold Merge: Enable qemuimg commit support 2016-12-15 16:16:34 UTC

Description dev-unix-virtualization 2015-07-23 13:17:56 UTC
Created attachment 1055381 [details]
snaphost

Description of problem:
Snapshot deletion of powered off VMs takes longer.


Version-Release number of selected component (if applicable):
Version 3.5.1-0.4.el6ev

How reproducible:Always


Steps to Reproduce:
1.Create a snapshot of powered off VM
2.Try to delete the snapshot
3.Snapshot will be in locked state for more than 10-15 mins which is huge when compared to live VM snapshot removal which is taking 1-2 mins.

Actual results:


Expected results:


Additional info:

Comment 1 dev-unix-virtualization 2015-07-23 13:19:42 UTC
Created attachment 1055382 [details]
VDSM Logs

Comment 2 dev-unix-virtualization 2015-07-23 13:21:17 UTC
Created attachment 1055383 [details]
Engine Logs

Comment 3 Allon Mureinik 2015-07-26 10:48:44 UTC
This needs some initial analysis to see what's taking up the time here.

Nir - can you take a look please?

Comment 4 Nir Soffer 2015-07-26 10:58:39 UTC
(In reply to dev-unix-virtualization from comment #0)
> 3.Snapshot will be in locked state for more than 10-15 mins which is huge
> when compared to live VM snapshot removal which is taking 1-2 mins.

Both tests were done on same storage and same wipe-after-delete settings?

Comment 5 dev-unix-virtualization 2015-07-26 12:48:49 UTC
Yes, There was no change w.r.t to storage and settings, Also reproducible every time on our 2 test setups. One setup is on iSCSI storage domain and one is on local storage/NFS.

Comment 6 Nir Soffer 2015-07-26 13:36:32 UTC
(In reply to dev-unix-virtualization from comment #5)
> Yes, There was no change w.r.t to storage and settings, Also reproducible
> every time on our 2 test setups. One setup is on iSCSI storage domain and
> one is on local storage/NFS.

Can you provide detailed description, how to reproduce this issue?

- Vm configuration
- Disk configuration (format, virtual size, actual size)
- What os to install
- How to create the snapshot/delete (using the rest api?)

Comment 7 dev-unix-virtualization 2015-07-26 14:29:32 UTC
RHEV-M:- Version 3.5.1-0.4.el6ev
RHEVH:- RHEV Hypervisor - 7.1 - 20150420.0.el7ev
Kernel Version:- 3.10.0 - 229.1.2.el7.x86_64
VDSM Version:vdsm-4.16.13.1-1.el7ev

CPU Model:- Intel(R) Xeon(R) CPU           E5620  @ 2.40GHz
CPU Type: Intel Westmere Family
Model:- PowerEdge R410


VM Configuration.

OS:- Windows 2012 
Virtual size:- 32GB
Actual size :- 8GB
Interface type:- VirtIO
Storage type:- iSCSI from Dell Compellent array.
Allocation Policy:- Thin Provisioned


RestAPI used to create and delete the snapshot.

For Create Snapshot:
										<VM ID>
POST REAT API: [https://rhevm.devemc.commvault.com/api/vms/8f7eccd9-156e-4790-9ef7-db090e4662cf/snapshot]

Body: [<?xml version="1.0" encoding="UTF-8" standalone="no" ?> <snapshot>

  <description>_GX_BACKUP_vm1_25_20111_392cd700</description>

</snapshot>]


For Delete Snapshot:
										<VM ID>						<Snapshot Id>
DELETE REST API: [https://rhevm.devemc.commvault.com/api/vms/8f7eccd9-156e-4790-9ef7-db090e4662cf/snapshots/6195d4a0-7d85-404e-a659-06f7852289e4]

Body: [<?xml version="1.0" encoding="UTF-8" standalone="no" ?> <action>

  <async>false</async>

  <detach>false</detach>

</action>]

Comment 8 dev-unix-virtualization 2015-09-01 22:00:30 UTC
Any update on this issue ?

Comment 9 dev-unix-virtualization 2015-09-09 14:53:37 UTC
Can you please elaborate upon what info is being asked for ? We have provided requested information in Comment#7 as a response to Comment#6.

Comment 10 Yaniv Lavi 2015-09-09 15:39:21 UTC
The need info was on the assignee, restoring it.

Comment 11 Nir Soffer 2015-09-14 14:30:16 UTC
(In reply to dev-unix-virtualization from comment #8)
> Any update on this issue ?

Cold snapshot delete is implemented differently and is less efficient
than live snapshot delete, since "qemu-img commit" was not available when
cold snapshot was developed.

We still need to investigate further to see if the the timing info in
comment 0 is normal.

Comment 12 Sandro Bonazzola 2015-10-26 12:48:41 UTC
this is an automated message. oVirt 3.6.0 RC3 has been released and GA is targeted to next week, Nov 4th 2015.
Please review this bug and if not a blocker, please postpone to a later release.
All bugs not postponed on GA release will be automatically re-targeted to

- 3.6.1 if severity >= high
- 4.0 if severity < high

Comment 13 Yaniv Lavi 2015-10-29 09:52:16 UTC
*** Bug 1206328 has been marked as a duplicate of this bug. ***

Comment 14 Maor 2015-11-11 16:40:18 UTC
*** Bug 1275652 has been marked as a duplicate of this bug. ***

Comment 15 Yaniv Lavi 2015-11-16 12:54:24 UTC
We will be rewriting this flow in RHEV 4 to make it faster and more similar to live merge. This is not a task possible for RHEV 3.6.

Comment 16 Red Hat Bugzilla Rules Engine 2015-11-30 19:12:59 UTC
Target release should be placed once a package build is known to fix a issue. Since this bug is not modified, the target version has been reset. Please use target milestone to plan a fix for a oVirt release.

Comment 17 Nikolai Sednev 2015-12-09 12:52:54 UTC
Happens also on FC storage domain.

Comment 18 Aharon Canan 2016-01-19 15:16:36 UTC
I just realized that this is a regression (following comment #2 on bug 1275652 which was closed as duplicate of this one)

As it affects our automation and as this is a regression , I would like to raise and try to get a fix on 3.6.

Comment 19 Red Hat Bugzilla Rules Engine 2016-01-19 15:16:38 UTC
This bug is marked for z-stream, yet the milestone is for a major version, therefore the milestone has been reset.
Please set the correct milestone or drop the z stream flag.

Comment 20 Red Hat Bugzilla Rules Engine 2016-01-19 15:16:38 UTC
This bug report has Keywords: Regression or TestBlocker.
Since no regressions or test blockers are allowed between releases, it is also being identified as a blocker for this release. Please resolve ASAP.

Comment 21 Allon Mureinik 2016-01-19 15:22:01 UTC
(In reply to Aharon Canan from comment #18)
> I just realized that this is a regression (following comment #2 on bug
> 1275652 which was closed as duplicate of this one)
Cold merge is slower than live merge, even in 3.5 (as comment 2 states).
This is not a regression.

> As it affects our automation and as this is a regression , I would like to
> raise and try to get a fix on 3.6.
The solution is to rewrite the flow. Hardly a 3.6 item.

Comment 22 Nir Soffer 2016-11-22 21:55:39 UTC
We merged the new api, but it is not implemented yet.

Comment 23 Tal Nisan 2017-01-12 13:21:04 UTC
Ala, please update the doc text with the feature page or a relevant text


Note You need to log in before you can comment on or make changes to this bug.