Bug 1395941
| Summary: | [Scale] Average time for VM Snapshot to complete degrades once a VM contains multiple snapshots [Fixed for RHEL >= 7.5] | ||
|---|---|---|---|
| Product: | [oVirt] ovirt-engine | Reporter: | mlehrer |
| Component: | General | Assignee: | Ala Hino <ahino> |
| Status: | CLOSED CURRENTRELEASE | QA Contact: | guy chen <guchen> |
| Severity: | medium | Docs Contact: | |
| Priority: | high | ||
| Version: | 4.0.5.1 | CC: | ahino, amureini, bugs, guchen, mlehrer, nsoffer, rgolan, tjelinek, tnisan, ylavi |
| Target Milestone: | ovirt-4.2.2 | Keywords: | Performance |
| Target Release: | 4.2.2.2 | Flags: | rule-engine: ovirt-4.2+, ylavi: exception+ |
| Hardware: | Unspecified | ||
| OS: | Unspecified | ||
| Whiteboard: | |||
| Fixed In Version: | | Doc Type: | If docs needed, set a value |
| Doc Text: | | Story Points: | --- |
| Clone Of: | | Environment: | |
| Last Closed: | 2018-03-29 11:07:45 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | | Category: | --- |
| oVirt Team: | Storage | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | |||
| Bug Depends On: | 1213786 | ||
| Bug Blocks: | 1551684 | ||
|
Description
mlehrer
2016-11-17 00:50:13 UTC
Does it have anything to do with the dataset in step 1? Can you attach relevant logs?

ping, can you please attach the logs?

(In reply to Yaniv Kaul from comment #1)
> Does it have anything to do with the dataset in step 1?
It may contribute; we'll need to test a less populated environment to get more insight.
> Can you attach relevant logs?
Yes, see link below [1].

(In reply to Tomas Jelinek from comment #2)
> ping, can you please attach the logs?
Yes, see link below [1].

The following activity was done sequentially 4 times, with no concurrency, on the same VM. The actions were: VM Snapshot (with memory), then VM Snapshot (no memory). In the shared logs link [1] you'll find a comparison table of the snapshot samples; you'll see the execution time increasing per iteration.

[1] https://drive.google.com/open?id=0B8V1DXeGhPPWempHNlJjNVNMU1U

Based on conversations with Tomas, we have agreed to re-test this scenario manually to verify that automated scripts which ran immediately before this test scenario did not in any way contribute to seeing this issue. I will update the BZ with the results of this manual test.

Tal, can you or one of the team take that?

Let's retest this on 4.1 DC level please. The compat=1.1 parameter we use for qcows there should improve things, and may render this BZ obsolete (or not, of course ;-))

I have retested this on the latest 4.1 build 4 - the bug reproduced: the first snapshot took 46 seconds, each snapshot after it takes longer, and after 10 snapshots creating a snapshot takes 1 minute and 49 seconds.

Moving out all non-blockers\exceptions.

(In reply to guy chen from comment #8)
> I have retested this on the latest 4.1 build 4 - the bug reproduced: the first
> snapshot took 46 seconds, each snapshot after it takes longer, and after 10
> snapshots creating a snapshot takes 1 minute and 49 seconds.
Guy, can you provide info about the setup used to test? Mainly, I'd like to know how many hosts there were in the deployment.
In addition, can you run the same test while the VM is running on HSM?

The system had 1 host, 1 iSCSI SD, and 200 VMs with 5 thin-provisioned disks per VM. Currently we don't have a setup with more than 1 host, but when we have one I will run it on HSM.

Ala, see https://bugzilla.redhat.com/show_bug.cgi?id=1213786#c5 - maybe we'd like to depend on that bug?

(In reply to Nir Soffer from comment #12)
> Ala, see https://bugzilla.redhat.com/show_bug.cgi?id=1213786#c5 - maybe we'd
> like to depend on that bug?
Sounds legit.

(In reply to Nir Soffer from comment #12)
> Ala, see https://bugzilla.redhat.com/show_bug.cgi?id=1213786#c5 - maybe we'd
> like to depend on that bug?
Allon was quick and marked this bug to depend on BZ 1213786.

(In reply to Ala Hino from comment #14)
> (In reply to Nir Soffer from comment #12)
> > Ala, see https://bugzilla.redhat.com/show_bug.cgi?id=1213786#c5 - maybe we'd
> > like to depend on that bug?
>
> Allon was quick and marked this bug to depend on BZ 1213786.
Bug 1213786 has been suggested for 7.4.z, let's see where it ends up.

(In reply to Allon Mureinik from comment #16)
> Bug 1213786 has been suggested for 7.4.z, let's see where it ends up.
It looks like it's only going to 7.5 - but is already in VERIFIED state. Anything we need to do here? In any case, moving to ASSIGNED, as the patch above is no longer valid.

The patch was in POST state, not verified. In any case, this will wait for 7.5.

(In reply to Ala Hino from comment #18)
> The patch was in POST state, not verified.
The platform BZ is in VERIFIED state.
> In any case, this will wait for 7.5.
Which means you can take the package and test if it fixes the issue already - what do we need to wait for? Do we have any work on our side to do here?

(In reply to Yaniv Kaul from comment #19)
> (In reply to Ala Hino from comment #18)
> > The patch was in POST state, not verified.
>
> The platform BZ is in VERIFIED state.
>
> > In any case, this will wait for 7.5.
>
> Which means you can take the package and test if it fixes the issue already
> - what do we need to wait for?
>
> Do we have any work on our side to do here?
Yes, we have to change our code to use the new option.

Ala, all the patches attached are merged. Are we waiting for something else?

Sorry Mordehai, meant to direct this needinfo at Ala - please ignore.

(In reply to Allon Mureinik from comment #21)
> Ala, all the patches attached are merged.
> Are we waiting for something else?
This can only be verified on RHEL 7.5, where we have all qemu unsafe support.

Moving to MODIFIED then. QA contact - note this requires qemu-*-[rh]ev-2.10 to verify.

Retested with oVirt 4.2.2, vdsm 4.20.20, and RHEL 7.5. Created 10 snapshots on a VM with 2 HDDs; the issue did not reproduce and the duration stayed stable, thus the bug is verified.

This bugzilla is included in the oVirt 4.2.2 release, published on March 28th 2018. Since the problem described in this bug report should be resolved in the oVirt 4.2.2 release, it has been closed with a resolution of CURRENT RELEASE. If the solution does not work for you, please open a new bug report. |
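For anyone re-running the check described in this thread (create snapshots sequentially on one VM and compare durations), the measurement could be sketched roughly as below. This is only a minimal sketch, not the script QA used: `time_iterations`, `slowdown_ratio`, `is_degrading`, and the 1.25 tolerance are all hypothetical names and values chosen for illustration, and the `action` callable stands in for whatever actually creates a snapshot and waits for it to finish.

```python
import time
from typing import Callable, List


def time_iterations(action: Callable[[int], None], count: int) -> List[float]:
    """Run action(i) sequentially `count` times; return wall-clock seconds per run."""
    durations = []
    for i in range(count):
        start = time.monotonic()
        action(i)  # hypothetical: create snapshot number i and block until done
        durations.append(time.monotonic() - start)
    return durations


def slowdown_ratio(durations: List[float]) -> float:
    """Last duration divided by the first; a value near 1.0 means stable timing."""
    if not durations or durations[0] <= 0:
        raise ValueError("need at least one positive duration")
    return durations[-1] / durations[0]


def is_degrading(durations: List[float], tolerance: float = 1.25) -> bool:
    """True if the last run is slower than the first by more than `tolerance`.

    The default tolerance is an arbitrary assumption, not a value from the bug.
    """
    return slowdown_ratio(durations) > tolerance


# Figures reported in the 4.1 retest above: 46 s for the first snapshot,
# 1 min 49 s (109 s) once the VM already had 10 snapshots.
reported = [46.0, 109.0]
print(f"slowdown: {slowdown_ratio(reported):.2f}x, degrading: {is_degrading(reported)}")
```

With the numbers from the 4.1 retest the ratio comes out well above 2, while a run like the 4.2.2 / RHEL 7.5 verification, where durations stayed stable, should stay close to 1.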