| Summary: | [Scale] Creating VM from Template dialog box blocking UI for 2 minutes during concurrent VM deletes | ||
|---|---|---|---|
| Product: | [oVirt] ovirt-engine | Reporter: | mlehrer |
| Component: | Frontend.WebAdmin | Assignee: | bugs <bugs> |
| Status: | CLOSED WONTFIX | QA Contact: | Pavel Stehlik <pstehlik> |
| Severity: | high | Docs Contact: | |
| Priority: | unspecified | ||
| Version: | 4.0.5.1 | CC: | ahadas, bugs, michal.skrivanek, mlehrer, oourfali, tjelinek |
| Target Milestone: | --- | Keywords: | Performance |
| Target Release: | --- | ||
| Hardware: | Unspecified | ||
| OS: | Unspecified | ||
| Whiteboard: | |||
| Fixed In Version: | Doc Type: | If docs needed, set a value | |
| Doc Text: | Story Points: | --- | |
| Clone Of: | Environment: | ||
| Last Closed: | 2016-11-30 09:06:34 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | Category: | --- | |
| oVirt Team: | Virt | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
|
Description
mlehrer
2016-11-17 00:24:03 UTC
logs? Sounds like an engine issue, not UI. Is the UI waiting on a server response when this is happening? (check dev tools) Moving to virt to examine the issue. (In reply to Yaniv Kaul from comment #1) > logs? https://drive.google.com/open?id=0B8V1DXeGhPPWXzZHWXJnVjFQX1U (In reply to Greg Sheremeta from comment #2) > Sounds like an engine issue, not UI. > > Is the UI waiting on a server response when this is happening? (check dev > tools) The longest request is 34s, a POST but unfortunately GWT debug is not on so I can't see much more than GenericApiGWTService I also see several "reports-interface-proxy?command=status" GETs that return in 404, likely unrelated but just sharing. Per Engine.log Deletes began here 2016-11-17 19:16:55,607 INFO [org.ovirt.engine.core.bll.RemoveVmCommand] (org.ovirt.thread.pool-6-thread-30) [6cd3ed09] Running command: RemoveVmCommand internal: false. Entities affected : ID: d049a4fe-edff-4343-bc08-7f17d63eff30 Type: VMAction group DELETE_VM with role type USER Template Creation began here 2016-11-17 19:18:08,178 INFO [org.ovirt.engine.core.bll.AddVmFromTemplateCommand] (default task-63) [2f494013] Lock Acquired to object 'EngineLock:{exclusiveLocks='[test1=<VM_NAME, ACTION_TYPE_FAILED_OBJECT_LOCKED>]', sharedLocks='[95c31e8b-663d-4842-938e-55863afe0daf=<DISK, ACTION_TYPE_FAILED_DISK_IS_USED_FOR_CREATE_VM$VmName test1>, eac7a463-92b7-441a-bbb3-27fa8385adc3=<DISK, ACTION_TYPE_FAILED_DISK_IS_USED_FOR_CREATE_VM$VmName test1>, f9d3c73b-8375-4f96-b813-eb21568a2032=<DISK, ACTION_TYPE_FAILED_DISK_IS_USED_FOR_CREATE_VM$VmName test1>, 5faf956f-4341-489e-b770-e1c640123cd4=<DISK, ACTION_TYPE_FAILED_DISK_IS_USED_FOR_CREATE_VM$VmName test1>, a0b4bf0a-73dd-456d-843b-7392ec866b7d=<TEMPLATE, ACTION_TYPE_FAILED_TEMPLATE_IS_USED_FOR_CREATE_VM$VmName test1>, 89e264d9-ef3a-41e3-95c8-36da9876e53f=<DISK, ACTION_TYPE_FAILED_DISK_IS_USED_FOR_CREATE_VM$VmName test1>]'}' Again, logs, and video of test are found here: https://drive.google.com/open?id=0B8V1DXeGhPPWXzZHWXJnVjFQX1U (In reply to mlehrer from comment #4) > (In reply to Yaniv Kaul from comment #1) > > logs? > > https://drive.google.com/open?id=0B8V1DXeGhPPWXzZHWXJnVjFQX1U > > (In reply to Greg Sheremeta from comment #2) > > Sounds like an engine issue, not UI. > > > > Is the UI waiting on a server response when this is happening? (check dev > > tools) > > The longest request is 34s, a POST but unfortunately GWT debug is not on so > I can't see much more than GenericApiGWTService > I also see several "reports-interface-proxy?command=status" GETs that return > in 404, likely unrelated but just sharing. This is bug 1371570 and indeed, unrelated. > > > Per Engine.log > > Deletes began here > 2016-11-17 19:16:55,607 INFO [org.ovirt.engine.core.bll.RemoveVmCommand] > (org.ovirt.thread.pool-6-thread-30) [6cd3ed09] Running command: > RemoveVmCommand internal: false. Entities affected : ID: > d049a4fe-edff-4343-bc08-7f17d63eff30 Type: VMAction group DELETE_VM with > role type USER > > Template Creation began here > 2016-11-17 19:18:08,178 INFO > [org.ovirt.engine.core.bll.AddVmFromTemplateCommand] (default task-63) > [2f494013] Lock Acquired to object > 'EngineLock:{exclusiveLocks='[test1=<VM_NAME, > ACTION_TYPE_FAILED_OBJECT_LOCKED>]', > sharedLocks='[95c31e8b-663d-4842-938e-55863afe0daf=<DISK, > ACTION_TYPE_FAILED_DISK_IS_USED_FOR_CREATE_VM$VmName test1>, > eac7a463-92b7-441a-bbb3-27fa8385adc3=<DISK, > ACTION_TYPE_FAILED_DISK_IS_USED_FOR_CREATE_VM$VmName test1>, > f9d3c73b-8375-4f96-b813-eb21568a2032=<DISK, > ACTION_TYPE_FAILED_DISK_IS_USED_FOR_CREATE_VM$VmName test1>, > 5faf956f-4341-489e-b770-e1c640123cd4=<DISK, > ACTION_TYPE_FAILED_DISK_IS_USED_FOR_CREATE_VM$VmName test1>, > a0b4bf0a-73dd-456d-843b-7392ec866b7d=<TEMPLATE, > ACTION_TYPE_FAILED_TEMPLATE_IS_USED_FOR_CREATE_VM$VmName test1>, > 89e264d9-ef3a-41e3-95c8-36da9876e53f=<DISK, > ACTION_TYPE_FAILED_DISK_IS_USED_FOR_CREATE_VM$VmName test1>]'}' > > Again, logs, and video of test are found here: > https://drive.google.com/open?id=0B8V1DXeGhPPWXzZHWXJnVjFQX1U ok, i see it does take most of the time during storage operation on those disks. There seem to be ~25 tasks on SPM at that time so kicking them off takes some time. We should probably make it more async than it is today (it would likely be possible) but I do not think the current behavior is so bad. It is not nicer, but you would need to wait for finishing the template before being able to use it anyway. Tomas/Arik, any more thoughts? Note that some operations (such as cloning) is moving to HSM soon, and will be sent in round-robin fashion to the HSMs. This should both reduce the load on the SPM (task-wise) as well as storage-wise (if its pipes are full with traffic / storage actions). @Michal - I agree it is not that bad. We could close the dialog also sooner, much sooner actually, but we would sacrifice usability in case the environment is not so loaded and some issue occurred the user wants to fix. Also, we do not block the user from closing the dialog immediately, it will not cancel the operation. I would propose based on this and Comment 7 to close this as wont fix. |