Bug 1211512 - Add VM pool task doesn't ends
Summary: Add VM pool task doesn't ends
Keywords:
Status: CLOSED CURRENTRELEASE
Alias: None
Product: Red Hat Enterprise Virtualization Manager
Classification: Red Hat
Component: ovirt-engine
Version: 3.6.0
Hardware: Unspecified
OS: Unspecified
high
urgent
Target Milestone: ovirt-3.6.0-rc
: 3.6.0
Assignee: Shmuel Melamud
QA Contact: Shira Maximov
URL:
Whiteboard:
: 1215955 1247506 1256030 1304663 (view as bug list)
Depends On:
Blocks: 1247506 1256269
TreeView+ depends on / blocked
 
Reported: 2015-04-14 08:18 UTC by Shira Maximov
Modified: 2016-04-20 01:10 UTC (History)
20 users (show)

Fixed In Version: 3.6.0-12
Doc Type: Bug Fix
Doc Text:
Clone Of:
: 1247506 (view as bug list)
Environment:
Last Closed: 2016-04-20 01:10:12 UTC
oVirt Team: Infra
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)
the tasks print screen (216.77 KB, image/png)
2015-04-14 08:21 UTC, Shira Maximov
no flags Details
the engine and hosts logs (1.24 MB, application/x-bzip)
2015-04-14 08:24 UTC, Shira Maximov
no flags Details
new_pool_engine_log (56.93 KB, text/plain)
2015-08-13 12:40 UTC, sefi litmanovich
no flags Details


Links
System ID Private Priority Status Summary Last Updated
Red Hat Bugzilla 1263678 0 medium CLOSED allocate vm from pool doesn't end task 2021-02-22 00:41:40 UTC
oVirt gerrit 40354 0 master MERGED core: Run AddVmToPool command in a separate context Never
oVirt gerrit 45008 0 master ABANDONED core: POOL vm tasks doesn't ends Never
oVirt gerrit 45547 0 master MERGED engine: Revert JPA Never
oVirt gerrit 45582 0 ovirt-engine-3.6 MERGED core: revert moving backup to JPA Never
oVirt gerrit 45586 0 ovirt-engine-3.6 MERGED engine: Revert JPA Never

Internal Links: 1263678

Description Shira Maximov 2015-04-14 08:18:14 UTC
Description of problem:
When creating a pool vm, the tasks stuck on started and doesn't ends (even not agter severl days or after engine restart)

Version-Release number of selected component (if applicable):
rhevm.noarch	3.6.0-0.1000.7.a03a5cd.master.el6ev
art.noarch	1.0.6-1.6v36

How reproducible:
100%

Steps to Reproduce:
1.create a pool vm
2.look at the pool vm tasks
3.

Actual results:
the task never ends

Expected results:
the tasks spouse to end

Additional info:
the automation build : http://jenkins-ci.eng.lab.tlv.redhat.com/view/0%20Unstable%203.6/job/rhevm_3.6_el6-engine_el7-host_automation_coretools_two_hosts_restapi_vms_nfs_rest_factory/407/

Comment 1 Shira Maximov 2015-04-14 08:21:49 UTC
Created attachment 1014190 [details]
the tasks print screen

Comment 2 Shira Maximov 2015-04-14 08:24:16 UTC
Created attachment 1014191 [details]
the engine and hosts logs

Comment 3 Omer Frenkel 2015-04-29 07:25:12 UTC
*** Bug 1215955 has been marked as a duplicate of this bug. ***

Comment 4 Shira Maximov 2015-07-28 08:55:23 UTC
*** Bug 1247506 has been marked as a duplicate of this bug. ***

Comment 5 Shira Maximov 2015-07-28 08:56:30 UTC
This bug was verified in 3.6.0-2 but was broken again in 3.6.0-3  (vdsm-4.17.0-1054.git562e711.el7.noarch)

Comment 6 sefi litmanovich 2015-08-13 12:39:29 UTC
Reproduced again on  3.6.0-0.0.master.20150804111407.git122a3a0.el6, attaching relevant part of log (just created a new pool).

Comment 7 sefi litmanovich 2015-08-13 12:40:06 UTC
Created attachment 1062514 [details]
new_pool_engine_log

Comment 8 Omer Frenkel 2015-08-13 13:22:50 UTC
I could not see any issue in the add pool commands flow that would cause this, all the steps are finished but the job is still open.
so i think its not the same issue as in the original report, but same result..
also there seem to be some related exception in the log:

015-08-13 18:35:17,262 INFO  [org.ovirt.engine.core.bll.tasks.CommandAsyncTask] (org.ovirt.thread.pool-8-thread-13) [307c57f6] CommandAsyncTask::endCommandAction [within thread] context: Attempting to endAction 'AddVm', executionIndex: '0'
2015-08-13 18:35:17,279 INFO  [org.ovirt.engine.core.utils.transaction.TransactionSupport] (org.ovirt.thread.pool-8-thread-18) [fd52c13] transaction rolled back
2015-08-13 18:35:17,279 ERROR [org.ovirt.engine.core.bll.job.ExecutionHandler] (org.ovirt.thread.pool-8-thread-18) [fd52c13] Exception: javax.persistence.EntityNotFoundException: Unable to find org.ovirt.engine.core.common.job.Step with id f8b23151-8e0d-434b-ba41-321dcf3687c8


Oved, can someone from infra take a look on this?

Comment 9 Oved Ourfali 2015-08-13 16:48:22 UTC
Sure, Liran, can you take a look?

Comment 10 Oved Ourfali 2015-08-13 16:49:22 UTC
Also, Shira, can you test with latest 3.6.0 release?

Comment 11 Omer Frenkel 2015-08-16 13:27:56 UTC
Oved, i was able to reproduce this (easily) on latest master as well.
just create a vm-pool with 3 vms, use template that has a disk.
i could not reproduce this with template that has no disks, so it makes me believe there is a race around tasks/jobs infrastructure (how job is marked as finished)

in my testing, i got the exception mentioned in comment 8 only in some of the cases.

Comment 12 Shira Maximov 2015-08-18 09:36:07 UTC
reproduce this also in downstream :
Red Hat Enterprise Virtualization Manager Version: 3.6.0-0.11.master.el6

Comment 13 Ori Gofen 2015-08-23 14:47:01 UTC
*** Bug 1256030 has been marked as a duplicate of this bug. ***

Comment 14 Ondra Machacek 2015-09-14 07:58:48 UTC
Not sure if this bz should handle also task for allocation vm from pool, but
this task is stuck as well in 3.6.0-12.

Comment 15 Moti Asayag 2015-09-16 07:57:12 UTC
(In reply to Ondra Machacek from comment #14)
> Not sure if this bz should handle also task for allocation vm from pool, but
> this task is stuck as well in 3.6.0-12.

I'd treat it as a separate bug, since there is other known issue of RemovingVmPool which is also stuck and not related to the job/step mechanism.
So that issue should be examine and handled separately.

Comment 16 Oved Ourfali 2015-09-16 12:12:26 UTC
(In reply to Moti Asayag from comment #15)
> (In reply to Ondra Machacek from comment #14)
> > Not sure if this bz should handle also task for allocation vm from pool, but
> > this task is stuck as well in 3.6.0-12.
> 
> I'd treat it as a separate bug, since there is other known issue of
> RemovingVmPool which is also stuck and not related to the job/step mechanism.
> So that issue should be examine and handled separately.

Changing the title to reflect the right scope.

Comment 17 Shira Maximov 2015-10-11 12:12:11 UTC
verified on : Red Hat Enterprise Virtualization Manager Version: 3.6.0-0.18.el6

Comment 18 Arik 2016-02-07 17:06:32 UTC
*** Bug 1304663 has been marked as a duplicate of this bug. ***

Comment 19 Arik 2016-02-10 09:38:46 UTC
Shmuel, can you please take a look? Sefi reports that it happens again on 3.6.3 and I see it also on master

Comment 20 Shmuel Melamud 2016-02-22 12:14:00 UTC
Separate bug 1310426 was created for this issue.


Note You need to log in before you can comment on or make changes to this bug.