Bug 981420
Summary: | A running zombie task is not removed leading to it being repolled after engine restart and spamming engine.log. | ||
---|---|---|---|
Product: | Red Hat Enterprise Virtualization Manager | Reporter: | Lee Yarwood <lyarwood> |
Component: | ovirt-engine | Assignee: | Ravi Nori <rnori> |
Status: | CLOSED ERRATA | QA Contact: | Pavel Stehlik <pstehlik> |
Severity: | high | Docs Contact: | |
Priority: | high | ||
Version: | 3.2.0 | CC: | aberezin, acathrow, bazulay, emesika, iheim, jkt, lpeer, lyarwood, pstehlik, Rhev-m-bugs, rnori, tpoitras, yeylon, yzaslavs |
Target Milestone: | --- | ||
Target Release: | 3.4.0 | ||
Hardware: | x86_64 | ||
OS: | Linux | ||
Whiteboard: | infra | ||
Fixed In Version: | Doc Type: | Bug Fix | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2014-06-09 14:59:45 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | Infra | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Description
Lee Yarwood
2013-07-04 16:06:59 UTC
This issue is not reproducible with 3.3. MoveImageGroupVDSCommand now performs a Copy and Remove (BZ 753549), copy can be stopped so when the zombie timeout is reduced the task is stopped and cleared from DB. (In reply to Ravi Nori from comment #3) > This issue is not reproducible with 3.3. > > MoveImageGroupVDSCommand now performs a Copy and Remove (BZ 753549), copy > can be stopped so when the zombie timeout is reduced the task is stopped and > cleared from DB. So the ClearAsyncTask call completes correctly in 3.3? Again the error in 3.2 was (I'm going to fire a commit upstream to improve the string) : (pool-4-thread-48) SPMAsyncTask::ClearAsyncTask: At time of attemp to clear task 44fcdad3-695a-462a-b11d-a09e004c8a90 the response code was Operation is not allowed in this task state: ("can't clean in state running",) and message was {3}. Task will not be cleaned Can you think of any other commands where we would be unable to clear a running vdsm task? Replying to a question Ravi emailed to me here.
> My question is are there any other cases where the vdsm would throw the
> error "can't clean in state running"
Quickly checking in the vdsm src it would appear that any task that is not 'done' would return this when we attempt to clean the task :
vdsm/vdsm/storage/task.py
1020 def clean(self):
1021 if not self.store:
1022 return
1023 if not self.isDone():
1024 raise se.TaskStateError("can't clean in state %s" % self.state)
1025 self._clean(self.store)
Thanks Lee. Yes the call completes in 3.3 and there is no "can't clean in state running" message The issue is not reproducible on 3.2 either. Is there more information about the clients setup that can help us reproduce it. Moving to 3.4 When we have a reproducer we can look at this for 3.3.z or even 3.4 move to QA per comment 12 As this issue cannot be reproduced, no doc text is required for errata. Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. http://rhn.redhat.com/errata/RHSA-2014-0506.html |