| Summary: | Cannot relocate SPM: The Storage Pool has running tasks. | ||||||
|---|---|---|---|---|---|---|---|
| Product: | [oVirt] ovirt-engine | Reporter: | Markus Stockhausen <mst> | ||||
| Component: | Backend.Core | Assignee: | Liron Aravot <laravot> | ||||
| Status: | CLOSED NOTABUG | QA Contact: | Aharon Canan <acanan> | ||||
| Severity: | high | Docs Contact: | |||||
| Priority: | unspecified | ||||||
| Version: | 3.6.7 | CC: | bugs, mst, nsoffer, tnisan, ylavi | ||||
| Target Milestone: | ovirt-4.0.6 | Flags: | tnisan: ovirt-4.0.z? rule-engine: planning_ack? rule-engine: devel_ack? rule-engine: testing_ack? | ||||
| Target Release: | --- | ||||||
| Hardware: | Unspecified | ||||||
| OS: | Unspecified | ||||||
| Whiteboard: | |||||||
| Fixed In Version: | Doc Type: | If docs needed, set a value | |||||
| Doc Text: | Story Points: | --- | |||||
| Clone Of: | Environment: | ||||||
| Last Closed: | 2016-10-31 12:43:46 UTC | Type: | Bug | ||||
| Regression: | --- | Mount Type: | --- | ||||
| Documentation: | --- | CRM: | |||||
| Verified Versions: | Category: | --- | |||||
| oVirt Team: | Storage | RHEL 7.3 requirements from Atomic Host: | |||||
| Cloudforms Team: | --- | Target Upstream Version: | |||||
Description
Markus Stockhausen
2016-09-14 19:01:30 UTC
Can you share the output of:
vdsClient -s 0 getAllTasks
If you see any tasks on the SPM, please provide vdsm and engine logs from the time these
tasks started.
If there are no tasks in vdsm, you probably have some task IDs in the engine logs; we
will need logs from the time these tasks were created.
Created attachment 1200945
SPM
No running tasks in VDSM on SPM. See screenshot attached.

We cannot do anything without logs, please see comment 1, thanks!

I can provide more logs if you like. The only problem is that we do not know when all of this started. The engine ran for months without obvious issues, so it is hard to send you gigabytes of logs. First of all, you could help to identify the blocker. Why does engine think there are still running tasks? From looking at the DB there should be nothing left.
engine=# select * from job order by start_time desc;
job_id | action_type | description | status | owner_id | visible | ...
--------+-------------+-------------+--------+----------+---------+----
(0 rows)

OK, here is the output from the taskcleaner:
./share/ovirt-engine/setup/dbutils/taskcleaner.sh -u engine -d engine -z
t
92ff6bd7-ae6b-4078-84c3-9f65972ef339 | 8 | 2 | 2016-08-18 19:36:38.805+02 | 0 | 1010 | 5153efbf-26a8-44c4-b4cf-cda962587707 | fd9617b0-e830-42cf-9bc3-26601904f365 | 94ed7a19-fade-4bd6-83f2-2cbb2f730b95

I'm sorry to say that we have no more logs from that date. Is there anything more I can provide before I clean up this task? From reading different BZs, that means we have a running (status=2) disk migration task (type 1010).
Regarding the VM:
engine=# select * from async_tasks_entities;
async_task_id | entity_id |tp
------------------------------------+------------------------------------+--
92ff6bd7-ae6b-4078-84c3-9f65972ef339|2ee5fd27-73d6-4efc-bbbd-0bcfac0715bf|VM
engine=# select vm_guid,status,vm_host from
vm_dynamic where vm_guid = '2ee5fd27-73d6-4efc-bbbd-0bcfac0715bf';
vm_guid | status | vm_host
--------------------------------------+--------+---------------------
2ee5fd27-73d6-4efc-bbbd-0bcfac0715bf | 1 | colvm01.collogia.de
Why doesn't engine clean up this task?
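
To make the reading of that taskcleaner row explicit, here is a minimal Python sketch that splits the pipe-delimited row and labels the two codes the way the reporter does (status 2 as running, action type 1010 as disk migration). The field names and positions are assumptions inferred from this thread, not taken from the engine schema, the helper names are hypothetical, and the seventh field is written as one UUID on the assumption that the space in the paste is a line wrap.

```python
# Field order is an ASSUMPTION inferred from the thread: the reporter reads
# the third field as the task status and the sixth as the action type.
# The remaining names are placeholders, not real column names.
ASSUMED_COLUMNS = [
    "task_id", "field_2", "status", "started_at", "field_5",
    "action_type", "id_7", "id_8", "id_9",
]

# Only the two meanings stated in the thread; other codes are unknown here.
STATUS_NAMES = {2: "running"}
ACTION_TYPE_NAMES = {1010: "disk migration"}


def parse_task_row(row: str) -> dict:
    """Split one pipe-delimited psql row into the assumed named fields."""
    values = [cell.strip() for cell in row.split("|")]
    if len(values) != len(ASSUMED_COLUMNS):
        raise ValueError(
            f"expected {len(ASSUMED_COLUMNS)} fields, got {len(values)}"
        )
    return dict(zip(ASSUMED_COLUMNS, values))


def describe_task(row: str) -> str:
    """Render the row as a one-line, human-readable summary."""
    task = parse_task_row(row)
    status = int(task["status"])
    action = int(task["action_type"])
    status_name = STATUS_NAMES.get(status, f"unknown status {status}")
    action_name = ACTION_TYPE_NAMES.get(action, f"unknown action type {action}")
    return (
        f"task {task['task_id']}: {action_name}, {status_name}, "
        f"started {task['started_at']}"
    )


# The task row as pasted in the thread.
row = (
    "92ff6bd7-ae6b-4078-84c3-9f65972ef339 | 8 | 2 | "
    "2016-08-18 19:36:38.805+02 | 0 | 1010 | "
    "5153efbf-26a8-44c4-b4cf-cda962587707 | "
    "fd9617b0-e830-42cf-9bc3-26601904f365 | "
    "94ed7a19-fade-4bd6-83f2-2cbb2f730b95"
)
print(describe_task(row))
```

Codes that the thread does not explain are reported as unknown rather than guessed, so the sketch only restates what the reporter already concluded.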
We want to fix the error state. Can we do the task cleanup or do you need other data?

Hi Markus, what is the current status of the disk that was being migrated? Is it LOCKED? I'd suggest not clearing the task yourself, but updating its status so that the migration operation will converge (and the task will be cleared); otherwise the disk will remain locked.

We fixed the state with taskcleaner. The disk state (locked) was reset manually.

This is by design: you cannot relocate the SPM until the tasks are done or cleaned. We are making an effort to move some tasks to HSM, which will help here.