Bug 1859370
Summary: | Retype of RBD snapshot volume is failing | ||
---|---|---|---|
Product: | Red Hat OpenStack | Reporter: | James Parker <jparker> |
Component: | openstack-cinder | Assignee: | Cinder Bugs List <cinder-bugs> |
Status: | CLOSED DUPLICATE | QA Contact: | Tzach Shefi <tshefi> |
Severity: | medium | Docs Contact: | Chuck Copello <ccopello> |
Priority: | medium | ||
Version: | 16.1 (Train) | CC: | abishop, gcharot, ltoscano, lyarwood, senrique |
Target Milestone: | --- | Keywords: | Triaged |
Target Release: | --- | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | If docs needed, set a value | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2021-04-01 09:48:23 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Description
James Parker
2020-07-21 20:16:17 UTC
We need to see the cinder logs (with DEBUG). I reviewed the logs (thanks for saving them on the hypervisor, James!), and see this in the cinder-volume log: 2020-07-21 14:07:15.071 79 INFO cinder.volume.drivers.rbd [req-a624464a-f0f3-41e7-b52e-54308ee5fccc 38438ec8d3d44978b417b1153327b587 8591c28be4224f9cbb6fb59556b50db8 - default default] Image volumes/volume-c5b26f69-87bc-494f-8d4f-9ea03c3a304d is dependent on the snapshot snapshot-81982518-e552-4f6e-a805-0647e9ea2cbf. 2020-07-21 14:07:15.079 79 ERROR cinder.volume.manager [req-a624464a-f0f3-41e7-b52e-54308ee5fccc 38438ec8d3d44978b417b1153327b587 8591c28be4224f9cbb6fb59556b50db8 - default default] Delete snapshot failed, due to snapshot busy.: cinder.exception.SnapshotIsBusy: deleting snapshot snapshot-81982518-e552-4f6e-a805-0647e9ea2cbf that has dependent volumes This occurs because tempest is using this [1] sequence to create the volume it plans to retype. [1] https://github.com/openstack/tempest/blob/6cb37d68b2cb40cec9dcbb9e26c0649c6e6c877a/tempest/api/volume/admin/test_volume_retype.py#L61-L67 The tempest test fails because the snapshot cannot be deleted, and this happens before attempting the actual migration/retype. The reason the snapshot cannot be deleted is the RBD driver creates a fast COW clone of the snapshot, and that creates a dependency on the snapshot that prevents it from being deleted. One solution is to configure the RBD driver with rbd_flatten_volume_from_snapshot=True, but a better solution is to rework the tempest test to defer deleting the snapshot until after the retype operation completes. Unless others object, I think this should be handled as a tempest bug. Ignore my previous comment about this being a tempest bug. Apparently the RBD driver is *not* supposed to behave this way, and the rbd_flatten_volume_from_snapshot parameter is not intended to address the behavior. There are other open BZs covering this problem (e.g. bug #1437392), and the cinder squad needs to do some bz cleanup and determine a course of action. This is going to be addressed in OSP 16.2 thanks to the usage of RBD Clone v2 API. Please see bug 1764324. *** This bug has been marked as a duplicate of bug 1764324 *** |