Bug 1052969 - GlusterFS: deleting a 30GB snapshot takes more than an hour, times out
Summary: GlusterFS: deleting a 30GB snapshot takes more than an hour, times out
Keywords:
Status: CLOSED DUPLICATE of bug 1040711
Alias: None
Product: Red Hat OpenStack
Classification: Red Hat
Component: openstack-cinder
Version: 4.0
Hardware: All
OS: Linux
Priority: high
Severity: high
Target Milestone: ---
Target Release: 5.0 (RHEL 7)
Assignee: Eric Harney
QA Contact: Dafna Ron
URL:
Whiteboard: storage
Depends On: 1040711
Blocks:
 
Reported: 2014-01-14 14:14 UTC by Yogev Rabl
Modified: 2016-04-26 20:31 UTC (History)
4 users

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed: 2014-01-15 14:33:18 UTC
Target Upstream Version:
Embargoed:


Attachments
volume.log (229.30 KB, application/zip)
2014-01-14 14:14 UTC, Yogev Rabl
compute.log (4.36 MB, application/x-bzip)
2014-01-14 15:25 UTC, Yogev Rabl


Links
System ID Private Priority Status Summary Last Updated
Launchpad 1268984 0 None None None Never

Description Yogev Rabl 2014-01-14 14:14:03 UTC
Created attachment 849959 [details]
volume.log

Description of problem:
Cinder failed to delete a snapshot created by an instance snapshot.
Cinder is using GlusterFS as its backend. 

1. Cinder is able to delete volumes.
2. Cinder is able to delete snapshots that were not created by an instance snapshot.

Version-Release number of selected component (if applicable):
python-cinderclient-1.0.7-2.el6ost.noarch
python-cinder-2013.2.1-4.el6ost.noarch
openstack-cinder-2013.2.1-4.el6ost.noarch

How reproducible:
100%

Steps to Reproduce:
1. Create a volume from an image.
2. Launch an instance from this volume.
3. Take a snapshot of the instance.
4. Delete the resulting volume snapshot.

Actual results:
The snapshot remains in the 'deleting' status; attempting to delete it again moves it to 'error_deleting'.

Expected results:
The snapshot should be deleted. 

Additional info:

The logs are attached.

Comment 2 Dafna Ron 2014-01-14 14:35:34 UTC
Is there a trace for the failure in the log? If so, can you paste it?

Comment 4 Eric Harney 2014-01-14 15:19:38 UTC
GlusterFS doesn't do zeroing.  The I/O required depends on which snapshot is being deleted; in some cases (deleting the active snapshot) the base volume image must be copied into the snapshot qcow2 file, so the time it takes depends on the size of the volume.
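A minimal sketch of why this is slow (not Cinder code; the function and data layout are illustrative assumptions): with a qcow2 backing chain of base <- active snapshot, removing the active snapshot as a separate file means every base block the snapshot did not overwrite must be copied into it, so the cost scales with the volume size, not with the amount of changed data.

```python
# Hypothetical model of merging a qcow2 overlay into a standalone image.
# base_blocks: block index -> data in the base image.
# snap_overlay: block index -> data written after the snapshot was taken.
def merge_active_snapshot(base_blocks, snap_overlay):
    merged = dict(snap_overlay)          # overlay blocks take precedence
    copied = 0
    for idx, data in base_blocks.items():
        if idx not in merged:            # unmodified block: must be copied in
            merged[idx] = data
            copied += 1
    return merged, copied

base = {i: f"base-{i}" for i in range(1000)}   # pretend 1000 blocks (~30 GB)
overlay = {i: f"new-{i}" for i in range(10)}   # only 10 blocks were changed
image, copied = merge_active_snapshot(base, overlay)
# Nearly the entire base image is copied even though little data changed.
```

This matches the observed behavior: a 30 GB volume takes well over the RPC timeout to flatten, regardless of how small the snapshot delta is.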

Can you attach a Nova compute log from when this occurred?

It is likely that we need Nova to send updates to Cinder while a long operation is occurring so it knows not to time out the operation.
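The pattern being suggested can be sketched as follows (a generic keep-alive wrapper, assuming illustrative names; this is not the Nova/Cinder API): a background thread periodically reports progress while the long operation runs, so the caller resets its timeout instead of firing mid-operation.

```python
# Hypothetical keep-alive pattern for a long-running operation.
import threading
import time

def run_with_heartbeat(long_operation, send_update, interval=0.05):
    """Run long_operation(), calling send_update() every `interval` seconds."""
    done = threading.Event()

    def heartbeat():
        # wait() returns False on timeout, True once done is set
        while not done.wait(interval):
            send_update()                # tell the caller we are still working

    t = threading.Thread(target=heartbeat)
    t.start()
    try:
        return long_operation()
    finally:
        done.set()                       # stop the heartbeat even on failure
        t.join()

def slow_delete():
    time.sleep(0.2)                      # stand-in for a long snapshot delete
    return "deleted"

updates = []
result = run_with_heartbeat(slow_delete, lambda: updates.append("keep-alive"))
```

Here `result` is "deleted" and several keep-alives are recorded while the delete is in flight; in the real fix the update would be an RPC from Nova back to Cinder.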

Comment 5 Yogev Rabl 2014-01-14 15:25:35 UTC
Created attachment 850007 [details]
compute.log

Comment 6 Eric Harney 2014-01-14 15:49:53 UTC
Actually, after reviewing the compute log, this looks like an instance of bug 1040711.

Comment 7 Ayal Baron 2014-01-15 14:33:18 UTC
(In reply to Eric Harney from comment #6)
> Actually, after reviewing the compute log, this looks like an instance of
> bug 1040711.

Closing as dup.

*** This bug has been marked as a duplicate of bug 1040711 ***

