1228952 – Disperse volume : glusterfs crashed

Summary: Disperse volume : glusterfs crashed

Keywords:
Status:	CLOSED CURRENTRELEASE
Alias:	None
Product:	GlusterFS
Classification:	Community
Component:	disperse
Sub Component:
Version:	mainline
Hardware:	Unspecified
OS:	Unspecified
Priority:	unspecified
Severity:	unspecified
Target Milestone:	---
Assignee:	bugs@gluster.org
QA Contact:
Docs Contact:
URL:
Whiteboard:
Depends On:	1228529
Blocks:	1223636 1229331

Reported:	2015-06-07 02:53 UTC by Pranith Kumar K
Modified:	2016-06-16 13:09 UTC
CC List:	7 users
Fixed In Version:	glusterfs-3.8rc2
Clone Of:	1228529
Clones:	1229331
Environment:
Last Closed:	2016-06-16 13:09:29 UTC
Regression:	---
Mount Type:	---
Documentation:	---
CRM:
Verified Versions:
Embargoed:
Dependent Products:

Comment 1 Anand Avati 2015-06-07 03:00:28 UTC

REVIEW: http://review.gluster.org/11111 (cluster/ec: Prevent double unwind) posted (#1) for review on master by Pranith Kumar Karampuri (pkarampu)

Comment 2 Anand Avati 2015-06-07 05:38:42 UTC

REVIEW: http://review.gluster.org/11111 (cluster/ec: Prevent double unwind) posted (#2) for review on master by Pranith Kumar Karampuri (pkarampu)

Comment 3 Anand Avati 2015-06-08 12:00:58 UTC

REVIEW: http://review.gluster.org/11111 (cluster/ec: Prevent double unwind) posted (#3) for review on master by Pranith Kumar Karampuri (pkarampu)

Comment 4 Anand Avati 2015-06-09 05:09:24 UTC

COMMIT: http://review.gluster.org/11111 committed in master by Vijay Bellur (vbellur) 
------
commit 31866a490edf5b92886accac67eddf78b291d710
Author: Pranith Kumar K <pkarampu>
Date:   Sun Jun 7 10:24:08 2015 +0530

    cluster/ec: Prevent double unwind
    
    Problem:
    1) ec_access/ec_readlink_/ec_readdir[p] _cbks are trying to recover only from
    ENOTCONN.
    2) When the fop succeeds it unwinds right away. But when its
    ec_fop_manager resumes, if the number of bricks that are up is less than
    ec->fragments, the state machine will resume with -EC_STATE_REPORT which
    unwinds again. This will lead to crashes.
    
    Fix:
    - If fop fails retry on other subvols, as ESTALE/ENOENT/EBADFD etc are also
    recoverable.
    - unwind success/failure in _cbks
    
    Change-Id: I2cac3c2f9669a4e6160f1ff4abc39f0299303222
    BUG: 1228952
    Signed-off-by: Pranith Kumar K <pkarampu>
    Reviewed-on: http://review.gluster.org/11111
    Reviewed-by: Xavier Hernandez <xhernandez>
    Tested-by: Gluster Build System <jenkins.com>
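
The problem and fix described in the commit message above can be illustrated with a toy model. This is only a sketch under stated assumptions: every name in it (toy_fop_t, toy_unwind, toy_dispatch, toy_report) is hypothetical and not part of the cluster/ec code, and the "unwound" guard flag is a simplification chosen for illustration, not necessarily the mechanism the patch uses. It shows a fop that retries on the next subvolume after any failure (not only ENOTCONN) and answers the caller exactly once, even if a later "report" step tries to unwind again.

```c
#include <assert.h>
#include <errno.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical toy model; none of these names exist in cluster/ec. */
typedef struct {
    bool unwound;      /* set once the caller has been answered */
    int  unwind_count; /* how many times an answer was delivered */
    int  op_ret;       /* result delivered to the caller */
    int  op_errno;     /* errno delivered to the caller */
} toy_fop_t;

/* Deliver the result to the caller at most once. */
static void toy_unwind(toy_fop_t *fop, int op_ret, int op_errno)
{
    if (fop->unwound)
        return; /* a second unwind would reuse a frame the caller released */
    fop->unwound = true;
    fop->unwind_count++;
    fop->op_ret = op_ret;
    fop->op_errno = op_errno;
}

/*
 * Try each subvolume in turn. results[i] is 0 on success or an errno
 * value on failure. Any failure moves on to the next subvolume; the
 * last errno seen is reported if every subvolume fails.
 */
static void toy_dispatch(toy_fop_t *fop, const int *results, size_t n)
{
    int last_errno = ENOTCONN; /* assumed default when no subvolume is tried */

    assert(fop != NULL);
    for (size_t i = 0; i < n; i++) {
        if (results[i] == 0) {
            toy_unwind(fop, 0, 0);
            return;
        }
        last_errno = results[i];
    }
    toy_unwind(fop, -1, last_errno);
}

/* Models the state machine resuming in a report state after the callback. */
static void toy_report(toy_fop_t *fop)
{
    toy_unwind(fop, -1, EIO); /* ignored if the callback already unwound */
}
```

Calling toy_dispatch() and then toy_report() on the same toy_fop_t leaves unwind_count at 1, which is the invariant whose violation the commit message identifies as the cause of the crash.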

Comment 5 Nagaprasad Sathyanarayana 2015-10-25 15:01:13 UTC

The fix for this BZ is already present in a GlusterFS release. A clone of this BZ has been fixed in a GlusterFS release and closed, so this mainline BZ is being closed as well.

Comment 6 Niels de Vos 2016-06-16 13:09:29 UTC

This bug is getting closed because a release has been made available that should address the reported issue. In case the problem is still not fixed with glusterfs-3.8.0, please open a new bug report.

glusterfs-3.8.0 has been announced on the Gluster mailing lists [1]; packages for several distributions should become available in the near future. Keep an eye on the Gluster Users mailing list [2] and the update infrastructure for your distribution.

[1] http://blog.gluster.org/2016/06/glusterfs-3-8-released/
[2] http://thread.gmane.org/gmane.comp.file-systems.gluster.user
