1470533 – Brick Mux Setup: brick processes(glusterfsd) crash after a restart of volume which was preceded with some actions

Bug 1470533 - Brick Mux Setup: brick processes(glusterfsd) crash after a restart of volume which was preceded with some actions

Summary: Brick Mux Setup: brick processes(glusterfsd) crash after a restart of volume ...

Keywords:
Status:	CLOSED CURRENTRELEASE
Alias:	None
Product:	GlusterFS
Classification:	Community
Component:	core
Sub Component:
Version:	mainline
Hardware:	Unspecified
OS:	Unspecified
Priority:	high
Severity:	urgent
Target Milestone:	---
Assignee:	Mohit Agrawal
QA Contact:
Docs Contact:
URL:
Whiteboard:	brick-multiplexing
Depends On:
Blocks:	1468514
TreeView+	depends on / blocked

Reported:	2017-07-13 06:43 UTC by Mohit Agrawal
Modified:	2017-09-05 17:36 UTC (History)
CC List:	7 users (show)
Fixed In Version:	glusterfs-3.12.0
Clone Of:	1468514
Environment:
Last Closed:	2017-09-05 17:36:59 UTC
Regression:	---
Mount Type:	---
Documentation:	---
CRM:
Verified Versions:
Embargoed:
Dependent Products:

Attachments	(Terms of Use)

Comment 1 Mohit Agrawal 2017-07-13 06:49:43 UTC

RCA: Brick process is getting crash at the time of stop the volume because in posix notify 
     we are trying to close dir for mount_lock but dir handle has not been set 
     to NULL after close directory.

>>>>>>>>>>>>>>>>>>>
      f 5
#5  0x00007f3b2d6e1c15 in sys_closedir (dir=<optimized out>) at syscall.c:113
113	        return closedir (dir);
(gdb) f 6
#6  0x00007f3b1bde9157 in notify (this=<optimized out>, event=<optimized out>, 
    data=<optimized out>) at posix.c:6618
6618	                        (void) sys_closedir (priv->mount_lock);
(gdb) l
6613	                if (priv->fsyncer) {
6614	                        (void) gf_thread_cleanup_xint (priv->fsyncer);
6615	                        priv->fsyncer = 0;
6616	                }
6617	                if (priv->mount_lock)
6618	                        (void) sys_closedir (priv->mount_lock);


>>>>>>>>>>>>>>>>>>>>>>>>>>>>

Regards
Mohit Agrawal

Comment 2 Worker Ant 2017-07-13 06:58:44 UTC

REVIEW: https://review.gluster.org/17767 (posix: brick process crash after stop the volume while brick mux is on) posted (#1) for review on master by MOHIT AGRAWAL (moagrawa)

Comment 3 Worker Ant 2017-07-13 12:39:34 UTC

COMMIT: https://review.gluster.org/17767 committed in master by Jeff Darcy (jeff.us) 
------
commit 61db7125a5b8db0bd4dd09b423bb54415c8bd484
Author: Mohit Agrawal <moagrawa>
Date:   Thu Jul 13 12:23:13 2017 +0530

    posix: brick process crash after stop the volume while brick mux is on
    
    Problem: sometime brick process is getting crash after stop the volume
             while brick mux is enabled and no. of volumes are high
    
    Solution: In posix notify at the time close mount_lock dir , dir handle
              needs to set NULL to avoid the reuse of same dir handle.
    
    BUG: 1470533
    Change-Id: Ifd41c20b3c597317851f91049a7c801949840b16
    Signed-off-by: Mohit Agrawal <moagrawa>
    Reviewed-on: https://review.gluster.org/17767
    Smoke: Gluster Build System <jenkins.org>
    Reviewed-by: Amar Tumballi <amarts>
    CentOS-regression: Gluster Build System <jenkins.org>
    Reviewed-by: Jeff Darcy <jeff.us>

Comment 4 Shyamsundar 2017-09-05 17:36:59 UTC

This bug is getting closed because a release has been made available that should address the reported issue. In case the problem is still not fixed with glusterfs-3.12.0, please open a new bug report.

glusterfs-3.12.0 has been announced on the Gluster mailinglists [1], packages for several distributions should become available in the near future. Keep an eye on the Gluster Users mailinglist [2] and the update infrastructure for your distribution.

[1] http://lists.gluster.org/pipermail/announce/2017-September/000082.html
[2] https://www.gluster.org/pipermail/gluster-users/

Note You need to log in before you can comment on or make changes to this bug.