Bug 1300564

Summary: I/O failure during a graph change followed by an option change.
Product: [Community] GlusterFS Reporter: Mohammed Rafi KC <rkavunga>
Component: protocolAssignee: Mohammed Rafi KC <rkavunga>
Status: CLOSED CURRENTRELEASE QA Contact:
Severity: unspecified Docs Contact:
Priority: unspecified    
Version: mainlineCC: bugs, rgowdapp
Target Milestone: ---   
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: glusterfs-3.8rc2 Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of:
: 1300978 (view as bug list) Environment:
Last Closed: 2016-06-16 13:55:08 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Bug Depends On:    
Bug Blocks: 1300301, 1300978    

Description Mohammed Rafi KC 2016-01-21 07:26:01 UTC
Description of problem:

I/O failed for fuse mount when a graph changed followed by an option changed

Version-Release number of selected component (if applicable):

mainline

How reproducible:

very inconsistent. It is a race

Steps to Reproduce:

1. Start i/o on fuse mount
2. Disable write-behind (just to have a graph change)
3. immediately enable trash (just to have an option change in server side)
4. I/o will fail with transport end point not connected

Actual results:

I/O failure

Expected results:

I/O should not fail

Additional info:

Comment 1 Vijay Bellur 2016-01-21 07:27:12 UTC
REVIEW: http://review.gluster.org/13271 (protocol/server: Race between server_reconfigure and server_setvolume) posted (#1) for review on master by mohammed rafi  kc (rkavunga)

Comment 2 Vijay Bellur 2016-01-21 07:38:55 UTC
REVIEW: http://review.gluster.org/13271 (protocol/server: Race between server_reconfigure and server_setvolume) posted (#2) for review on master by mohammed rafi  kc (rkavunga)

Comment 3 Vijay Bellur 2016-01-21 08:27:03 UTC
REVIEW: http://review.gluster.org/13271 (protocol/server: Race between server_reconfigure and server_setvolume) posted (#3) for review on master by mohammed rafi  kc (rkavunga)

Comment 4 Vijay Bellur 2016-01-22 08:52:23 UTC
COMMIT: http://review.gluster.org/13271 committed in master by Raghavendra G (rgowdapp) 
------
commit 16f6579cb3e1214b1386fb530b8e16c8cbfdef33
Author: Mohammed Rafi KC <rkavunga>
Date:   Thu Jan 21 12:41:55 2016 +0530

    protocol/server: Race between server_reconfigure and server_setvolume
    
    During server_reconfigure we authenticate each connected clients
    against the current options. To do this authentication we store
    previous values in a dictionary during the connection establishment
    phase (server_setvolume). If the authentication fails during
    reconfigure then we will disconnect the transport.
    
    Here it introduce a race between server_setvolume and reconfugure.
    If a reconfigure called before doing a setvolume, the transport
    will be disconnected
    
    Change-Id: Icce2c28a171481327a06efd3901f8a5ee67b05ab
    BUG: 1300564
    Signed-off-by: Mohammed Rafi KC <rkavunga>
    Reviewed-on: http://review.gluster.org/13271
    Smoke: Gluster Build System <jenkins.com>
    Reviewed-by: Raghavendra G <rgowdapp>
    CentOS-regression: Gluster Build System <jenkins.com>
    Reviewed-by: Raghavendra Talur <rtalur>
    NetBSD-regression: NetBSD Build System <jenkins.org>

Comment 5 Niels de Vos 2016-06-16 13:55:08 UTC
This bug is getting closed because a release has been made available that should address the reported issue. In case the problem is still not fixed with glusterfs-3.8.0, please open a new bug report.

glusterfs-3.8.0 has been announced on the Gluster mailinglists [1], packages for several distributions should become available in the near future. Keep an eye on the Gluster Users mailinglist [2] and the update infrastructure for your distribution.

[1] http://blog.gluster.org/2016/06/glusterfs-3-8-released/
[2] http://thread.gmane.org/gmane.comp.file-systems.gluster.user