Bug 762653 (GLUSTER-921)

Summary: Client readdir crash after a remount in a replicated setup
Product: [Community] GlusterFS Reporter: Pavan Vilas Sondur <pavan>
Component: replicateAssignee: Pavan Vilas Sondur <pavan>
Status: CLOSED CURRENTRELEASE QA Contact:
Severity: high Docs Contact:
Priority: urgent    
Version: mainlineCC: gluster-bugs, rabhat
Target Milestone: ---   
Target Release: ---   
Hardware: All   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:

Description Pavan Vilas Sondur 2010-05-12 10:36:42 UTC
Pasting the mail as reported in the users-list:

Hi Guys

I have 2 physical servers, which I'm trying to setup in a HA pair.. I'm
sharing 2 volumes (home-gfs and shared-gfs) across these servers in
RAID1 (replicated), then I'm mounting clients on each server as home/
and shared/, if I lose one of the Servers and then remount it, the
client on the other server seems to crash out. The following is the long
entries leading to the crash -


[2010-04-09 16:50:44] E [socket.c:762:socket_connect_finish]
192.168.100.31-2: connection to 192.168.100.31:6996 failed (Connection
refused)
[2010-04-09 16:50:44] E [socket.c:762:socket_connect_finish]
192.168.100.31-2: connection to 192.168.100.31:6996 failed (Connection
refused)
[2010-04-09 16:51:31] N [client-protocol.c:6246:client_setvolume_cbk]
192.168.100.31-2: Connected to 192.168.100.31:6996, attached to remote
volume 'brick2'.
[2010-04-09 16:51:31] E
[afr-self-heal-common.c:1237:sh_missing_entries_create] mirror-1:
unknown file type: 01
pending frames:
frame : type(1) op(READDIRP)

patchset: git://git.sv.gnu.org/gluster.git
signal received: 11
time of crash: 2010-04-09 16:51:31
configuration details:
argp 1
backtrace 1
dlfcn 1
fdatasync 1
libpthread 1
llistxattr 1
setfsid 1
spinlock 1
epoll.h 1
xattr.h 1
st_atim.tv_nsec 1
package-string: glusterfs 3.0.0git
[0x371420]
/usr/local/lib/glusterfs/3.0.0git/xlator/protocol/client.so(client_readd
irp+0x1b4)[0x28d224]
/usr/local/lib/glusterfs/3.0.0git/xlator/cluster/replicate.so(afr_do_rea
ddir+0x4e2)[0x98a722]
/usr/local/lib/glusterfs/3.0.0git/xlator/cluster/replicate.so(afr_readdi
rp+0x48)[0x98a988]
/usr/local/lib/glusterfs/3.0.0git/xlator/mount/fuse.so[0x113e98]
/usr/local/lib/glusterfs/3.0.0git/xlator/mount/fuse.so[0x11b25a]
/lib/libpthread.so.0[0xbeb73b]
/lib/libc.so.6(clone+0x5e)[0xb69cfe]

Comment 1 Pavan Vilas Sondur 2010-08-10 06:14:41 UTC
Received the following mail. Marking it as fixed. Please open the bug if the crash happens again. 
-------- Original Message --------
Subject: 	RE: [Gluster-users] Gluster crashing
Date: 	Wed, 12 May 2010 11:46:49 +0100
From: 	Kelvin Westlake <Kelvin.uk>
To: 	Pavan Sondur <pavan>


Hi Pavan

It seems my issue's have unknowingly been fixed with 3.0.4, the client is no longer crashing and replication is working :)

Thanks
Kelvin