Bug 762177 (GLUSTER-445) - Transport endpoint is not connected
Summary: Transport endpoint is not connected
Keywords:
Status: CLOSED CURRENTRELEASE
Alias: GLUSTER-445
Product: GlusterFS
Classification: Community
Component: transport
Version: 2.0.7
Hardware: All
OS: Linux
low
high
Target Milestone: ---
Assignee: Raghavendra G
QA Contact:
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2009-12-05 21:11 UTC by Kris Buytaert
Modified: 2010-03-31 10:50 UTC (History)
3 users (show)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed:
Regression: RTA
Mount Type: ---
Documentation: ---
CRM:
Verified Versions:


Attachments (Terms of Use)

Description Kris Buytaert 2009-12-05 21:11:11 UTC
Upon trying to write a file to a glusterfs mounted volume 


cp /etc/glusterfs/glusterfs.vol.sample   .
cp: writing `./glusterfs.vol.sample': Transport endpoint is not
connected
cp: closing `./glusterfs.vol.sample': Resource temporarily unavailable




On the server log : 



[2009-12-04 18:14:05] N [glusterfsd.c:1306:main] glusterfs: Successfully
started
[2009-12-04 18:14:07] N [server-protocol.c:7065:mop_setvolume] server:
accepted client from 172.16.32.119:1021
[2009-12-04 18:14:08] N [server-protocol.c:7065:mop_setvolume] server:
accepted client from 172.16.32.119:1020
pending frames:

patchset: v2.0.8
signal received: 6
time of crash: 2009-12-04 18:14:35
configuration details:
argp 1
backtrace 1
db.h 1
dlfcn 1
fdatasync 1
libpthread 1
llistxattr 1
setfsid 1
spinlock 1
epoll.h 1
xattr.h 1
st_atim.tv_nsec 1
package-string: glusterfs 2.0.8
[0x520420]
/lib/libc.so.6(gsignal+0x46)[0x75c206]
/lib/libc.so.6(abort+0x101)[0x75dbd1]
/lib/libc.so.6[0x7952ab]
/lib/libc.so.6(__stack_chk_fail+0x44)[0x81d2c4]
/usr/lib/glusterfs/2.0.8/xlator/storage/posix.so[0x5b3374]
/usr/lib/glusterfs/2.0.8/xlator/storage/posix.so(do_xattrop
+0x6b1)[0x5ae0d1]
/usr/lib/glusterfs/2.0.8/xlator/storage/posix.so(posix_fxattrop
+0x41)[0x5ae1f1]
/usr/lib/libglusterfs.so.0(default_fxattrop+0xb2)[0x114892]
/usr/lib/libglusterfs.so.0(default_fxattrop+0xb2)[0x114892]
/usr/lib/glusterfs/2.0.8/xlator/protocol/server.so(server_fxattrop
+0x15e)[0x163d4e]
/usr/lib/glusterfs/2.0.8/xlator/protocol/server.so(protocol_server_interpret+0x146)[0x15ae36]
/usr/lib/glusterfs/2.0.8/xlator/protocol/server.so(protocol_server_pollin+0x97)[0x15b047]
/usr/lib/glusterfs/2.0.8/xlator/protocol/server.so(notify
+0xd1)[0x15b121]
/usr/lib/glusterfs/2.0.8/transport/socket.so(socket_event_poll_in
+0x3b)[0x263abb]
/usr/lib/glusterfs/2.0.8/transport/socket.so(socket_event_handler
+0xae)[0x264b2e]
/usr/lib/libglusterfs.so.0[0x127e9a]
/usr/lib/libglusterfs.so.0(event_dispatch+0x21)[0x126d61]
/usr/sbin/glusterfsd(main+0xcfa)[0x804b98a]
/lib/libc.so.6(__libc_start_main+0xdc)[0x748e9c]
/usr/sbin/glusterfsd[0x8049aa1]



Touching a file works,  writing actual data to it doesn't



Both glusterfs deamons are stopped.



My config is below




# file auto generated by /usr/bin/glusterfs-volgen (mount.vol)
# Cmd line:
# $ /usr/bin/glusterfs-volgen --export-directory /opt/ --raid 1 --name repstore1 172.16.32.159 172.16.32.119

# RAID 1
# TRANSPORT-TYPE tcp
# PORT 6996

volume 172.16.32.159
    type protocol/client
    option transport-type tcp
    option remote-host 172.16.32.159
    option remote-port 6996
    option remote-subvolume brick
end-volume

volume 172.16.32.119
    type protocol/client
    option transport-type tcp
    option remote-host 172.16.32.119
    option remote-port 6996
    option remote-subvolume brick
end-volume

volume mirror-0
    type cluster/replicate
    subvolumes 172.16.32.159 172.16.32.119
end-volume

volume writebehind
    type performance/write-behind
    option cache-size 4MB
    subvolumes mirror-0
end-volume

volume io-cache
    type performance/io-cache
    option cache-size 1GB
    subvolumes writebehind
end-volume

## file auto generated by /usr/bin/glusterfs-volgen (export.vol)
# Cmd line:
# $ /usr/bin/glusterfs-volgen --export-directory /opt/ --raid 1 --name repstore1 172.16.32.159 172.16.32.119

# TRANSPORT-TYPE tcp
# PORT 6996

volume posix
  type storage/posix
  option directory /opt/
end-volume

volume locks
    type features/locks
    subvolumes posix
end-volume

volume brick
    type performance/io-threads
    option thread-count 8
    subvolumes locks
end-volume

volume server
    type protocol/server
    option transport-type tcp
    option auth.addr.brick.allow *
    option listen-port 6996
    subvolumes brick
end-volume




I`m using a build from  the source  glusterfs-2.0.8-1.src.rpm on Centos i386

Comment 1 Harshavardhana 2009-12-08 19:07:53 UTC
Kris,

commit f88cd26064d7aa306720d4e4d48af9086d97fb13
Author: Amar Tumballi <amar>
Date:   Sun Nov 15 21:27:19 2009 +0000

    segfault fix in posix
    
    fd_ctx_get() was getting passed with wrong argument (type punned), and
    hence was getting segfaulted on 32bit systems.
    
    Signed-off-by: Amar Tumballi <amar>
    Signed-off-by: Anand V. Avati <avati.com>
    
    BUG: 383 (glusterfs server crash on 2.0.8)
    URL: http://bugs.gluster.com/cgi-bin/bugzilla3/show_bug.cgi?id=383

This bug was reproduced and its fixed in repository. Can you confirm that you
are running glusterfs server on a 32bit system?.

Comment 2 Kris Buytaert 2009-12-09 05:10:36 UTC
Yes,  running on a 32 bit version.

Comment 3 Harshavardhana 2009-12-09 05:14:12 UTC
Kris, 

Ok then we know this issue and its fixed in repository.  

we have fixed this in subsequent releases 2.0.9rc4 should have this fix. 

http://ftp.gluster.com/pub/gluster/glusterfs/qa-releases/glusterfs-2.0.9rc4.tar.gz

Comment 4 Kris Buytaert 2009-12-09 08:38:33 UTC
I can confirm that it has been fixed in that version !


Note You need to log in before you can comment on or make changes to this bug.