Description of problem: While running tests and doing graph change using volume set in a loop, the I/O on the fuse client exited with an ENOTCONN. It also made glusterd unresponsive. This was a 2x2 Distributed-replicate setup. Version-Release number of selected component (if applicable): 3.3.0qa38 (RHS AMI) Steps to Reproduce: 1. while true; do gluster volume set amitest performance.write-behind off; sleep 1; gluster volume set amitest performance.write-behind on; done 2. Run I/O on the mount point 3. Actual results: Mount point returned ENOTCONN. Glusterd was unresponsive. Expected results: Additional info: Glusterd log: [2012-05-02 10:32:48.484114] W [socket.c:1851:socket_server_event_handler] 0-socket.management: accept on 8 failed (Too many open files) [2012-05-02 10:32:48.484132] W [socket.c:1851:socket_server_event_handler] 0-socket.management: accept on 8 failed (Too many open files) [2012-05-02 10:32:48.484150] W [socket.c:1851:socket_server_event_handler] 0-socket.management: accept on 8 failed (Too many open files) [2012-05-02 10:32:48.484168] W [socket.c:1851:socket_server_event_handler] 0-socket.management: accept on 8 failed (Too many open files) [2012-05-02 10:32:48.484199] W [socket.c:1851:socket_server_event_handler] 0-socket.management: accept on 8 failed (Too many open files) [2012-05-02 10:32:48.484220] W [socket.c:1851:socket_server_event_handler] 0-socket.management: accept on 8 failed (Too many open files) [2012-05-02 10:32:48.484238] W [socket.c:1851:socket_server_event_handler] 0-socket.management: accept on 8 failed (Too many open files) [2012-05-02 10:32:48.484256] W [socket.c:1851:socket_server_event_handler] 0-socket.management: accept on 8 failed (Too many open files) [2012-05-02 10:32:48.484273] W [socket.c:1851:socket_server_event_handler] 0-socket.management: accept on 8 failed (Too many open files) [2012-05-02 10:32:48.484291] W [socket.c:1851:socket_server_event_handler] 0-socket.management: accept on 8 failed (Too many open files) [2012-05-02 10:32:48.484309] W [socket.c:1851:socket_server_event_handler] 0-socket.management: accept on 8 failed (Too many open files) [2012-05-02 10:32:48.484327] W [socket.c:1851:socket_server_event_handler] 0-socket.management: accept on 8 failed (Too many open files) [2012-05-02 10:32:48.484345] W [socket.c:1851:socket_server_event_handler] 0-socket.management: accept on 8 failed (Too many open files) [2012-05-02 10:32:48.484363] W [socket.c:1851:socket_server_event_handler] 0-socket.management: accept on 8 failed (Too many open files) [2012-05-02 10:32:48.484381] W [socket.c:1851:socket_server_event_handler] 0-socket.management: accept on 8 failed (Too many open files) [2012-05-02 10:32:48.484399] W [socket.c:1851:socket_server_event_handler] 0-socket.management: accept on 8 failed (Too many open files) [2012-05-02 10:32:48.484416] W [socket.c:1851:socket_server_event_handler] 0-socket.management: accept on 8 failed (Too many open files) [2012-05-02 10:32:48.484434] W [socket.c:1851:socket_server_event_handler] 0-socket.management: accept on 8 failed (Too many open files) [2012-05-02 10:32:48.484452] W [socket.c:1851:socket_server_event_handler] 0-socket.management: accept on 8 failed (Too many open files)
This is most likely a duplicate of bug #810089 - the fix to which did not go into v3.3.0qa38. I checked it on v3.3.0qa39 (which had the fix) and the number of sockets opened by glusterd did not go up. @Anush, Can you confirm my observations and if true, please mark this bug as duplicate of bug #810089. regards, Raghavendra.
need a update with qa40 or beyond.
Don't see this issue with qa41. Closing it now.