Bug 1460245 - [GSS]Glustershd process crashes intermittently on SSL enabled volume in RHGS 3.2
[GSS]Glustershd process crashes intermittently on SSL enabled volume in RHGS 3.2
Status: NEW
Product: Red Hat Gluster Storage
Classification: Red Hat
Component: core (Show other bugs)
All All
unspecified Severity high
: ---
: ---
Assigned To: Ravishankar N
Rahul Hinduja
: Reopened
: 1478010 1499666 (view as bug list)
Depends On: 1596513 1597229 1597230
  Show dependency treegraph
Reported: 2017-06-09 09:14 EDT by Riyas Abdulrasak
Modified: 2018-07-02 06:13 EDT (History)
10 users (show)

See Also:
Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of:
Last Closed: 2017-12-07 02:18:59 EST
Type: Bug
Regression: ---
Mount Type: ---
Documentation: ---
Verified Versions:
Category: ---
oVirt Team: ---
RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: ---

Attachments (Terms of Use)

External Trackers
Tracker ID Priority Status Summary Last Updated
Red Hat Knowledge Base (Solution) 3212251 None None None 2017-10-10 10:58 EDT

  None (edit)
Description Riyas Abdulrasak 2017-06-09 09:14:34 EDT
Description of problem:

Glustershd process crashes intermittently on different nodes on the cluster. 

Version-Release number of selected component (if applicable):

RHGS 3.2

How reproducible:

Happens intermittently on customer environment. 

Actual results:

glustershd crashes

Expected results:

glustershd should not crash. 

Additional info:

- Cluster /volume specific information are in the next comment
- Complete bt of the crashdump will be attached to the bz 'gdb.txt'
Comment 19 Mohit Agrawal 2017-06-28 12:16:28 EDT

shd is getting below logs and these logs are showing ssl_setup_connection is throwing connect error but it is failing because socket_connect is getting connection refused from glusterd side may be other end point is not available.


[2017-06-08 00:32:43.786637] E [socket.c:3142:socket_connect] 0-glusterfs: connection attempt on  failed, (Connection refused)
[2017-06-08 00:32:43.786851] I [MSGID: 101190] [event-epoll.c:628:event_dispatch_epoll_worker] 0-epoll: Started thread with index 1
[2017-06-08 00:32:43.786890] E [socket.c:358:ssl_setup_connection] 0-glusterfs: SSL connect error (client: )
[2017-06-08 00:32:43.786912] E [socket.c:2447:socket_poller] 0-glusterfs: client setup failed
[2017-06-08 00:32:43.786955] E [glusterfsd-mgmt.c:1928:mgmt_rpc_notify] 0-glusterfsd-mgmt: failed to connect with remote-host: localhost (Transport endpoint is not connected)


We already solved this problem in ssl code, now it will try to establish connection with peer only while socket_connect is getting success.

In downstream issue is fixed from below patch

Mohit Agrawal
Comment 22 Ravishankar N 2017-08-24 05:15:01 EDT
*** Bug 1478010 has been marked as a duplicate of this bug. ***
Comment 23 Simon Reber 2017-10-11 10:44:56 EDT
*** Bug 1499666 has been marked as a duplicate of this bug. ***

Note You need to log in before you can comment on or make changes to this bug.