Bug 2011549 - [AFR] Constant intermittent "Transport endpoint is not connected" errors disrupting operations
Summary: [AFR] Constant intermittent "Transport endpoint is not connected" errors disr...
Keywords:
Status: CLOSED DUPLICATE of bug 1937314
Alias: None
Product: Red Hat Gluster Storage
Classification: Red Hat Storage
Component: core
Version: rhgs-3.5
Hardware: Unspecified
OS: Unspecified
unspecified
urgent
Target Milestone: ---
: ---
Assignee: Mohit Agrawal
QA Contact: Pranav Prakash
URL:
Whiteboard:
Depends On: 2017641
Blocks:
TreeView+ depends on / blocked
 
Reported: 2021-10-06 19:21 UTC by Andrew Robinson
Modified: 2024-12-20 21:19 UTC (History)
11 users (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2022-02-09 06:21:51 UTC
Embargoed:


Attachments (Terms of Use)

Description Andrew Robinson 2021-10-06 19:21:23 UTC
Before you record your issue, ensure you are using the latest version of Gluster.

Provide version-Release number of selected component (if applicable):

> glusterfs-6.0-56.2.el7rhgs.x86_64
 
Have you searched the Bugzilla archives for same/similar issues reported.

> 
 
Have you discovered any workarounds?. If not, Read the troubleshooting documentation to help solve your issue. ( https://mojo.redhat.com/groups/gss-gluster (Gluster feature and its troubleshooting)  https://access.redhat.com/articles/1365073 (Specific debug data that needs to be collected for GlusterFS to help troubleshooting)

> No


Describe the issue:(please be detailed as possible and provide log snippets)
[Provide TimeStamp when the issue is seen]

> The customer is getting intermittent but constant "Transport endpoint is not connected" errors when trying to perform gluster operations. The 'gluster volume status' shows all bricks online. However, if they run 'gluster volume heal <vol> info' or even an 'ls' on a volume mount, the command will run to completion sometimes and fail with a "Transport endpoint is not connected" error the other times. This is preventing them from getting work done with the cluster. This started happening about 15 hours before I write this.

The customer has rebooted all three gluster nodes and the three network switches the gluster nodes connect to. That does not seem to make any difference. 


Is this issue reproducible? If yes, share more details.:


Steps to Reproduce:
1.
2.
3.
Actual results:
 
Expected results:
 
Mandatory Information for all Bugs:
1 - gluster v <volname> info
2 - gluster v <volname> heal info
3 - gluster v <volname> status
4 - Fuse Mount/SMB/nfs-ganesha/OCS ???
 

Additional info:


Note You need to log in before you can comment on or make changes to this bug.