Bug 1300301
Summary: | IOs failed with transport end point error while attach tier(shows authentication problem) and mount gets unmounted | ||||||||
---|---|---|---|---|---|---|---|---|---|
Product: | [Red Hat Storage] Red Hat Gluster Storage | Reporter: | Nag Pavan Chilakam <nchilaka> | ||||||
Component: | tier | Assignee: | Mohammed Rafi KC <rkavunga> | ||||||
Status: | CLOSED WONTFIX | QA Contact: | Nag Pavan Chilakam <nchilaka> | ||||||
Severity: | urgent | Docs Contact: | |||||||
Priority: | unspecified | ||||||||
Version: | rhgs-3.1 | CC: | hgowtham, mchangir, nbalacha, rhs-bugs, rkavunga, smohan | ||||||
Target Milestone: | --- | Keywords: | ZStream | ||||||
Target Release: | --- | ||||||||
Hardware: | Unspecified | ||||||||
OS: | Unspecified | ||||||||
Whiteboard: | tier-attach-detach | ||||||||
Fixed In Version: | Doc Type: | Bug Fix | |||||||
Doc Text: | Story Points: | --- | |||||||
Clone Of: | Environment: | ||||||||
Last Closed: | 2018-11-08 18:36:15 UTC | Type: | Bug | ||||||
Regression: | --- | Mount Type: | --- | ||||||
Documentation: | --- | CRM: | |||||||
Verified Versions: | Category: | --- | |||||||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||||
Cloudforms Team: | --- | Target Upstream Version: | |||||||
Embargoed: | |||||||||
Bug Depends On: | 1300564, 1300978 | ||||||||
Bug Blocks: | |||||||||
Attachments: |
|
Description
Nag Pavan Chilakam
2016-01-20 12:46:32 UTC
Created attachment 1116638 [details]
client error
It is very inconsistently reproducible. RCA: It is race between graph change in client graph and an option change in server graph. During server_reconfigure we authenticate each connected clients against the current options. To do this authentication we store previous values in a dictionary during the connection establishment phase (server_setvolume). If the authentication fails during reconfigure then we will disconnect the transport. Here it introduce a race between server_setvolume and reconfugure. If a reconfigure called before doing a setvolume, the transport will be disconnected. After three seconds time-out transport will be reconnected. Changing the component since this can be reproduced in any volume also this bug falls into protocol layer. NOTE: With RCA given in comment3, the failure should not umount upstream master patch merged. http://review.gluster.org/#/c/13271/ release 3.7 : http://review.gluster.org/#/c/13280/ Created attachment 1117135 [details]
mount log
patches mentioned in comment5 are merged in upstream, so the fix would be available for 3.2 as part of the rebase. Moving to MODIFIED. Patch available downstream as commit 30e4d0d. As tier is not being actively developed, I'm closing this bug. Feel free to open it if necessary. |