Bug 1340361
| Summary: | Call_bail of a frame due to not able to find a saved frame in reply | |||
|---|---|---|---|---|
| Product: | [Community] GlusterFS | Reporter: | Raghavendra G <rgowdapp> | |
| Component: | rpc | Assignee: | Raghavendra G <rgowdapp> | |
| Status: | CLOSED DUPLICATE | QA Contact: | ||
| Severity: | unspecified | Docs Contact: | ||
| Priority: | unspecified | |||
| Version: | mainline | CC: | bugs | |
| Target Milestone: | --- | Keywords: | Triaged | |
| Target Release: | --- | |||
| Hardware: | Unspecified | |||
| OS: | Unspecified | |||
| Whiteboard: | ||||
| Fixed In Version: | Doc Type: | If docs needed, set a value | ||
| Doc Text: | Story Points: | --- | ||
| Clone Of: | ||||
| : | 1340362 1341182 1341183 1341184 (view as bug list) | Environment: | ||
| Last Closed: | 2017-09-06 10:48:18 UTC | Type: | Bug | |
| Regression: | --- | Mount Type: | --- | |
| Documentation: | --- | CRM: | ||
| Verified Versions: | Category: | --- | ||
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | ||
| Cloudforms Team: | --- | Target Upstream Version: | ||
| Embargoed: | ||||
| Bug Depends On: | ||||
| Bug Blocks: | 1341182, 1341183, 1341184 | |||
|
Description
Raghavendra G
2016-05-27 07:32:31 UTC
REVIEW: http://review.gluster.org/14547 (rpc-clnt: save the frame before submitting request to transport) posted (#1) for review on master by Raghavendra G (rgowdapp) > As for the RCA for not able to lookup a saved frame for response, I saw in
> rpc-clnt:
>
> 1. Submit the request to transport for transmission to brick.
> 2. Save the frame for future reference while processing reply.
>
> Now, if we get a response between 1 and 2 (before we were able to save the
> frame), we would be saving the frame of a request whose reply is already
> received. This can result in call-bail.
This RCA is incorrect, since both 1 and 2 happen atomically under lock. When looking for a saved frame during reply processing we acquire same lock and hence atomicity is preserved.
*** This bug has been marked as a duplicate of bug 1421937 *** bz 1421937 speaks of a corruption in call-back submit codepath. This can result in the corruption of rpc reply in general and xid in particular. If xid is corrupted, we would not be able to map the reply with saved frame resulting in a call-bail. Since there is no reproducer, we cannot confirm the above hypothesis. Closing this bug as a duplicate of bz 1421937. Please re-open if it reproduced in versions higher than 3.8.11 |