Bug 1245142
| Summary: | DHT-rebalance: Rebalance hangs on distribute volume when glusterd is stopped on peer node | |||
|---|---|---|---|---|
| Product: | [Community] GlusterFS | Reporter: | Anand Nekkunti <anekkunt> | |
| Component: | glusterd | Assignee: | Anand Nekkunti <anekkunt> | |
| Status: | CLOSED CURRENTRELEASE | QA Contact: | ||
| Severity: | high | Docs Contact: | ||
| Priority: | high | |||
| Version: | mainline | CC: | amukherj, anekkunt, bugs, gluster-bugs, nsathyan, rcyriac, rhs-bugs, storage-qa-internal, trao | |
| Target Milestone: | --- | Keywords: | Reopened, Triaged | |
| Target Release: | --- | |||
| Hardware: | x86_64 | |||
| OS: | Linux | |||
| Whiteboard: | Rebalance | |||
| Fixed In Version: | glusterfs-3.8rc2 | Doc Type: | Bug Fix | |
| Doc Text: | Story Points: | --- | ||
| Clone Of: | 1244527 | |||
| : | 1249925 (view as bug list) | Environment: | ||
| Last Closed: | 2016-06-16 13:25:33 UTC | Type: | Bug | |
| Regression: | --- | Mount Type: | --- | |
| Documentation: | --- | CRM: | ||
| Verified Versions: | Category: | --- | ||
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | ||
| Cloudforms Team: | --- | Target Upstream Version: | ||
| Embargoed: | ||||
| Bug Depends On: | 1244527 | |||
| Bug Blocks: | 1249925 | |||
|
Description
Anand Nekkunti
2015-07-21 10:05:12 UTC
REVIEW: http://review.gluster.org/11728 (glusrterd:getting txn_if from my_frame in op_sm call back RCA: when one of the glusterd went down during rebalance start then call back function (_glusterd_commit_op_cbk ) called with rpc_status is -1, In case of rpc success we are getting txn_id from response but in failure case of rpc, we are referring global_txn_id which is always Zero,this resulting op_sm into inconsistent state.) posted (#1) for review on master by Anand Nekkunti (anekkunt) REVIEW: http://review.gluster.org/11728 (glusrterd: getting txn_id from myframe in op_sm call back. RCA: when one of the glusterd went down during rebalance start then call back function (_glusterd_commit_op_cbk ) called with rpc_status is -1, In case of rpc success we are getting txn_id from response but in failure case of rpc, we are referring global_txn_id which is always Zero,this resulting op_sm into inconsistent state.) posted (#2) for review on master by Anand Nekkunti (anekkunt) REVIEW: http://review.gluster.org/11728 (glusterd: getting txn_id from frame->cookie in op_sm call back) posted (#3) for review on master by Anand Nekkunti (anekkunt) REVIEW: http://review.gluster.org/11728 (glusterd: getting txn_id from frame->cookie in op_sm call back) posted (#5) for review on master by Anand Nekkunti (anekkunt) Fix for this BZ is already present in a GlusterFS release. You can find clone of this BZ, fixed in a GlusterFS release and closed. Hence closing this mainline BZ as well. This bug is getting closed because a release has been made available that should address the reported issue. In case the problem is still not fixed with glusterfs-3.8.0, please open a new bug report. glusterfs-3.8.0 has been announced on the Gluster mailinglists [1], packages for several distributions should become available in the near future. Keep an eye on the Gluster Users mailinglist [2] and the update infrastructure for your distribution. [1] http://blog.gluster.org/2016/06/glusterfs-3-8-released/ [2] http://thread.gmane.org/gmane.comp.file-systems.gluster.user |