Bug 1697866 - Provide a way to detach a failed node
Summary: Provide a way to detach a failed node
Keywords:
Status: CLOSED NEXTRELEASE
Alias: None
Product: GlusterFS
Classification: Community
Component: glusterd
Version: mainline
Hardware: Unspecified
OS: Unspecified
Priority: low
Severity: low
Target Milestone: ---
Assignee: Sanju
QA Contact:
URL:
Whiteboard:
Depends On:
Blocks: 1696334
Reported: 2019-04-09 08:28 UTC by Sanju
Modified: 2019-04-22 14:18 UTC (History)
CC List: 7 users

Fixed In Version:
Clone Of: 1696334
Environment:
Last Closed: 2019-04-11 15:16:12 UTC
Regression: ---
Mount Type: ---
Documentation: ---
CRM:
Verified Versions:
Embargoed:



Links
System: Gluster.org Gerrit
ID: 22534
Private: 0
Priority: None
Status: Merged
Summary: glusterd: provide a way to detach failed node
Last Updated: 2019-04-11 15:16:11 UTC

Description Sanju 2019-04-09 08:28:17 UTC
Description of problem:

When a gluster peer node has failed due to hardware issues, it should be possible to detach it.

Currently, the peer detach command fails because the peer hosts one or more bricks.

Attempting to delete the volume that contains that brick also fails, with a "Not all peers are up" error.

One way out is to use the replace-brick command to move the brick to another node.

However, replace-brick is not always possible.

A trick that worked for us was to use remove-brick to convert the replica 3 volume to replica 2 and then peer detach the node.
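
The workaround above can be sketched as a small helper that only builds the two gluster CLI invocations and does not run them. This is a rough illustration, not the fix from the linked review: the function name, the volume name, the host name and the brick path are all hypothetical, and the trailing `force` on remove-brick is an assumption about how the replica count would be reduced when the node is already dead.

```python
from typing import List


def detach_failed_node_commands(
    volume: str,
    failed_host: str,
    brick_path: str,
    new_replica_count: int = 2,
) -> List[List[str]]:
    """Build (but do not run) the commands for the remove-brick workaround.

    All arguments are placeholders supplied by the caller; nothing here
    is taken from a real cluster.
    """
    # Bricks are addressed as "<host>:<path>" on the gluster command line.
    brick = f"{failed_host}:{brick_path}"
    return [
        # Step 1: shrink the replica set so the failed node's brick is dropped.
        # "force" is an assumption: the failed node cannot take part in a
        # normal remove-brick operation.
        [
            "gluster", "volume", "remove-brick", volume,
            "replica", str(new_replica_count), brick, "force",
        ],
        # Step 2: the peer no longer hosts any brick, so detach it.
        ["gluster", "peer", "detach", failed_host],
    ]


if __name__ == "__main__":
    # Hypothetical names, for illustration only.
    for cmd in detach_failed_node_commands("myvol", "node3", "/bricks/myvol/brick1"):
        print(" ".join(cmd))
```

If the failed node hosts bricks for several volumes, the first command would have to be repeated for each of them before the peer detach can succeed.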


Maybe the peer detach command could show this workaround in its output, something along the lines of:


"This peer has one or more bricks. If the peer is lost and is not recoverable, then you should use either the replace-brick or the remove-brick procedure to remove all bricks from the peer and attempt the peer detach again."

Comment 1 Worker Ant 2019-04-09 08:39:48 UTC
REVIEW: https://review.gluster.org/22534 (glusterd: provide a way to detach failed node) posted (#1) for review on master by Sanju Rakonde

Comment 2 Worker Ant 2019-04-11 15:16:12 UTC
REVIEW: https://review.gluster.org/22534 (glusterd: provide a way to detach failed node) merged (#3) on master by Atin Mukherjee

