1697866 – Provide a way to detach a failed node

Bug 1697866 - Provide a way to detach a failed node

Summary: Provide a way to detach a failed node

Keywords:
Status:	CLOSED NEXTRELEASE
Alias:	None
Product:	GlusterFS
Classification:	Community
Component:	glusterd
Sub Component:
Version:	mainline
Hardware:	Unspecified
OS:	Unspecified
Priority:	low
Severity:	low
Target Milestone:	---
Assignee:	Sanju
QA Contact:
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:	1696334
TreeView+	depends on / blocked

Reported:	2019-04-09 08:28 UTC by Sanju
Modified:	2019-04-22 14:18 UTC (History)
CC List:	7 users (show)
Fixed In Version:
Clone Of:	1696334
Environment:
Last Closed:	2019-04-11 15:16:12 UTC
Regression:	---
Mount Type:	---
Documentation:	---
CRM:
Verified Versions:
Embargoed:
Dependent Products:

Attachments	(Terms of Use)

Links
System	ID	Private	Priority	Status	Summary	Last Updated
Gluster.org Gerrit	22534	0	None	Merged	glusterd: provide a way to detach failed node	2019-04-11 15:16:11 UTC

Description Sanju 2019-04-09 08:28:17 UTC

Description of problem:

When a gluster peer node has failed due to hardware issues, it should be possible to detach it.

Currently, the peer detach command fails because the peer hosts one or more bricks.

If delete of the volume that has that brick is attempted then volume delete fails with "Not all peers are up" error.

One way out is to use a replace-brick command and move the brick to some other node.

However, it might not be possible to replace-brick sometimes. 

A trick that worked for us was to use remove-brick to convert the replica 3 volume to replica 2 and then peer detach the node.


May be the peer detach command can show the trick in output. Something on the lines:


"This peer has one or more bricks. If the peer is lost and is not recoverable then you should use either replace-brick or remove-brick procedure to remove all bricks from the peer and attempt the peer detach again"

Comment 1 Worker Ant 2019-04-09 08:39:48 UTC

REVIEW: https://review.gluster.org/22534 (glusterd: provide a way to detach failed node) posted (#1) for review on master by Sanju Rakonde

Comment 2 Worker Ant 2019-04-11 15:16:12 UTC

REVIEW: https://review.gluster.org/22534 (glusterd: provide a way to detach failed node) merged (#3) on master by Atin Mukherjee

Note You need to log in before you can comment on or make changes to this bug.