Bug 1734554 - Create a script to remove a failed etcd member and to allow it to be replaced
Summary: Create a script to remove a failed etcd member and to allow it to be replaced
Alias: None
Product: OpenShift Container Platform
Classification: Red Hat
Component: Etcd
Version: 4.1.0
Hardware: x86_64
OS: Linux
Target Milestone: ---
Target Release: 4.2.0
Assignee: Sam Batschelet
QA Contact: ge liu
Depends On:
Reported: 2019-07-30 20:53 UTC by Suresh Kolichala
Modified: 2019-10-16 06:34 UTC (History)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Last Closed: 2019-10-16 06:34:08 UTC
Target Upstream Version:

System | ID | Private | Priority | Status | Summary | Last Updated
Github | openshift/machine-config-operator/pull/1056 | 0 | None | closed | Bug 1734554: DR: add etcd-member-remove.sh | 2020-08-10 19:02:28 UTC
Github | openshift/machine-config-operator/pull/1073 | 0 | None | closed | Bug 1734554: provide etcd-member-add.sh for adding back a member with valid certs | 2020-08-10 19:02:28 UTC
Red Hat Product Errata | RHBA-2019:2922 | 0 | None | None | None | 2019-10-16 06:34:20 UTC

Description Suresh Kolichala 2019-07-30 20:53:03 UTC
Description of problem:
Currently, we have various Disaster Recovery (DR) scenarios that are covered in 4.1. There are docs and scripts describing these recovery processes.

As a more general admin action, we want to provide a script that removes a failed etcd member and allows it to be replaced while the cluster is still running. The script would assume that the TLS certs already exist.

Version-Release number of selected component (if applicable):

How reproducible:
This is a request for a new script to remove/replace one of the etcd members.

Steps to Reproduce:

Actual results:

Expected results:

Additional info:

Comment 1 Suresh Kolichala 2019-07-30 21:13:04 UTC
Sam adds in a personal communication:
The idea is to provide something like the following.

$ ./etcd-member-remove.sh $name

$ ./etcd-member-add.sh $peer-urls
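
The two commands above could be sketched roughly as follows. This is only an illustration of the idea, not the scripts that were later merged for this bug: the function names are hypothetical, and it assumes etcdctl with the v3 API (ETCDCTL_API=3) and that the endpoint and existing TLS certs are already supplied through the environment (for example ETCDCTL_ENDPOINTS, ETCDCTL_CACERT, ETCDCTL_CERT, ETCDCTL_KEY).

```shell
# Hypothetical sketch; not the merged etcd-member-remove.sh / etcd-member-add.sh.

# Print the ID of the member named $1. Reads the default (simple) output of
# `etcdctl member list` on stdin, where each line looks like:
#   ID, status, name, peer URLs, client URLs
member_id_by_name() {
  awk -F', ' -v name="$1" '$3 == name { print $1 }'
}

# Remove a failed member by name (etcdctl removes by ID, so look the ID up first).
remove_member() {
  local name="$1" id
  id="$(etcdctl member list | member_id_by_name "$name")"
  if [ -z "$id" ]; then
    echo "no etcd member named $name" >&2
    return 1
  fi
  etcdctl member remove "$id"
}

# Register a replacement member with the running cluster.
# $2 is a comma-separated list of peer URLs.
add_member() {
  local name="$1" peer_urls="$2"
  etcdctl member add "$name" --peer-urls="$peer_urls"
}
```

With these definitions, `remove_member "$name"` followed by `add_member "$name" "$peer_urls"` mirrors the remove-then-add flow proposed above; the new member still has to be started on its host with valid certs before it joins.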

Comment 5 ge liu 2019-09-03 10:05:02 UTC
Hello Sam, are the scripts ready for testing? If so, I am very interested in testing them. Thanks.

Comment 6 Sam Batschelet 2019-09-03 11:47:23 UTC

Yes, member remove [1] and member add [2] have merged.

[1] https://github.com/openshift/machine-config-operator/pull/1056
[2] https://github.com/openshift/machine-config-operator/pull/1073

Comment 8 ge liu 2019-09-05 08:27:16 UTC
The scripts are in the 4.2 payload and have been tested. Filing a separate bug to track issues with the scripts themselves.

Comment 9 errata-xmlrpc 2019-10-16 06:34:08 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

