2090628 – sometimes a voting member is not added to the etcd-endpoints configmap

Bug 2090628 - sometimes a voting member is not added to the etcd-endpoints configmap

Summary: sometimes a voting member is not added to the etcd-endpoints configmap

Keywords:
Status:	CLOSED DUPLICATE of bug 2093819
Alias:	None
Product:	OpenShift Container Platform
Classification:	Red Hat
Component:	Etcd
Sub Component:
Version:	4.11
Hardware:	Unspecified
OS:	Unspecified
Priority:	high
Severity:	high
Target Milestone:	---
Target Release:	4.11.0
Assignee:	Allen Ray
QA Contact:	ge liu
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+	depends on / blocked

Reported:	2022-05-26 07:29 UTC by Lukasz Szaszkiewicz
Modified:	2022-07-20 11:22 UTC (History)
CC List:	8 users (show)
Fixed In Version:
Doc Type:	If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed:	2022-07-20 11:22:31 UTC
Target Upstream Version:
Embargoed:
Flags:	alray: needinfo- alray: needinfo-

Attachments	(Terms of Use)

Description Lukasz Szaszkiewicz 2022-05-26 07:29:55 UTC

I don't know how common is the issue. It was captured by the scaling test we added. A newly created machine was promoted to a voting member by never made it to the etcd-endpoints configmap.

Timeline:

At 19:37:48: CEO successfully promoted learner member https://10.0.0.7:2380
At ~19:42:33 newly promoted member (ID: 9fc4382989977f7e) was elected as a leader at term 8
At ~19.44:22 the test deleted the machine

The machine was never deleted because the removal controller reads data from the etcd-endpoints configmap which indicated no excessive machines.


Link to CI run: https://prow.ci.openshift.org/view/gcs/origin-ci-test/logs/periodic-ci-openshift-release-master-nightly-4.11-e2e-gcp-fips-serial/1527713140295340032

Comment 2 Lukasz Szaszkiewicz 2022-06-07 07:47:49 UTC

I think I have another run with this issue https://prow.ci.openshift.org/view/gs/origin-ci-test/logs/periodic-ci-openshift-multiarch-master-nightly-4.11-ocp-e2e-serial-aws-arm64/1533937747629182976

Comment 3 Lukasz Szaszkiewicz 2022-06-07 07:56:53 UTC

more results https://search.ci.openshift.org/?search=unexpected+number+of+voting+members+in+the+openshift-etcd%2Fetcd-endpoints&maxAge=48h&context=1&type=all&name=&excludeName=&maxMatches=5&maxBytes=20971520&groupBy=job

Comment 6 Allen Ray 2022-06-22 13:52:44 UTC

After discussing with @tjungblu and @htariq, we decided that this shouldn't be a blocker+ because it isn't perma-failing, currently doesn't have a reproducer, and haven't heard anything from the field.

Note You need to log in before you can comment on or make changes to this bug.