1796965 – bootstrap take a long time to completed with cluster-etcd-operator

Bug 1796965 - bootstrap take a long time to completed with cluster-etcd-operator

Summary: bootstrap take a long time to completed with cluster-etcd-operator

Keywords:
Status:	CLOSED ERRATA
Alias:	None
Product:	OpenShift Container Platform
Classification:	Red Hat
Component:	Etcd
Sub Component:
Version:	4.4
Hardware:	Unspecified
OS:	Unspecified
Priority:	unspecified
Severity:	unspecified
Target Milestone:	---
Target Release:	4.4.0
Assignee:	Sam Batschelet
QA Contact:	ge liu
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+	depends on / blocked

Reported:	2020-01-31 16:01 UTC by Alay Patel
Modified:	2020-05-04 11:28 UTC (History)
CC List:	0 users
Fixed In Version:
Doc Type:	If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed:	2020-05-04 11:28:32 UTC
Target Upstream Version:
Embargoed:

Attachments	(Terms of Use)

Links
System	ID	Private	Priority	Status	Summary	Last Updated
Github	openshift cluster-etcd-operator pull 66	0	None	closed	Bug 1796965: *: reduce the bootstrap time by fixing implementation details	2020-03-20 13:03:41 UTC
Red Hat Product Errata	RHBA-2020:0581	0	None	None	None	2020-05-04 11:28:51 UTC

Description Alay Patel 2020-01-31 16:01:06 UTC

Description of problem:

1. The cluster-etcd-operator introduced an init container that waits service account(namespace, token, ca.crt, service-ca.crt) related files to be synced to the static pod. The CI runs have indicated takes a long time for service-ca.crt to be synced, even after kube is up. 

2. The cluster-etcd-operator also introduced a new wait-for-ceo[1] command in the installer, that blocks the installer from destroying the bootstrap node before etcd has scaled to all 4 masters. The wait-for-ceo command also needs to wait for kube apiserver to roll out valid config. The loop checking this config errors out without checking all the config. It should collect the errors and attempt to check the config for all kube-apiserver pods.


[1] https://github.com/openshift/installer/blob/993c8cec1e8b1dbf58383fa52b264695187656be/data/data/bootstrap/files/usr/local/bin/bootkube.sh.template#L403
Version-Release number of selected component (if applicable):

Comment 2 ge liu 2020-02-14 04:55:19 UTC

This issue seems still be hit in installation randomly, https://bugzilla.redhat.com/show_bug.cgi?id=1798945, we may trace this issue with one bug, so close this one, pls correct me if there is misunderstanding.

Comment 4 errata-xmlrpc 2020-05-04 11:28:32 UTC

Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2020:0581

Note You need to log in before you can comment on or make changes to this bug.