Bug 1796965 - bootstrap take a long time to completed with cluster-etcd-operator
Summary: bootstrap take a long time to completed with cluster-etcd-operator
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: OpenShift Container Platform
Classification: Red Hat
Component: Etcd
Version: 4.4
Hardware: Unspecified
OS: Unspecified
unspecified
unspecified
Target Milestone: ---
: 4.4.0
Assignee: Sam Batschelet
QA Contact: ge liu
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2020-01-31 16:01 UTC by Alay Patel
Modified: 2020-05-04 11:28 UTC (History)
0 users

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2020-05-04 11:28:32 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Github openshift cluster-etcd-operator pull 66 0 None closed Bug 1796965: *: reduce the bootstrap time by fixing implementation details 2020-03-20 13:03:41 UTC
Red Hat Product Errata RHBA-2020:0581 0 None None None 2020-05-04 11:28:51 UTC

Description Alay Patel 2020-01-31 16:01:06 UTC
Description of problem:

1. The cluster-etcd-operator introduced an init container that waits service account(namespace, token, ca.crt, service-ca.crt) related files to be synced to the static pod. The CI runs have indicated takes a long time for service-ca.crt to be synced, even after kube is up. 

2. The cluster-etcd-operator also introduced a new wait-for-ceo[1] command in the installer, that blocks the installer from destroying the bootstrap node before etcd has scaled to all 4 masters. The wait-for-ceo command also needs to wait for kube apiserver to roll out valid config. The loop checking this config errors out without checking all the config. It should collect the errors and attempt to check the config for all kube-apiserver pods.


[1] https://github.com/openshift/installer/blob/993c8cec1e8b1dbf58383fa52b264695187656be/data/data/bootstrap/files/usr/local/bin/bootkube.sh.template#L403
Version-Release number of selected component (if applicable):

Comment 2 ge liu 2020-02-14 04:55:19 UTC
This issue seems still be hit in installation randomly, https://bugzilla.redhat.com/show_bug.cgi?id=1798945, we may trace this issue with one bug, so close this one, pls correct me if there is misunderstanding.

Comment 4 errata-xmlrpc 2020-05-04 11:28:32 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2020:0581


Note You need to log in before you can comment on or make changes to this bug.