Bug 1745909 - [upi-gcp] control plane needs to be added to lb before "Monitor for bootstrap-complete"
Summary: [upi-gcp] control plane needs to be added to lb before "Monitor for bootstrap...
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: OpenShift Container Platform
Classification: Red Hat
Component: Installer
Version: 4.2.0
Hardware: Unspecified
OS: Unspecified
medium
medium
Target Milestone: ---
: 4.2.0
Assignee: Jeremiah Stuever
QA Contact: liujia
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2019-08-27 08:15 UTC by liujia
Modified: 2019-10-16 06:38 UTC (History)
2 users (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2019-10-16 06:37:54 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Red Hat Product Errata RHBA-2019:2922 0 None None None 2019-10-16 06:38:02 UTC

Description liujia 2019-08-27 08:15:01 UTC
Document URL: 
https://github.com/openshift/installer/blob/master/docs/user/gcp/install_upi.md

Section Number and Name: 

Describe the issue: 
1. [Monitor for bootstrap-complete] step should be placed after [Pivot load balancers to control plane] step.

If follow the steps in above doc, the "wait for bootstrap" step will fail due to temporary control plane teared down after bootstrap complete. But at that time, the api and int lb still target to bootstrap node. So the api can not access at all. And from the gcp console we can find that the lb of api and int are not healthy all the time.

# ./openshift-install wait-for bootstrap-complete --dir test
INFO Waiting up to 30m0s for the Kubernetes API at https://api.jliu.origin-gce.dev.openshift.com:6443... 
INFO Use the following commands to gather logs from the cluster 
INFO openshift-install gather bootstrap --help    
FATAL waiting for Kubernetes API: context deadline exceeded 

So do step [Pivot load balancers to control plane] to update the lb's instances. And then ran "wait for bootstrap" will succeed and curl https://api.jliu.origin-gce.dev.openshift.com:6443/ succeed.

# gcloud compute target-pools add-instances ${INFRA_ID}-api-target-pool --instances-zone="${REGION}-d" --instances=${INFRA_ID}-m-0
Updated [https://www.googleapis.com/compute/v1/projects/openshift-gce-devel/regions/us-east1/targetPools/jliu-68n2v-api-target-pool].

# ./openshift-install wait-for bootstrap-complete --dir testINFO Waiting up to 30m0s for the Kubernetes API at https://api.jliu.origin-gce.dev.openshift.com:6443... 
INFO API v1.14.0+b7c8409 up                       
INFO Waiting up to 30m0s for bootstrapping to complete... 
INFO It is now safe to remove the bootstrap resources 

Suggestions for improvement: 

Additional information:

Comment 1 liujia 2019-09-09 05:45:19 UTC
I noticed the bug should be fixed in https://github.com/openshift/installer/commit/6f9230abda376b05969ecf0012f310c1969f07da#diff-6247f5de2172c3e64ed804505b299339

@Jeremiah Stuever 

Could u help update this bug's status?thx

Comment 2 Jeremiah Stuever 2019-09-09 16:47:17 UTC
This resolved with https://github.com/openshift/installer/pull/2289

Comment 4 liujia 2019-09-11 06:36:10 UTC
LGTM, and it works now following the latest steps.

Comment 5 errata-xmlrpc 2019-10-16 06:37:54 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2019:2922


Note You need to log in before you can comment on or make changes to this bug.