Bug 1816732
Summary: | [UPI][4.4] Worker nodes doesn't join cluster | ||||||
---|---|---|---|---|---|---|---|
Product: | OpenShift Container Platform | Reporter: | Lukas Bednar <lbednar> | ||||
Component: | Installer | Assignee: | Abhinav Dahiya <adahiya> | ||||
Installer sub component: | openshift-installer | QA Contact: | Johnny Liu <jialiu> | ||||
Status: | CLOSED DUPLICATE | Docs Contact: | |||||
Severity: | urgent | ||||||
Priority: | unspecified | CC: | bleanhar, ginger, lbednar, wsun, ycui | ||||
Version: | 4.4 | Keywords: | TestBlockerForLayeredProduct | ||||
Target Milestone: | --- | ||||||
Target Release: | 4.5.0 | ||||||
Hardware: | Unspecified | ||||||
OS: | Unspecified | ||||||
Whiteboard: | |||||||
Fixed In Version: | Doc Type: | If docs needed, set a value | |||||
Doc Text: | Story Points: | --- | |||||
Clone Of: | Environment: | ||||||
Last Closed: | 2020-04-06 14:41:59 UTC | Type: | Bug | ||||
Regression: | --- | Mount Type: | --- | ||||
Documentation: | --- | CRM: | |||||
Verified Versions: | Category: | --- | |||||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||
Cloudforms Team: | --- | Target Upstream Version: | |||||
Embargoed: | |||||||
Attachments: |
|
Description
Lukas Bednar
2020-03-24 15:50:24 UTC
I tried again with ocp-4.4.0-rc.4 and now only two workers joint :-/ ... [cnv-qe-jenkins@cnv-executor-lbednar ~]$ oc get nodes NAME STATUS ROLES AGE VERSION host-172-16-0-16 Ready master 21m v1.17.1 host-172-16-0-19 Ready master 19m v1.17.1 host-172-16-0-27 Ready master 19m v1.17.1 host-172-16-0-41 Ready worker 12m v1.17.1 host-172-16-0-42 Ready worker 12m v1.17.1 It happened to me too with OCP-4.3.5. The cluster has 2 workers instead of 3. [cnv-qe-jenkins@cnv-executor-ginger2 ~]$ oc get nodes NAME STATUS ROLES AGE VERSION host-172-16-0-13 Ready master 26h v1.16.2 host-172-16-0-20 Ready master 26h v1.16.2 host-172-16-0-25 Ready master 26h v1.16.2 host-172-16-0-40 Ready worker 26h v1.16.2 host-172-16-0-53 Ready worker 26h v1.16.2 Do you have pending CSR approvals? `oc get csr` Please provide `oc adm must-gather` and/or details of your CSR approval process. (In reply to Scott Dodson from comment #3) > Do you have pending CSR approvals? `oc get csr` > Please provide `oc adm must-gather` and/or details of your CSR approval > process. You are right, there were pending CSRs, but this wasn't happening before. We did approve CSRs right after installation was completed. So now as a workaround, we run loop on background while openshift-installer is running: ``` worker_num=3 while true ; do sleep 120 for crs in $(oc get csr -ojson | jq -r '.items[] | select(.status == {} ) | .metadata.name') ; do echo "Approve cert for node ${crs}" oc adm certificate approve ${crs} || true done worker_joint=$(oc get node -l node-role.kubernetes.io/worker --no-headers | wc -l) if [ ${worker_num} -eq ${worker_joint} ] ; then echo "All workers have joint cluster" break fi done ``` *** This bug has been marked as a duplicate of bug 1818961 *** |