Bug 1883887
| Summary: | bootstrap took more than 30 minutes. | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Product: | OpenShift Container Platform | Reporter: | Fred Rolland <frolland> | ||||||
| Component: | assisted-installer | Assignee: | Rom Freiman <rfreiman> | ||||||
| assisted-installer sub component: | discovery-agent | QA Contact: | Yuri Obshansky <yobshans> | ||||||
| Status: | CLOSED CURRENTRELEASE | Docs Contact: | |||||||
| Severity: | unspecified | ||||||||
| Priority: | unspecified | CC: | aos-bugs, ercohen, sasha, vlaad | ||||||
| Version: | 4.6 | ||||||||
| Target Milestone: | --- | ||||||||
| Target Release: | --- | ||||||||
| Hardware: | Unspecified | ||||||||
| OS: | Unspecified | ||||||||
| Whiteboard: | assisted-installer-prod | ||||||||
| Fixed In Version: | Doc Type: | If docs needed, set a value | |||||||
| Doc Text: | Story Points: | --- | |||||||
| Clone Of: | Environment: | ||||||||
| Last Closed: | 2022-08-25 21:42:33 UTC | Type: | Bug | ||||||
| Regression: | --- | Mount Type: | --- | ||||||
| Documentation: | --- | CRM: | |||||||
| Verified Versions: | Category: | --- | |||||||
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||||
| Cloudforms Team: | --- | Target Upstream Version: | |||||||
| Embargoed: | |||||||||
| Attachments: |
|
||||||||
Created attachment 1717873 [details]
inventory logs
master1: HostID:5717d511-dc92-4966-027d-4170840e882c Sep 29 17:58:26 localhost installer[4081]: time="2020-09-29T17:58:26Z" level=info msg="Updating node installation stage: Rebooting - " Master:2 HostID:123870f4-f6e3-e440-c8c3-b1d619a1da62 Sep 29 17:54:54 localhost installer[4534]: time="2020-09-29T17:54:54Z" level=info msg="Updating node installation stage: Rebooting - " Bootstrap: bootkube seems to hang for 19 minutes: Sep 29 17:57:33 localhost bootkube.sh[3296]: Created "99_openshift-cluster-api_hosts-5.yaml" baremetalhosts.v1alpha1.metal3.io/openshift-worker-2 -n openshift-machine-api Sep 29 17:57:33 localhost bootkube.sh[3296]: Updated status for "99_openshift-cluster-api_hosts-5.yaml" baremetalhosts.v1alpha1.metal3.io/openshift-worker-2 -n openshift-machine-api Sep 29 18:16:04 localhost bootkube.sh[3296]: Pod Status:openshift-kube-apiserver/kube-apiserver Pending Sep 29 18:16:04 localhost bootkube.sh[3296]: Pod Status:openshift-kube-scheduler/openshift-kube-scheduler DoesNotExist Sep 29 18:16:04 localhost bootkube.sh[3296]: Pod Status:openshift-kube-controller-manager/kube-controller-manager DoesNotExist Sep 29 18:16:04 localhost bootkube.sh[3296]: Pod Status:openshift-cluster-version/cluster-version-operator Pending Sep 29 18:16:14 localhost bootkube.sh[3296]: Error: error while checking pod status: timed out waiting for the condition Sep 29 18:16:14 localhost bootkube.sh[3296]: Tearing down temporary bootstrap control plane... Sep 29 18:16:14 localhost bootkube.sh[3296]: Error: error while checking pod status: timed out waiting for the condition Sep 29 18:16:14 localhost systemd[1]: bootkube.service: Main process exited, code=exited, status=1/FAILURE Sep 29 18:16:14 localhost systemd[1]: bootkube.service: Failed with result 'exit-code'. Bootkube was restarted and to this error at some point Sep 29 18:17:24 localhost bootkube.sh[18209]: E0929 18:17:24.599843 1 streamwatcher.go:109] Unable to decode an event from the watch stream: http2: server sent GOAWAY and closed the connection; LastStreamID=3, ErrCode=NO_ERROR, debug="" just a test *** Bug 1881661 has been marked as a duplicate of this bug. *** Verified on staging:
{
"release_tag": "v1.0.10.1",
"versions": {
"assisted-ignition-generator": "",
"assisted-installer": "registry-proxy.engineering.redhat.com/rh-osbs/openshift4-assisted-installer-rhel8:v4.6.0-28",
"assisted-installer-controller": "registry-proxy.engineering.redhat.com/rh-osbs/openshift4-assisted-installer-reporter-rhel8:v4.6.0-25",
"assisted-installer-service": "quay.io/app-sre/assisted-service:fd4fdf8",
"discovery-agent": "quay.io/ocpmetal/assisted-installer-agent:latest",
"image-builder": "quay.io/app-sre/assisted-iso-create:fd4fdf8"
}
}
|
Created attachment 1717872 [details] logs The is a gap in the bootkube log: Sep 29 17:57:33 localhost bootkube.sh[3296]: Updated status for "99_openshift-cluster-api_hosts-5.yaml" baremetalhosts.v1alpha1.metal3.io/openshift-worker-2 -n openshift-machine-api Sep 29 18:16:04 localhost bootkube.sh[3296]: Pod Status:openshift-kube-apiserver/kube-apiserver Pending