Bug 1920483
| Summary: | Bootstrap completed failed: Network is not ready with error "No CNI configuration file in /etc/kubernetes/cni/net.d/" when running a cluster behind proxy. | ||||||
|---|---|---|---|---|---|---|---|
| Product: | OpenShift Container Platform | Reporter: | jima | ||||
| Component: | Machine Config Operator | Assignee: | Antonio Murdaca <amurdaca> | ||||
| Status: | CLOSED DUPLICATE | QA Contact: | Michael Nguyen <mnguyen> | ||||
| Severity: | high | Docs Contact: | |||||
| Priority: | unspecified | ||||||
| Version: | 4.7 | CC: | behoward, jialiu, mstaeble, tsze, wking | ||||
| Target Milestone: | --- | Keywords: | Regression, TestBlocker | ||||
| Target Release: | 4.7.0 | ||||||
| Hardware: | Unspecified | ||||||
| OS: | Unspecified | ||||||
| Whiteboard: | |||||||
| Fixed In Version: | Doc Type: | If docs needed, set a value | |||||
| Doc Text: | Story Points: | --- | |||||
| Clone Of: | Environment: | ||||||
| Last Closed: | 2021-01-27 22:55:32 UTC | Type: | Bug | ||||
| Regression: | --- | Mount Type: | --- | ||||
| Documentation: | --- | CRM: | |||||
| Verified Versions: | Category: | --- | |||||
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||
| Cloudforms Team: | --- | Target Upstream Version: | |||||
| Embargoed: | |||||||
| Attachments: |
Description
jima
2021-01-26 11:49:26 UTC
Created attachment 1750876 [details]
Attached bootstrap bundle log
After digging a bit more into this failed cluster, it seems this issue was introduced by https://github.com/openshift/machine-config-operator/pull/2342

[root@control-plane-2 ~]# grep -r "etc/mco/proxy.env" /etc/systemd/
/etc/systemd/system/machine-config-daemon-firstboot.service:EnvironmentFile=/etc/mco/proxy.env
/etc/systemd/system/machine-config-daemon-pull.service:EnvironmentFile=/etc/mco/proxy.env
/etc/systemd/system/nodeip-configuration.service:EnvironmentFile=/etc/mco/proxy.env
/etc/systemd/system/pivot.service.d/10-mco-default-env.conf:EnvironmentFile=/etc/mco/proxy.env
[root@control-plane-2 ~]# cd /etc/systemd/system/
[root@control-plane-2 system]# pwd
/etc/systemd/system
[root@control-plane-2 system]# ls kubelet.service
kubelet.service kubelet.service.d/ kubelet.service.requires/
[root@control-plane-2 system]# ls kubelet.service*
kubelet.service

kubelet.service.d:
10-mco-default-env.conf 20-logging.conf

kubelet.service.requires:
machine-config-daemon-firstboot.service
[root@control-plane-2 system]# cat kubelet.service.d/10-mco-default-env.conf
[Service]
Environment="GODEBUG=x509ignoreCN=0,madvdontneed=1"

The proxy configuration file is not dropped into the kubelet service folder.

[root@control-plane-2 manifests]# systemctl status machine-config-daemon-firstboot.service
● machine-config-daemon-firstboot.service - Machine Config Daemon Firstboot
   Loaded: loaded (/etc/systemd/system/machine-config-daemon-firstboot.service; enabled; vendor preset: enabled)
   Active: inactive (dead)
Condition: start condition failed at Tue 2021-01-26 08:31:31 UTC; 5h 30min ago
           └─ ConditionPathExists=/etc/ignition-machine-config-encapsulated.json was not met
[root@control-plane-2 manifests]# journalctl -f -u machine-config-daemon-firstboot.service
-- Logs begin at Tue 2021-01-26 08:28:53 UTC. --
Jan 26 08:31:02 control-plane-2 machine-config-daemon[2138]: I0126 08:31:02.747194 2138 rpm-ostree.go:184] Current origin is not custom
Jan 26 08:31:04 control-plane-2 machine-config-daemon[2138]: I0126 08:31:04.576186 2138 rpm-ostree.go:211] Pivoting to: 47.83.202101251242-0 (2413c3249f9661b967a2cd9eef5822fe20b87b4b41bc4901721da9f5b6760391)
Jan 26 08:31:04 control-plane-2 machine-config-daemon[2138]: I0126 08:31:04.576501 2138 rpm-ostree.go:243] Executing rebase from repo path /run/mco-machine-os-content/os-content-492386407/srv/repo with customImageURL pivot://quay.io/openshift-release-dev/ocp-v4.0-art-dev@sha256:e52aead8f8025eb8fe12a385a826a822fa94f9dc89e8d55abcc2bbf718f4b11f and checksum 2413c3249f9661b967a2cd9eef5822fe20b87b4b41bc4901721da9f5b6760391
Jan 26 08:31:04 control-plane-2 machine-config-daemon[2138]: I0126 08:31:04.576553 2138 rpm-ostree.go:261] Running captured: rpm-ostree rebase --experimental /run/mco-machine-os-content/os-content-492386407/srv/repo:2413c3249f9661b967a2cd9eef5822fe20b87b4b41bc4901721da9f5b6760391 --custom-origin-url pivot://quay.io/openshift-release-dev/ocp-v4.0-art-dev@sha256:e52aead8f8025eb8fe12a385a826a822fa94f9dc89e8d55abcc2bbf718f4b11f --custom-origin-description Managed by machine-config-operator
Jan 26 08:31:11 control-plane-2 machine-config-daemon[2138]: I0126 08:31:11.771676 2138 update.go:1858] Rebooting node
Jan 26 08:31:11 control-plane-2 machine-config-daemon[2138]: I0126 08:31:11.776457 2138 update.go:1858] initiating reboot: Completing firstboot provisioning to rendered-master-e54b9be49bc54ed97eb6ff27e32f043d
Jan 26 08:31:11 control-plane-2 systemd[1]: machine-config-daemon-firstboot.service: Main process exited, code=killed, status=15/TERM
Jan 26 08:31:11 control-plane-2 systemd[1]: machine-config-daemon-firstboot.service: Failed with result 'signal'.
Jan 26 08:31:11 control-plane-2 systemd[1]: Stopped Machine Config Daemon Firstboot.
Jan 26 08:31:11 control-plane-2 systemd[1]: machine-config-daemon-firstboot.service: Consumed 16.058s CPU time

Assuming the analysis in comment 3 is correct, I am moving this bug to MCO.

*** This bug has been marked as a duplicate of bug 1920027 ***
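For anyone who wants to repeat the manual check from the analysis above on another node, the grep and ls steps could be scripted roughly as follows. This is only a sketch, not part of the MCO: the function names `units_referencing` and `kubelet_has_proxy_env` are made up for illustration, and it assumes the unit files live under a single directory such as /etc/systemd/system and that the proxy file path is /etc/mco/proxy.env as seen in the grep output.

```python
import os


def units_referencing(root, env_file="/etc/mco/proxy.env"):
    """Return sorted paths (relative to root) of files that contain an
    EnvironmentFile= line pointing at env_file.

    Hypothetical helper; mirrors `grep -r "etc/mco/proxy.env" /etc/systemd/`.
    """
    hits = []
    for dirpath, _dirs, files in os.walk(root):
        for name in files:
            path = os.path.join(dirpath, name)
            try:
                with open(path, encoding="utf-8", errors="replace") as fh:
                    lines = fh.read().splitlines()
            except OSError:
                # Unreadable file or dangling symlink: skip it.
                continue
            for line in lines:
                key, sep, value = line.strip().partition("=")
                # systemd allows a leading "-" on the path, meaning
                # "ignore if the file is missing"; strip it before comparing.
                if (sep and key.strip() == "EnvironmentFile"
                        and value.strip().lstrip("-") == env_file):
                    hits.append(os.path.relpath(path, root))
                    break
    return sorted(hits)


def kubelet_has_proxy_env(root, env_file="/etc/mco/proxy.env"):
    """True if kubelet.service or one of its drop-ins loads env_file."""
    dropin_prefix = os.path.join("kubelet.service.d", "")
    return any(
        hit == "kubelet.service" or hit.startswith(dropin_prefix)
        for hit in units_referencing(root, env_file)
    )


if __name__ == "__main__":
    root = "/etc/systemd/system"  # assumed location, as in the comment above
    for unit in units_referencing(root):
        print(unit)
    print("kubelet loads proxy env:", kubelet_has_proxy_env(root))
```

On the node shown above, the expectation would be that the four units from the grep output are listed and that the kubelet check reports False, which matches the observation that no proxy drop-in was written for kubelet.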