Bug 1927244
Summary: | UPI installation with Kuryr timing out on bootstrap stage | ||||||||
---|---|---|---|---|---|---|---|---|---|
Product: | OpenShift Container Platform | Reporter: | rlobillo | ||||||
Component: | Networking | Assignee: | Maysa Macedo <mdemaced> | ||||||
Networking sub component: | kuryr | QA Contact: | GenadiC <gcheresh> | ||||||
Status: | CLOSED ERRATA | Docs Contact: | |||||||
Severity: | urgent | ||||||||
Priority: | high | CC: | mbridges, mdulko | ||||||
Version: | 4.7 | ||||||||
Target Milestone: | --- | ||||||||
Target Release: | 4.8.0 | ||||||||
Hardware: | Unspecified | ||||||||
OS: | Unspecified | ||||||||
Whiteboard: | |||||||||
Fixed In Version: | Doc Type: | Bug Fix | |||||||
Doc Text: |
Cause:
Kuryr changed the mechanism to detect the OpenStack Subnet used by the cluster's nodes. Kuryr relied on the Network of the cluster's nodes Subnet having a specific tag, but the tag was removed for IPI Installations causing the need to discover it from the OpenShift Machine objects, which the creation is removed on one of the UPI steps.
Consequence:
Installations with Kuryr SDN timing out on the Bootstrap stage.
Fix:
Continue adding the ID of the Neutron Subnet to Kuryr, instead of only relying on Machine objects.
Result:
Installation with Kuryr on UPI succeeds.
|
Story Points: | --- | ||||||
Clone Of: | Environment: | ||||||||
Last Closed: | 2021-07-27 22:43:43 UTC | Type: | Bug | ||||||
Regression: | --- | Mount Type: | --- | ||||||
Documentation: | --- | CRM: | |||||||
Verified Versions: | Category: | --- | |||||||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||||
Cloudforms Team: | --- | Target Upstream Version: | |||||||
Embargoed: | |||||||||
Bug Depends On: | |||||||||
Bug Blocks: | 1929168, 1931347 | ||||||||
Attachments: |
|
Description
rlobillo
2021-02-10 11:55:58 UTC
Created attachment 1756207 [details]
openshift-installer log bundle
Verified on OCP4.8.0-0.nightly-2021-02-21-102854 over OSP13 (2021-01-20.1) with Amphora provider. OCP installation with UPI succeeded: time="2021-02-21T13:54:03-05:00" level=debug msg="Cluster is initialized" time="2021-02-21T13:54:03-05:00" level=info msg="Waiting up to 10m0s for the openshift-console route to be created..." time="2021-02-21T13:54:03-05:00" level=debug msg="Route found in openshift-console namespace: console" time="2021-02-21T13:54:03-05:00" level=debug msg="OpenShift console route is admitted" time="2021-02-21T13:54:03-05:00" level=info msg="Install complete!" time="2021-02-21T13:54:03-05:00" level=info msg="To access the cluster as the system:admin user when using 'oc', run 'export KUBECONFIG=/home/cloud-user/ostest/auth/kubeconfig'" time="2021-02-21T13:54:03-05:00" level=info msg="Access the OpenShift web-console here: https://console-openshift-console.apps.ostest.shiftstack.com" time="2021-02-21T13:54:03-05:00" level=info msg="Login to the console with user: \"kubeadmin\", and password: \"fYnBX-8rtrM-KeteY-fhoDT\"" time="2021-02-21T13:54:03-05:00" level=debug msg="Time elapsed per stage:" time="2021-02-21T13:54:03-05:00" level=debug msg="Cluster Operators: 17m8s" time="2021-02-21T13:54:03-05:00" level=info msg="Time elapsed: 17m8s" Bootstrapping stage was performed succesfully: time="2021-02-21T13:13:22-05:00" level=info msg="API v1.20.0+01ab7fd up" time="2021-02-21T13:13:22-05:00" level=info msg="Waiting up to 30m0s for bootstrapping to complete..." time="2021-02-21T13:29:08-05:00" level=debug msg="Bootstrap status: complete" time="2021-02-21T13:29:08-05:00" level=info msg="It is now safe to remove the bootstrap resources" time="2021-02-21T13:29:08-05:00" level=debug msg="Time elapsed per stage:" time="2021-02-21T13:29:08-05:00" level=debug msg="Bootstrap Complete: 16m40s" time="2021-02-21T13:29:08-05:00" level=debug msg=" API: 54s" time="2021-02-21T13:29:08-05:00" level=info msg="Time elapsed: 16m40s" Tempest tests were executed succesfully: https://rhos-ci-staging-jenkins.lab.eng.tlv2.redhat.com/job/DFG-osasinfra-shiftstack_ci-ocp_verification-osp13-ocp4.7-upi/7//artifact/tempest-results/tempest-results-kuryr.1.html NP tests were executed succesfully: https://rhos-ci-staging-jenkins.lab.eng.tlv2.redhat.com/job/DFG-osasinfra-shiftstack_ci-ocp_verification-osp13-ocp4.7-upi/7//artifact/np_test_results/np_kubetest.html#a7c8b2ea-dafb-435a-ae63-ea6c5c596374 "NetworkPolicy_between_server_and_client_should_enforce_policy_based_on_PodSelector_and_NamespaceSelector_[Feature:NetworkPolicy-07]" needed to be re-executed and it passed: Verified on OCP4.8.0-0.nightly-2021-02-21-102854 over OSP13 (2021-01-20.1) with Amphora provider. OCP installation with UPI succeeded: time="2021-02-21T13:54:03-05:00" level=debug msg="Cluster is initialized" time="2021-02-21T13:54:03-05:00" level=info msg="Waiting up to 10m0s for the openshift-console route to be created..." time="2021-02-21T13:54:03-05:00" level=debug msg="Route found in openshift-console namespace: console" time="2021-02-21T13:54:03-05:00" level=debug msg="OpenShift console route is admitted" time="2021-02-21T13:54:03-05:00" level=info msg="Install complete!" time="2021-02-21T13:54:03-05:00" level=info msg="To access the cluster as the system:admin user when using 'oc', run 'export KUBECONFIG=/home/cloud-user/ostest/auth/kubeconfig'" time="2021-02-21T13:54:03-05:00" level=info msg="Access the OpenShift web-console here: https://console-openshift-console.apps.ostest.shiftstack.com" time="2021-02-21T13:54:03-05:00" level=info msg="Login to the console with user: \"kubeadmin\", and password: \"fYnBX-8rtrM-KeteY-fhoDT\"" time="2021-02-21T13:54:03-05:00" level=debug msg="Time elapsed per stage:" time="2021-02-21T13:54:03-05:00" level=debug msg="Cluster Operators: 17m8s" time="2021-02-21T13:54:03-05:00" level=info msg="Time elapsed: 17m8s" Bootstrapping stage was performed successfully: time="2021-02-21T13:13:22-05:00" level=info msg="API v1.20.0+01ab7fd up" time="2021-02-21T13:13:22-05:00" level=info msg="Waiting up to 30m0s for bootstrapping to complete..." time="2021-02-21T13:29:08-05:00" level=debug msg="Bootstrap status: complete" time="2021-02-21T13:29:08-05:00" level=info msg="It is now safe to remove the bootstrap resources" time="2021-02-21T13:29:08-05:00" level=debug msg="Time elapsed per stage:" time="2021-02-21T13:29:08-05:00" level=debug msg="Bootstrap Complete: 16m40s" time="2021-02-21T13:29:08-05:00" level=debug msg=" API: 54s" time="2021-02-21T13:29:08-05:00" level=info msg="Time elapsed: 16m40s" Tempest tests passed: https://rhos-ci-staging-jenkins.lab.eng.tlv2.redhat.com/job/DFG-osasinfra-shiftstack_ci-ocp_verification-osp13-ocp4.7-upi/7//artifact/tempest-results/tempest-results-kuryr.1.html NP tests passed: https://rhos-ci-staging-jenkins.lab.eng.tlv2.redhat.com/job/DFG-osasinfra-shiftstack_ci-ocp_verification-osp13-ocp4.7-upi/7//artifact/np_test_results/np_kubetest.html#a7c8b2ea-dafb-435a-ae63-ea6c5c596374 (*) (*) "NetworkPolicy_between_server_and_client_should_enforce_policy_based_on_PodSelector_and_NamespaceSelector_[Feature:NetworkPolicy-07]" failed on first attempt but passed on second one. Logs attached. Conformance tests passed: https://rhos-ci-staging-jenkins.lab.eng.tlv2.redhat.com/job/DFG-osasinfra-shiftstack_ci-ocp_verification-osp13-ocp4.7-upi/7//artifact/conformance-test-results/conformance_ocp-tests.html (**) (**) [sig-scheduling]_SchedulerPredicates_[Serial]_validates_resource_limits_of_pods_that_are_allowed_to_run and [sig-api-machinery]_AdmissionWebhook_[Privileged:ClusterAdmin]_should_mutate_pod_and_apply_defaults_after_mutation failed on first attempt but they passed on second execution. Logs attached. Installation and test logs attached. Created attachment 1758574 [details]
installation and test logs
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Moderate: OpenShift Container Platform 4.8.2 bug fix and security update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHSA-2021:2438 |