Bug 2042315 - Baremetal IPI deployment with IPv6 control plane and disabled provisioning network fails as the nodes do not pass introspection
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: OpenShift Container Platform
Classification: Red Hat
Component: Installer
Version: 4.10
Hardware: Unspecified
OS: Unspecified
Priority: urgent
Severity: urgent
Target Milestone: ---
Target Release: 4.10.0
Assignee: Derek Higgins
QA Contact: Lubov
URL:
Whiteboard:
Depends On:
Blocks:
Reported: 2022-01-19 08:53 UTC by Marius Cornea
Modified: 2022-03-10 16:41 UTC (History)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2022-03-10 16:40:51 UTC
Target Upstream Version:
Embargoed:


Attachments
install-config.yaml (1.86 KB, text/plain), 2022-01-19 08:53 UTC, Marius Cornea
introspection_console.png (51.69 KB, image/png), 2022-01-19 08:54 UTC, Marius Cornea
grub kernel cmdline (49.44 KB, image/png), 2022-01-19 08:59 UTC, Marius Cornea


Links
System | ID | Private | Priority | Status | Summary | Last Updated
Github | openshift installer pull 5548 | 0 | None | open | Bug 2042315: Fallback to External IP_OPTIONS when ProvisioningNetwork is Disabled | 2022-01-19 11:58:49 UTC
Red Hat Product Errata | RHSA-2022:0056 | 0 | None | None | None | 2022-03-10 16:41:06 UTC

Description Marius Cornea 2022-01-19 08:53:37 UTC
Created attachment 1851812 [details]
install-config.yaml

Description of problem:

Baremetal IPI deployment with IPv6 control plane and disabled provisioning network fails as the nodes do not pass introspection.

Version-Release number of selected component (if applicable):
quay.io/openshift-release-dev/ocp-release:4.10.0-fc.1-x86_64

How reproducible:
100%

Steps to Reproduce:
1. Run the baremetal IPI deployment with the attached install-config.yaml

Actual results:

time="2022-01-18T17:28:53-05:00" level=error
time="2022-01-18T17:28:53-05:00" level=error msg="Error: could not inspect: could not inspect node, node is currently 'inspect failed' , last error was 'timeout reached while inspecting the node'"
time="2022-01-18T17:28:53-05:00" level=error
time="2022-01-18T17:28:53-05:00" level=error msg="  on ../../tmp/openshift-install-masters-1588391933/main.tf line 13, in resource \"ironic_node_v1\" \"openshift-master-host\":"
time="2022-01-18T17:28:53-05:00" level=error msg="  13: resource \"ironic_node_v1\" \"openshift-master-host\" {"
time="2022-01-18T17:28:53-05:00" level=error
time="2022-01-18T17:28:53-05:00" level=error
time="2022-01-18T17:28:53-05:00" level=error
time="2022-01-18T17:28:53-05:00" level=error msg="Error: could not inspect: could not inspect node, node is currently 'inspect failed' , last error was 'timeout reached while inspecting the node'"
time="2022-01-18T17:28:53-05:00" level=error
time="2022-01-18T17:28:53-05:00" level=error msg="  on ../../tmp/openshift-install-masters-1588391933/main.tf line 13, in resource \"ironic_node_v1\" \"openshift-master-host\":"
time="2022-01-18T17:28:53-05:00" level=error msg="  13: resource \"ironic_node_v1\" \"openshift-master-host\" {"
time="2022-01-18T17:28:53-05:00" level=error
time="2022-01-18T17:28:53-05:00" level=error
time="2022-01-18T17:28:53-05:00" level=error
time="2022-01-18T17:28:53-05:00" level=error msg="Error: could not inspect: could not inspect node, node is currently 'inspect failed' , last error was 'timeout reached while inspecting the node'"
time="2022-01-18T17:28:53-05:00" level=error
time="2022-01-18T17:28:53-05:00" level=error msg="  on ../../tmp/openshift-install-masters-1588391933/main.tf line 13, in resource \"ironic_node_v1\" \"openshift-master-host\":"
time="2022-01-18T17:28:53-05:00" level=error msg="  13: resource \"ironic_node_v1\" \"openshift-master-host\" {"
time="2022-01-18T17:28:53-05:00" level=error
time="2022-01-18T17:28:53-05:00" level=error

Expected results:

Deployment succeeds.

Additional info:

Nodes are unreachable via their reserved DHCP addresses, and the console shows localhost in the prompt, which indicates that the nodes didn't get an address via DHCP. Attaching a screenshot.

Comment 1 Marius Cornea 2022-01-19 08:54:36 UTC
Created attachment 1851813 [details]
introspection_console.png

Comment 2 Marius Cornea 2022-01-19 08:58:34 UTC
It looks like the kernel command line in GRUB has ip=dhcp (screenshot attached). After changing it to ip=dhcp6, the node obtains its DHCP address correctly.
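
To illustrate the behavior described above, here is a minimal sketch of how the ip= kernel argument could be derived from the address family of the network the node boots on, falling back to the external (machine) network when the provisioning network is disabled. The function names, parameters, and example CIDRs are hypothetical and are not taken from the installer source. The combined value ip=dhcp,dhcp6 for dual-stack is an assumption based on dracut's comma-separated ip= syntax.

```python
import ipaddress


def kernel_ip_option(cidrs):
    """Return the ip= kernel argument matching the IP families in `cidrs`.

    Hypothetical helper for illustration only.
    """
    versions = {ipaddress.ip_network(c, strict=False).version for c in cidrs}
    if versions == {4}:
        return "ip=dhcp"
    if versions == {6}:
        return "ip=dhcp6"
    if versions == {4, 6}:
        # Assumption: dual-stack requests both DHCPv4 and DHCPv6.
        return "ip=dhcp,dhcp6"
    raise ValueError("no networks given")


def inspection_ip_option(provisioning_disabled, provisioning_cidr, machine_cidrs):
    """Pick the network whose address family decides the ip= argument.

    With the provisioning network disabled there is no provisioning CIDR
    to consult, so fall back to the external (machine) networks.
    """
    if provisioning_disabled or not provisioning_cidr:
        return kernel_ip_option(machine_cidrs)
    return kernel_ip_option([provisioning_cidr])


if __name__ == "__main__":
    # IPv6 control plane with the provisioning network disabled (example CIDR).
    print(inspection_ip_option(True, None, ["fd2e:6f44:5dd8::/64"]))
```

With an IPv6-only machine network and no provisioning network, the sketch yields ip=dhcp6, which matches the manual change that allowed the node to obtain its address.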

Comment 3 Marius Cornea 2022-01-19 08:59:29 UTC
Created attachment 1851817 [details]
grub kernel cmdline

Comment 4 Marius Cornea 2022-01-19 10:05:24 UTC
The issue reproduced on 4.10.0-0.nightly-2022-01-18-044014 as well.

Comment 6 Dmitry Tantsur 2022-01-19 10:49:42 UTC
I assume it's a blocker since an important scenario is broken. The fix is on the way.

Comment 9 Lubov 2022-01-23 10:12:08 UTC
verified on 4.10.0-0.nightly-2022-01-22-102609

Comment 12 errata-xmlrpc 2022-03-10 16:40:51 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Moderate: OpenShift Container Platform 4.10.3 security update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2022:0056

