Bug 2042315

Summary: Baremetal IPI deployment with IPv6 control plane and disabled provisioning network fails as the nodes do not pass introspection
Product: OpenShift Container Platform Reporter: Marius Cornea <mcornea>
Component: InstallerAssignee: Derek Higgins <derekh>
Installer sub component: OpenShift on Bare Metal IPI QA Contact: Lubov <lshilin>
Status: CLOSED ERRATA Docs Contact:
Severity: urgent    
Priority: urgent CC: lshilin, smiron
Version: 4.10Keywords: Triaged
Target Milestone: ---   
Target Release: 4.10.0   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2022-03-10 16:40:51 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Attachments:
Description Flags
install-config.yaml
none
introspection_console.png
none
grub kernel cmdline none

Description Marius Cornea 2022-01-19 08:53:37 UTC
Created attachment 1851812 [details]
install-config.yaml

Description of problem:

Baremetal IPI deployment with IPv6 control plane and disabled provisioning network fails as the nodes do not pass introspection.

Version-Release number of selected component (if applicable):
quay.io/openshift-release-dev/ocp-release:4.10.0-fc.1-x86_64

How reproducible:
100%

Steps to Reproduce:
1. Run baremetal IPI process with the attached install-config.yaml

Actual results:

time="2022-01-18T17:28:53-05:00" level=error
time="2022-01-18T17:28:53-05:00" level=error msg="Error: could not inspect: could not inspect node, node is currently 'inspect failed' , last error was 'timeout reached while inspecting the node'"
time="2022-01-18T17:28:53-05:00" level=error
time="2022-01-18T17:28:53-05:00" level=error msg="  on ../../tmp/openshift-install-masters-1588391933/main.tf line 13, in resource \"ironic_node_v1\" \"openshift-master-host\":"
time="2022-01-18T17:28:53-05:00" level=error msg="  13: resource \"ironic_node_v1\" \"openshift-master-host\" {"
time="2022-01-18T17:28:53-05:00" level=error
time="2022-01-18T17:28:53-05:00" level=error
time="2022-01-18T17:28:53-05:00" level=error
time="2022-01-18T17:28:53-05:00" level=error msg="Error: could not inspect: could not inspect node, node is currently 'inspect failed' , last error was 'timeout reached while inspecting the node'"
time="2022-01-18T17:28:53-05:00" level=error
time="2022-01-18T17:28:53-05:00" level=error msg="  on ../../tmp/openshift-install-masters-1588391933/main.tf line 13, in resource \"ironic_node_v1\" \"openshift-master-host\":"
time="2022-01-18T17:28:53-05:00" level=error msg="  13: resource \"ironic_node_v1\" \"openshift-master-host\" {"
time="2022-01-18T17:28:53-05:00" level=error
time="2022-01-18T17:28:53-05:00" level=error
time="2022-01-18T17:28:53-05:00" level=error
time="2022-01-18T17:28:53-05:00" level=error msg="Error: could not inspect: could not inspect node, node is currently 'inspect failed' , last error was 'timeout reached while inspecting the node'"
time="2022-01-18T17:28:53-05:00" level=error
time="2022-01-18T17:28:53-05:00" level=error msg="  on ../../tmp/openshift-install-masters-1588391933/main.tf line 13, in resource \"ironic_node_v1\" \"openshift-master-host\":"
time="2022-01-18T17:28:53-05:00" level=error msg="  13: resource \"ironic_node_v1\" \"openshift-master-host\" {"
time="2022-01-18T17:28:53-05:00" level=error
time="2022-01-18T17:28:53-05:00" level=error

Expected results:

Deployment succeeds.

Additional info:

Nodes are unreachable via their reserved DHCP addresses and the console reports localhost in the prompt which indicates that the nodes didn't get and address via DHCP, attaching screenshot..

Comment 1 Marius Cornea 2022-01-19 08:54:36 UTC
Created attachment 1851813 [details]
introspection_console.png

Comment 2 Marius Cornea 2022-01-19 08:58:34 UTC
Looks like the kernel commandline in grub has ip=dhcp, attached screenshot. When changing it to ip=dhcp6 the node obtains DHCP address correctly.

Comment 3 Marius Cornea 2022-01-19 08:59:29 UTC
Created attachment 1851817 [details]
grub kernel cmdline

Comment 4 Marius Cornea 2022-01-19 10:05:24 UTC
The issue reproduced on 4.10.0-0.nightly-2022-01-18-044014 as well.

Comment 6 Dmitry Tantsur 2022-01-19 10:49:42 UTC
I assume it's a blocker since an important scenario is broken. The fix is on the way.

Comment 9 Lubov 2022-01-23 10:12:08 UTC
verified on 4.10.0-0.nightly-2022-01-22-102609

Comment 12 errata-xmlrpc 2022-03-10 16:40:51 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Moderate: OpenShift Container Platform 4.10.3 security update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2022:0056