Bug 2034909 - SNO BMH node deployed via ZTP stuck in inspecting state
Summary: SNO BMH node deployed via ZTP stuck in inspecting state
Keywords:
Status: CLOSED DUPLICATE of bug 2035239
Alias: None
Product: OpenShift Container Platform
Classification: Red Hat
Component: Bare Metal Hardware Provisioning
Version: 4.10
Hardware: Unspecified
OS: Unspecified
unspecified
urgent
Target Milestone: ---
: ---
Assignee: Zane Bitter
QA Contact: Amit Ugol
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2021-12-22 13:21 UTC by Marius Cornea
Modified: 2022-01-04 19:39 UTC (History)
1 user (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2022-01-04 19:39:52 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)

Description Marius Cornea 2021-12-22 13:21:37 UTC
Description of problem:

SNO BMH node deployed via ZTP stuck in inspecting state

Version-Release number of selected component (if applicable):
4.10.0-0.nightly-2021-12-21-130047

How reproducible:
100%

Steps to Reproduce:
1. Deploy an SNO cluster via ZTP process
2. Create bmh object

Actual results:

 oc -n kni-qe-1 get bmh sno.kni-qe-1.lab.eng.rdu2.redhat.com 
NAME                                   STATE        CONSUMER   ONLINE   ERROR   AGE
sno.kni-qe-1.lab.eng.rdu2.redhat.com   inspecting              true             167m


Expected results:
bmh node gets provisioned and allows installation to move forward

Additional info:

Attaching must-gather and CRs

Comment 2 Zane Bitter 2022-01-04 19:39:52 UTC
The issue is that in a configuration with no provisioning network, CBO was failing to deploy the image-customization-controller:

2021-12-21T18:07:59.206806724Z E1221 18:07:59.206746       1 controller.go:317] controller/provisioning "msg"="Reconciler error" "error"="unable to determine Ironic's IP to pass to the image-customization-controller: pods is forbidden: User \"system:serviceaccount:openshift-machine-api:cluster-baremetal-operator\" cannot list resource \"pods\" in API group \"\" in the namespace \"openshift-machine-api\"" "name"="provisioning-configuration" "namespace"="" "reconciler group"="metal3.io" "reconciler kind"="Provisioning" 

This has been fixed.

*** This bug has been marked as a duplicate of bug 2035239 ***


Note You need to log in before you can comment on or make changes to this bug.