Bug 2013591

Summary: [4.7] Openshift Installer| UEFI mode | BM hosts have BIOS halted
Product: OpenShift Container Platform Reporter: Yurii Prokulevych <yprokule>
Component: Bare Metal Hardware ProvisioningAssignee: Derek Higgins <derekh>
Bare Metal Hardware Provisioning sub component: ironic QA Contact: Amit Ugol <augol>
Status: CLOSED ERRATA Docs Contact:
Severity: urgent    
Priority: urgent CC: achernet, derekh, eglottma, lshilin, sdodson
Version: 4.7Keywords: Triaged
Target Milestone: ---   
Target Release: 4.7.z   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2021-10-27 08:22:59 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On: 1972213    
Bug Blocks: 2025495    

Description Yurii Prokulevych 2021-10-13 10:14:29 UTC
This bug was initially created as a copy of Bug #1966129


Version:

$ ocp 4.7.33

Platform:
baremetal


Please specify:
IPI

What happened?
--------------
OCP installation failed 
3 baremetal masters stuck(screenshot attached)


What did you expect to happen?

BM nodes up and running. Probably something wrong with boot loader

How to reproduce it (as minimally and precisely as possible)?

This issue reproducible on all 4.7.33  on Dell Power Edge R740.

Comment 2 Derek Higgins 2021-10-13 10:38:51 UTC
Looks like 4.7.33 brought with it a newer RHCOS
https://github.com/openshift/installer/pull/5229

This updated to a newer version of shim
shim-x64-15-16.el8.x86_64 -> shim-x64-15.4-2.el8_1.x86_64

This shim package is broken, if the UEFI fallback boot patch is used 
(see https://bugzilla.redhat.com/show_bug.cgi?id=1970632 )

We have two options here
1. revert the RHCOS bump
2. Backport the ironic workaround into 4.7 (https://review.opendev.org/c/openstack/ironic-python-agent/+/795862)

Comment 7 Lubov 2021-10-17 07:35:53 UTC
@yprokule could you, please, verify on your setup? We've seen the problem is not happening on our Dell setup

Comment 9 Lubov 2021-10-20 13:54:32 UTC
verified on 4.7.0-0.nightly-2021-10-15-152957

Comment 11 errata-xmlrpc 2021-10-27 08:22:59 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (OpenShift Container Platform 4.7.36 bug fix update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2021:3931