Bug 2059567

Summary: vMedia based IPI installation of OpenShift fails on Nokia servers due to issues with virtual media attachment and boot source override
Product: OpenShift Container Platform Reporter: Jacob Anders <janders>
Component: Bare Metal Hardware Provisioning Assignee: Jacob Anders <janders>
Bare Metal Hardware Provisioning sub component: ironic QA Contact: Amit Ugol <augol>
Status: CLOSED ERRATA Docs Contact:
Severity: high    
Priority: urgent CC: elajoie, imelofer, lshilin, phuet, racedoro, rpittau, tsedovic, yliu1
Version: 4.10 Keywords: Triaged
Target Milestone: ---   
Target Release: 4.11.0   
Hardware: x86_64   
OS: All   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Release Note text: Previously, Ironic was unable to use virtual media to provision Nokia OE 20 servers because the BMC required the TransferProtocolType attribute to be set explicitly in the request, even though the attribute is optional. In addition, the BMC required the use of a dedicated Redfish Settings Resource to override the boot order, whereas most BMCs use the System resource for this purpose.
Cause: Nokia OE 20 strictly requires the optional "TransferProtocolType" attribute for vMedia attachment and also requires the use of a Redfish Settings Resource (in this case "Systems/Self/SD") for overriding the boot sequence.
Consequence: Virtual media based provisioning fails on Nokia OE 20.
Fix: 1) When a vMedia attachment request fails with an error indicating that the TransferProtocolType attribute is missing, retry the request with this attribute specified explicitly. 2) Check for the presence of a Redfish Settings Resource for the system and use it for the boot sequence override if present.
Result: Virtual media based provisioning succeeds on Nokia OE 20 machines.
Story Points: ---
Clone Of:
: 2061600 Environment:
Last Closed: 2022-08-10 10:51:27 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 2061600    

Description Jacob Anders 2022-03-01 10:57:15 UTC
Description of problem:

Currently it is not possible to perform a successful IPI installation of OpenShift on Nokia hardware because of Redfish incompatibilities between Ironic and Nokia BMCs.

Version-Release number of selected component (if applicable):

4.8, 4.9, 4.10

How reproducible:

Always


Steps to Reproduce:
1. Attempt IPI installation of OpenShift on a Nokia server using virtual media

Actual results:
Installation fails with virtual media attachment issues


Expected results:
Installation succeeds


Additional info:
The Nokia BMC requires an additional TransferProtocolType attribute to be specified for vMedia attachment to work:

curl -X POST -ksu USER:PASS https://BMC_IP/redfish/v1/Managers/Self/VirtualMedia/CD1/Actions/VirtualMedia.InsertMedia -d '{"Image": "//HOST//image.iso", "TransferProtocolType": "HTTP"}' -H "Content-Type: application/json" 
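
The retry behaviour described for the fix could be sketched roughly as follows. This is an illustration only, not the actual sushy change: `BadRequest`, `insert_media`, and the `post` callable are hypothetical names, and the sketch assumes the BMC's error message mentions the missing attribute by name.

```python
from urllib.parse import urlparse


class BadRequest(Exception):
    """Hypothetical stand-in for an HTTP 400 error raised by the transport."""


def insert_media(post, action_uri, image_url):
    """Attach image_url as virtual media, retrying with TransferProtocolType.

    `post` is any callable taking (uri, payload). It is assumed to raise
    BadRequest carrying the BMC's error message when a request is rejected.
    """
    payload = {"Image": image_url}
    try:
        return post(action_uri, payload)
    except BadRequest as exc:
        # Retry only when the BMC complains about the missing attribute.
        if "TransferProtocolType" not in str(exc):
            raise
        # Derive the protocol from the image URL scheme, e.g. "HTTP".
        scheme = urlparse(image_url).scheme.upper()
        retry_payload = dict(payload, TransferProtocolType=scheme)
        return post(action_uri, retry_payload)
```

Sending the attribute only on retry keeps the first request unchanged for BMCs that already accept it.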

With regard to the boot sequence, a custom URL needs to be used (it can be fetched from the Redfish Settings resource):

curl -X PATCH -ksu USER:PASS https://BMC_IP/redfish/v1/Systems/Self/SD -d '{"Boot": {"BootSourceOverrideTarget": "Cd", "BootSourceOverrideEnabled": "Once"}}' -H "Content-Type: application/json" -H If-Match:"\"1234567890\""
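
A minimal sketch of how the target URL for the boot override could be chosen, assuming the System resource has already been fetched and parsed into a dict. The `@Redfish.Settings` annotation with its `SettingsObject` link is the standard Redfish way to advertise a Settings resource; the function name and the optional `etag` parameter are hypothetical and are not taken from the sushy patches.

```python
def boot_override_request(system, etag=None, target="Cd", enabled="Once"):
    """Build (uri, body, headers) for a boot source override PATCH.

    `system` is the parsed JSON of the System resource. If it advertises a
    Settings resource, the PATCH goes there; otherwise it goes to the
    System resource itself.
    """
    settings = system.get("@Redfish.Settings") or {}
    settings_uri = (settings.get("SettingsObject") or {}).get("@odata.id")
    uri = settings_uri or system["@odata.id"]

    body = {
        "Boot": {
            "BootSourceOverrideTarget": target,
            "BootSourceOverrideEnabled": enabled,
        }
    }
    headers = {"Content-Type": "application/json"}
    if etag is not None:
        # Some BMCs require If-Match with the ETag of the resource being patched.
        headers["If-Match"] = etag
    return uri, body, headers
```

For a System resource that links to "/redfish/v1/Systems/Self/SD" as its Settings object, this produces a PATCH equivalent to the curl command above.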

Comment 1 Jacob Anders 2022-03-01 11:09:49 UTC
The first fix (boot sequence) has merged: https://review.opendev.org/c/openstack/sushy/+/830553
The other one (vmedia attach) is under review: https://review.opendev.org/c/openstack/sushy/+/830322
There is also a minor follow-up to 830553 ( https://review.opendev.org/c/openstack/sushy/+/831355 ) that needs to merge before we can commence backporting
I will create an OpenShift PR and the required backports once all relevant fixes have merged.

Comment 2 Jacob Anders 2022-03-03 11:42:36 UTC
All upstream changes have merged into master. We do need a sushy release before we can open a PR against openshift/ironic-image next, so keeping the BZ in POST.

Comment 4 Jacob Anders 2022-03-08 00:17:59 UTC
Adding OCP 4.11 ironic-image PR: https://github.com/openshift/ironic-image/pull/257 to get the sushy fixes into the product.

Comment 12 Eric Lajoie 2022-05-10 09:59:18 UTC
The patch was successfully tested with Open Edge 20 firmware 4.50, with no issues deploying OCP. Thanks for picking up my request for this based on Telenor's needs.

Comment 13 Lubov 2022-05-10 23:16:07 UTC
many thanks

Comment 15 errata-xmlrpc 2022-08-10 10:51:27 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Important: OpenShift Container Platform 4.11.0 bug fix and security update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2022:5069