PXE boot is failing when using the iDrac driver in OSP-13z7. Chris has documented the sequence that introduced the failure: 1. We submitted this fix upstream to ironic/master, and it was merged in December of last year: https://review.opendev.org/#/c/617951/ a. The fix made it into stein and is presently in master 2. We backported it to queens and rocky: https://review.opendev.org/#/c/624388/ https://review.opendev.org/#/c/624272/ 3. Red Hat cut z6 which included these patches 4. We removed the monkey patch from JetStream that patched in the fix since the fix was present in z6 5. The ironic community insisted that we remove the fix from queens and rocky because it relied on a newer version of python-dracclient that was beyond what was in driver-requirements.txt. The fix worked fine for RHOSP, but that was because RHOSP was shipping the latest python-dracclient. The community’s point was that the fix would not work for other OpenStack distributions if they elected to not ship the latest python-dracclient, but instead ship the version specified in driver-requirements.txt 6. We backed out the change from queens and rocky even though it was working fine with RHOSP: https://review.opendev.org/#/c/650134/ https://review.opendev.org/#/c/650116/ 7. Digambar started on a new version of the backport that did not depend on a newer version of python-dracclient. At the time, we thought the backport would be quick, but it has taken quite a while to reach its current completed form: https://review.opendev.org/#/c/648360/ This patch has really been through the mill, so hopefully we’ll get some +2’s on it soon Note that this patch is for stable/rocky. The plan was (and is) that once this is approved, backport it to queens, which should be identical to the rocky backport 8. RH cut z7 after the fix was backed out of stable/queens and before the new backport landed 9. But the monkey patch was no longer present in JetStream 10. BOOM! To fix the issue that we were seeing, I put the monkey patch back in JetStream for the time being, and all is well.
The fix in https://review.opendev.org/#/c/648360/ will need to be backported to queens.
The patch from Dell is upstream - https://review.opendev.org/#/c/648360/. We are waiting for it to merge and will backport it to osp-13.
The backport to stable/queens / OSP13 has merged: https://review.opendev.org/#/c/677660/
If this bug requires doc text for errata release, please set the 'Doc Type' and provide draft text according to the template in the 'Doc Text' field. The documentation team will review, edit, and approve the text. If this bug does not require doc text, please set the 'requires_doc_text' flag to -.
Because of lack of hardware was not able to test the fix but have verified that the package is available and verified that it contains the patch. $ sudo rpm -qa | grep ironic python-ironic-inspector-client-3.1.2-1.el7ost.noarch openstack-ironic-common-10.1.9-1.el7ost.noarch puppet-ironic-12.4.0-5.el7ost.noarch openstack-ironic-api-10.1.9-1.el7ost.noarch openstack-ironic-inspector-7.2.4-1.el7ost.noarch python-ironic-lib-2.12.3-1.el7ost.noarch openstack-ironic-conductor-10.1.9-1.el7ost.noarch python2-ironic-neutron-agent-1.0.1-1.el7ost.noarch python2-ironicclient-2.2.2-1.el7ost.noarch openstack-ironic-staging-drivers-0.9.1-2.el7ost.noarch
Sorry for the lateness of this reply, but I have verified that the code is present in the 10/31 puddle. I'm working on verifying it in the lab, but apparently I had a switch die, so I won't be able to test it until tomorrow at the earliest.
FWIW, I have verified that the fix is working as intended on Dell hardware.
According to our records, this should be resolved by openstack-ironic-10.1.9-1.el7ost. This build is available now.
The needinfo request[s] on this closed bug have been removed as they have been unresolved for 1000 days