Bug 1563000 - The pxe_ilo driver ignores power requests under certain conditions with HP BL460
Summary: The pxe_ilo driver ignores power requests under certain conditions with HP BL460
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat OpenStack
Classification: Red Hat
Component: python-proliantutils
Version: 10.0 (Newton)
Hardware: Unspecified
OS: Unspecified
Priority: high
Severity: high
Target Milestone: z9
Target Release: 10.0 (Newton)
Assignee: RHOS Maint
QA Contact: Shai Revivo
URL:
Whiteboard:
Depends On:
Blocks: 1573149 1573150 1573151
 
Reported: 2018-04-02 21:26 UTC by coldford@redhat.com
Modified: 2022-08-09 11:00 UTC (History)

Fixed In Version: python-proliantutils-2.2.0-4.el7ost
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Clones: 1573149 1573150 1573151
Environment:
Last Closed: 2018-09-17 16:59:16 UTC
Target Upstream Version:
Embargoed:


Attachments
ris.py (13.16 KB, application/x-gzip), 2018-04-04 05:23 UTC, PURANDHAR SAIRAM MANNIDI


Links
System ID Private Priority Status Summary Last Updated
Launchpad 1761243 0 None None None 2018-04-04 16:57:56 UTC
OpenStack gerrit 559906 0 None None None 2018-05-08 06:18:28 UTC
Red Hat Issue Tracker OSP-8913 0 None None None 2022-08-09 11:00:31 UTC
Red Hat Product Errata RHBA-2018:2671 0 None None None 2018-09-17 17:00:47 UTC

Description coldford@redhat.com 2018-04-02 21:26:36 UTC
Description of problem:
Ironic fails to complete the deployment even though the disk image is written successfully and the node is shut down.

Version-Release number of selected component (if applicable):
RH OSP 10

How reproducible:
Always


Steps to Reproduce:
1. Import baremetal nodes with the pxe_ilo driver and the appropriate profile. Configure the deploy images and introspect the nodes.
2. Run openstack overcloud deploy
3. The controllers finish and their provisioning state is active, but the compute node goes into deploy_failed.

Actual results:
Compute node deployment fails with the error "iLO failed to change state to power on within 12 sec"

Expected results:
Compute node deployment completes, as it does for the controllers.

Additional info:
Increasing the power_wait timeout to 75 seconds did not help; the same issue occurs. As observed from the iLO web UI, the power state stayed off for a long time, until the error message appeared. Manually powering on the node via the web UI is the current pseudo-workaround.

Comment 1 coldford@redhat.com 2018-04-02 21:32:21 UTC
The customer is using the version (2.2.0-3) that was released in the following erratum:

https://access.redhat.com/errata/RHBA-2018:0365

Issue returned after they upgraded the firmware.

A case has also been opened with HP.

Comment 15 PURANDHAR SAIRAM MANNIDI 2018-04-04 05:23:37 UTC
Created attachment 1417025 [details]
ris.py

Comment 17 PURANDHAR SAIRAM MANNIDI 2018-04-04 05:43:17 UTC
Created attachment 1417026 [details]
Ironic conductor logs

Comment 18 David Vallee Delisle 2018-04-04 16:57:32 UTC
After adding some debug logs, we see that ris.py never enters the retry loop: the code matches the returned model against "Proliant BL", but the model now returned is "ProLiant BL" (capital L), so the case-sensitive match fails.

I've opened a Launchpad bug with a fix: https://bugs.launchpad.net/proliantutils/+bug/1761243
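
For illustration only, here is a minimal sketch of how a case-sensitive model check can silently skip a retry loop. This is not the actual ris.py code or the upstream patch: the function names, the `press_button`/`get_state` callables, and the retry count are all hypothetical, chosen only to show the mismatch between "Proliant BL" and "ProLiant BL".

```python
def matches_case_sensitive(product_name):
    # Hypothetical old-style check: fails for "ProLiant BL..." (capital L).
    return "Proliant BL" in product_name


def matches_case_insensitive(product_name):
    # Hypothetical tolerant check: compares lower-cased strings.
    return "proliant bl" in product_name.lower()


def power_on(product_name, press_button, get_state,
             is_blade=matches_case_insensitive, retries=3):
    """Press the power button once; for blade models, retry until ON.

    press_button and get_state are caller-supplied callables (assumed
    here, not a real proliantutils API).
    """
    press_button()
    if not is_blade(product_name):
        # Non-blade path: no retry, just report the current state.
        return get_state()
    for _ in range(retries):
        if get_state() == "ON":
            break
        press_button()
    return get_state()
```

With the case-sensitive check, a product name such as "ProLiant BL460c Gen9" takes the non-blade path, so a node that ignores the first power request is never asked again. That would match the symptom seen in the iLO web UI, where the node stays off until the timeout error.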

Comment 19 Bob Fournier 2018-04-04 17:37:42 UTC
David - nice find!  I assume that the Product Name is a fixed string and can't be changed through the iLO UI?

Comment 20 Nisha 2018-04-04 18:45:38 UTC
Thanks David for the RCA.

Comment 21 Nisha 2018-05-02 17:16:52 UTC
The fix for this issue has been released in proliantutils v2.5.2. The fix URL is https://review.openstack.org/559906.

Comment 25 Bob Fournier 2018-06-13 13:19:23 UTC
Sai - this will be in the OSP-10z9 release and will be tested when the puddle is available.

Comment 32 Alex McLeod 2018-09-03 07:57:23 UTC
Hi there,

If this bug requires doc text for errata release, please set the 'Doc Type' and provide draft text according to the template in the 'Doc Text' field.

The documentation team will review, edit, and approve the text.

If this bug does not require doc text, please set the 'requires_doc_text' flag to -.

Thanks,
Alex

Comment 35 errata-xmlrpc 2018-09-17 16:59:16 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2018:2671

