Bug 2011422 - After upgrading from 16.1 to 16.2 all baremetal nodes listed in ironic are stuck in maintenance mode in Director & Overcloud
Keywords:
Status: CLOSED DUPLICATE of bug 2007268
Alias: None
Product: Red Hat OpenStack
Classification: Red Hat
Component: openstack-ironic
Version: 16.2 (Train)
Hardware: Unspecified
OS: Unspecified
Priority: unspecified
Severity: unspecified
Target Milestone: ---
Target Release: ---
Assignee: OSP Team
QA Contact:
URL:
Whiteboard:
Depends On:
Blocks:
 
Reported: 2021-10-06 15:01 UTC by Darin Sorrentino
Modified: 2022-08-08 13:13 UTC

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2021-10-12 20:02:59 UTC
Target Upstream Version:
Embargoed:


Links
System | ID | Private | Priority | Status | Summary | Last Updated
Red Hat Issue Tracker | OSP-10230 | 0 | None | None | None | 2022-08-08 13:13:26 UTC

Description Darin Sorrentino 2021-10-06 15:01:05 UTC
Description of problem:

After upgrading from 16.1 to 16.2, Ironic puts every registered node into maintenance mode.

Environment summary:

The environment was a 16.1 DCN deployment with Ironic in the overcloud, which I upgraded to 16.2. While upgrading the overcloud, I noticed that all of the baremetal nodes registered to director were in maintenance mode. After the upgrade completed, I attempted to deploy a baremetal instance in the overcloud and got a NoValidHost error.

Ironic in the overcloud had its baremetal nodes set to maintenance true. I unset maintenance on them and they were immediately switched back to maintenance mode.

I checked Ironic in the undercloud and noted the same behaviour.

ironic-conductor.log shows:

2021-10-06 10:39:15.653 7 ERROR ironic.conductor.manager [req-9d2a85ef-2fca-497b-8ef2-7aa28f10386d - - - - -] During sync_power_state, max retries exceeded for node cad0d948-b462-4d11-af75-65f15ee6ef23, node state None does not match expected state 'None'. Updating DB state to 'None' Switching node to maintenance mode. Error: An exclusive lock is required, but the current context has a shared lock.: ironic.common.exception.ExclusiveLockRequired: An exclusive lock is required, but the current context has a shared lock.
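
To gauge how many nodes are hitting this error, the conductor log can be scanned for the message above. The following is a minimal Python sketch, not part of the original report: the function names are hypothetical, and the regular expression assumes the message wording stays exactly as quoted in this bug.

```python
import re

# Matches the sync_power_state failure quoted in this report and captures
# the node UUID. The wording is taken from the log line above; other Ironic
# versions may phrase it differently (assumption).
SYNC_FAILURE = re.compile(
    r"During sync_power_state, max retries exceeded for node "
    r"(?P<node>[0-9a-f]{8}(?:-[0-9a-f]{4}){3}-[0-9a-f]{12})"
)


def nodes_with_sync_failures(lines):
    """Return unique node UUIDs, in first-seen order, that hit the failure."""
    seen = {}
    for line in lines:
        match = SYNC_FAILURE.search(line)
        if match:
            seen.setdefault(match.group("node"), None)
    return list(seen)


def is_lock_error(line):
    """True if the line also carries the ExclusiveLockRequired exception."""
    return "ExclusiveLockRequired" in line


# Excerpt of the log line from this report, used as sample input.
SAMPLE = (
    "2021-10-06 10:39:15.653 7 ERROR ironic.conductor.manager "
    "[req-9d2a85ef-2fca-497b-8ef2-7aa28f10386d - - - - -] "
    "During sync_power_state, max retries exceeded for node "
    "cad0d948-b462-4d11-af75-65f15ee6ef23, node state None does not "
    "match expected state 'None'. Error: An exclusive lock is required, "
    "but the current context has a shared lock.: "
    "ironic.common.exception.ExclusiveLockRequired"
)

if __name__ == "__main__":
    print(nodes_with_sync_failures([SAMPLE]))
    print(is_lock_error(SAMPLE))
```

To run it against a real log, pass an open file object instead of the sample list, for example `nodes_with_sync_failures(open(path))`, where `path` is wherever ironic-conductor.log lives on the host in question.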



I confirmed I can use ipmitool from within the ironic conductor container on Director using the credentials registered with ironic and successfully obtain a power status on the above referenced node.
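
A manual power-status check like the one described above could be scripted roughly as follows. This is a sketch under assumptions: the helper names are hypothetical, the `lanplus` interface is assumed, and the address and credentials in the usage note are placeholders rather than values from this environment.

```python
import subprocess


def ipmi_power_status_cmd(address, username, password):
    """Build the ipmitool argv for a chassis power-status query."""
    return [
        "ipmitool",
        "-I", "lanplus",   # IPMI v2.0 over LAN (assumed interface)
        "-H", address,     # BMC address
        "-U", username,
        "-P", password,
        "power", "status",
    ]


def ipmi_power_status(address, username, password, timeout=30):
    """Run the query and return ipmitool's stdout, stripped.

    Raises subprocess.CalledProcessError if ipmitool exits non-zero and
    subprocess.TimeoutExpired if the BMC does not answer in time.
    """
    result = subprocess.run(
        ipmi_power_status_cmd(address, username, password),
        capture_output=True,
        text=True,
        timeout=timeout,
        check=True,
    )
    return result.stdout.strip()
```

For example, `ipmi_power_status("192.0.2.10", "admin", "secret")` would query a BMC at a placeholder address. If this succeeds while Ironic still flags the node, the BMC connection itself is unlikely to be the cause, which matches what the lock error in the log suggests.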

Version-Release number of selected component (if applicable):
16.2

How reproducible:


Steps to Reproduce:
1. Upgrade environment from 16.1 to 16.2
2.
3.

Actual results:


Expected results:


Additional info:

Comment 1 Darin Sorrentino 2021-10-06 17:26:30 UTC
This looks like the same issue as https://bugzilla.redhat.com/show_bug.cgi?id=2007268

Testing workaround.

Comment 2 Steve Baker 2021-10-12 20:02:59 UTC

*** This bug has been marked as a duplicate of bug 2007268 ***

