1982051 – resource provider record is not deleted by service delete if any allocation is left on the node after evacuation

Bug 1982051 - resource provider record is not deleted by service delete if any allocation is left on the node after evacuation

Summary: resource provider record is not deleted by service delete if any allocation i...

Keywords:
Status:	CLOSED ERRATA
Alias:	None
Product:	Red Hat OpenStack
Classification:	Red Hat
Component:	openstack-nova
Sub Component:
Version:	16.1 (Train)
Hardware:	Unspecified
OS:	Unspecified
Priority:	medium
Severity:	medium
Target Milestone:	z4
Target Release:	17.1
Assignee:	Amit Uniyal
QA Contact:	OSP DFG:Compute
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+	depends on / blocked

Reported:	2021-07-14 05:13 UTC by Takashi Kajinami
Modified:	2024-11-21 09:38 UTC (History)
CC List:	13 users (show)
Fixed In Version:	openstack-nova-23.2.3-17.1.20240614020750.2ace99d.el9ost
Doc Type:	If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed:	2024-11-21 09:38:04 UTC
Target Upstream Version:
Embargoed:

Attachments	(Terms of Use)

Links
System	ID	Priority	Status	Summary	Last Updated
Launchpad	1829479	None	None	None	2021-07-14 05:16:00 UTC
Red Hat Issue Tracker	OSP-31130	None	None	None	2024-01-10 20:42:56 UTC
Red Hat Issue Tracker	OSP-6174	None	None	None	2024-11-20 12:55:40 UTC
Red Hat Product Errata	RHBA-2024:9974	None	None	None	2024-11-21 09:38:07 UTC

Description Takashi Kajinami 2021-07-14 05:13:47 UTC

Description of problem:

When instances on a compute node are evacuated, allocations about these evacuated instances are still left on that source node.
These allocations are deleted when the nova-compute service on that node is again started.

However if the compute node is completely broken, it is not possible to start nova-compute service again.
In this situation a user is likely to remove the compute node.

Because allocation records are left on the node, "openstack compute service delete" removes only the service record and leave the resource provider record[1].
https://github.com/openstack/nova/blob/1aa571103f90228ddf3dc27386486196ad58ba0e/nova/scheduler/client/report.py#L2152-L2156

If a user tries to delete the remaining resource provider record, the delete request is rejected because allocations are still left.
The resource provider record left prevents adding back nova-compute service when a user tries to readd a new compute node with the same hostname.

Version-Release number of selected component (if applicable):

How reproducible:
Always

Steps to Reproduce:
1. Deploy overcloud with multiple compute nodes and create instances on one of them
2. Shutoff one of the compute nodes
3. Evacuate instances from the compute node
4. Delete nova-compute service of the compute node
5. Delete resource provider record of the compute node

Actual results:
the resource provider record is left after step 4, and can't be deleted in step 5.

Expected results:
the resource provider record is deleted by step 4 or can be deleted in step 5.

Additional info:

Comment 4 Artom Lifshitz 2024-01-31 16:45:05 UTC

Make sure to cherry-pick https://review.opendev.org/c/openstack/nova/+/844753 to 17.1 once that lands on upstream wallaby.

Comment 16 errata-xmlrpc 2024-11-21 09:38:04 UTC

Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (RHOSP 17.1.4 bug fix and enhancement advisory), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2024:9974

Note You need to log in before you can comment on or make changes to this bug.