1396255 – [RFE] Specify specific node for decommissioning in Heat

Keywords:
Status:	CLOSED CURRENTRELEASE
Alias:	None
Product:	Red Hat OpenStack
Classification:	Red Hat
Component:	rhosp-director
Sub Component:
Version:	10.0 (Newton)
Hardware:	Unspecified
OS:	Unspecified
Priority:	unspecified
Severity:	unspecified
Target Milestone:	---
Target Release:	---
Assignee:	Jeff Brown
QA Contact:	Yogev Rabl
Docs Contact:
URL:
Whiteboard:
Depends On:	1298768
Blocks:	1387431 1396252 1414467 1425155

Reported:	2016-11-17 19:25 UTC by jomurphy
Modified:	2021-01-25 12:39 UTC
CC List:	16 users
Fixed In Version:
Doc Type:	Enhancement
Doc Text:
Clone Of:	1298768
Environment:
Last Closed:	2021-01-15 12:44:31 UTC
Target Upstream Version:
Embargoed:

Comment 1 Jeff Brown 2016-11-17 19:47:23 UTC

Today it is not possible to decommission specific nodes with OSPd. We need that capability to correctly scale down storage when required. The capability is needed beyond storage, but this bug specifically addresses the Ceph DFG needs.

Comment 2 Giulio Fidente 2016-11-29 10:10:39 UTC

The end goal for the Ceph DFG would be to delete a storage node without disruptions.

There are two scenarios we'd need to cover:

a) the storage node went down and can't be recovered
b) the storage node is purposely deleted

It seems to me that in both scenarios we could delete the node from the stack using a command like the one we document for the compute nodes [ref1]. Is this correct?

Before deleting the node from the stack, though, the user needs to execute some manual steps to clean up (scenario A) or quiesce (scenario B) the pre-existing storage node, similar to what is documented for the compute nodes [ref2].

To fully automate the process we'll need to be able to:

1) trigger a command execution on DELETE before the resource is actually deleted
2) execute commands on a node other than the one targeted for deletion, to handle scenario A (where the node goes down without notice)
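
As a rough illustration of requirement 2, the choice of where to run the pre-delete commands could be sketched as below. The function name pick_exec_node, the "up"/"down" state values and the node names are hypothetical and only used for illustration; how the node state is detected, and how the commands are actually triggered on DELETE, is left open here.

```shell
#!/bin/sh
# Hypothetical helper: decide which node should run the pre-delete commands.
#   $1 = node targeted for deletion
#   $2 = state of that node: "up" (scenario B) or "down" (scenario A)
#   $3 = another surviving node, assumed to be known by the caller
pick_exec_node() {
    target=${1:-}
    state=${2:-}
    fallback=${3:-}
    case "$state" in
        up)   printf '%s\n' "$target" ;;    # scenario B: quiesce on the node itself
        down) printf '%s\n' "$fallback" ;;  # scenario A: clean up from a surviving node
        *)    echo "unknown state: $state" >&2; return 1 ;;
    esac
}
```

For example, "pick_exec_node ceph-2 down ceph-0" prints ceph-0, meaning the cleanup for the unreachable ceph-2 would have to run on ceph-0.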

ref1. http://tripleo.org/post_deployment/delete_nodes.html
ref2. http://tripleo.org/post_deployment/quiesce_compute.html#quiesce-compute

Comment 8 Giulio Fidente 2021-01-15 12:44:31 UTC

On scale-down, the node to remove can be passed to the "overcloud node delete" command:

$ openstack overcloud node delete $nova_node_id
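
Because the command is destructive, a small guard around it may be useful. The wrapper below is only a sketch: the function name delete_overcloud_node and the DRY_RUN variable are assumptions made for this example, not part of the openstack client. It refuses to run without a node ID and, by default, only prints the command it would execute.

```shell
#!/bin/sh
# Sketch of a guard around the delete command shown above.
# DRY_RUN (hypothetical variable) defaults to 1: print the command, do not run it.
delete_overcloud_node() {
    nova_node_id=${1:-}
    if [ -z "$nova_node_id" ]; then
        echo "usage: delete_overcloud_node <nova_node_id>" >&2
        return 2
    fi
    if [ "${DRY_RUN:-1}" = "1" ]; then
        echo "openstack overcloud node delete $nova_node_id"
    else
        openstack overcloud node delete "$nova_node_id"
    fi
}
```

The node ID is typically read from the output of "openstack server list" on the undercloud; setting DRY_RUN=0 makes the wrapper run the real command.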
