Bug 2283976 - Abort nvmeof deployment when tried to deploy nvmeof on node running nvmeof service already
Keywords:
Status: VERIFIED
Alias: None
Product: Red Hat Ceph Storage
Classification: Red Hat Storage
Component: Cephadm
Version: 7.1
Hardware: Unspecified
OS: Unspecified
Priority: unspecified
Severity: urgent
Target Milestone: ---
Target Release: 8.1
Assignee: Adam King
QA Contact: harika chebrolu
Docs Contact: Rivka Pollack
URL:
Whiteboard:
Duplicates: 2304808
Depends On: 2317218
Blocks: 2351689 2267614 2298578 2298579
Reported: 2024-05-30 11:36 UTC by harika chebrolu
Modified: 2025-06-09 10:03 UTC

Fixed In Version: ceph-19.2.1-121.el9cp
Doc Type: Bug Fix
Doc Text:
.NVMe-oF deployment no longer stops when deploying on a node with an existing NVMe-oF service
Previously, there was no built-in restriction in Cephadm to prevent the deployment of NVMe-oF services, such as gateway groups, multiple times on the same host. As a result, this would sometimes lead to deployment and high availability issues, as previously deployed services on the nodes were disrupted.
With this fix, restrictions are in place, and deploying the same gateway nodes across multiple gateway groups is not supported. Gateway entities, such as subsystems and namespaces, remain intact, and the `nvme-statemap` is preserved, as expected.
Clone Of:
Environment:
Last Closed:
Embargoed:


Links
System | ID | Private | Priority | Status | Summary | Last Updated
Red Hat Issue Tracker | RHCEPH-9115 | 0 | None | None | None | 2024-05-30 11:39:11 UTC

Description harika chebrolu 2024-05-30 11:36:30 UTC
Description of problem:

A second NVMe-oF service gets created when we apply a service that uses a different pool on nodes that already run an NVMe-oF service.

Version-Release number of selected component (if applicable):

version : cp.stg.icr.io/cp/ibm-ceph/nvmeof-rhel9:1.2.13-2

How reproducible:


Steps to Reproduce:
1. Create a pool and deploy an NVMe-oF service on the GW nodes.
2. Create a different pool and, from the client node, apply a service that uses it.

Actual results:

Two services are created.

Expected results:
We do not support two daemons running on the same node. If this happens by mistake at a customer site (which we do not want), the GW code should abort the deployment with an error message when a node already has an nvmeof service and the new spec uses a different pool name. Re-applying with the same pool_name must still be allowed, because we may apply again for various reasons, such as GW scaling.
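
The expected behavior can be illustrated with a minimal Python sketch. This is not the Cephadm implementation: `NvmeofSpec`, `DeploymentAborted`, and `check_nvmeof_placement` are hypothetical names, and the spec is reduced to a service name, a pool, and a list of hosts purely for illustration. It assumes the rule as stated in this report: abort when a target host already runs an nvmeof service backed by a different pool, and allow a re-apply that uses the same pool.

```python
from dataclasses import dataclass


class DeploymentAborted(Exception):
    """Raised when a spec would place a second nvmeof service on a host."""


@dataclass(frozen=True)
class NvmeofSpec:
    # Hypothetical, simplified stand-in for a service spec.
    service_name: str
    pool: str
    hosts: tuple


def check_nvmeof_placement(new_spec, existing_specs):
    """Abort if a target host already runs nvmeof backed by a different pool."""
    for spec in existing_specs:
        if spec.pool == new_spec.pool:
            # Re-applying with the same pool (for example, GW scaling) is allowed.
            continue
        clash = sorted(set(spec.hosts) & set(new_spec.hosts))
        if clash:
            raise DeploymentAborted(
                f"host(s) {', '.join(clash)} already run nvmeof service "
                f"{spec.service_name} (pool {spec.pool}); "
                f"refusing to deploy {new_spec.service_name}"
            )


existing = [NvmeofSpec("nvmeof.pool1", "pool1", ("gw1", "gw2"))]

# Same pool with one more gateway host: accepted.
check_nvmeof_placement(
    NvmeofSpec("nvmeof.pool1", "pool1", ("gw1", "gw2", "gw3")), existing
)

# Different pool on a host that already runs nvmeof: rejected.
try:
    check_nvmeof_placement(NvmeofSpec("nvmeof.pool2", "pool2", ("gw2",)), existing)
except DeploymentAborted as err:
    print(f"aborted: {err}")
```

The check compares pools first so that scaling an existing service never trips the guard; only a different pool landing on an occupied host raises the error.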


Additional info:

Comment 1 Aviv Caro 2024-05-30 12:32:44 UTC
The issue here is restricting deployment of the nvmeof service on more than one pool for 7.1. We need to see how to restrict this for 7.1.z. For 7.1 we will add this to the release notes.

