Bug 2324172

Summary: NVMe Alerts needs update with new features
Product: [Red Hat Storage] Red Hat Ceph Storage
Reporter: Sunil Kumar Nagaraju <sunnagar>
Component: NVMeOF
Assignee: Vallari <vaagrawa>
Status: VERIFIED ---
QA Contact: Rahul Lepakshi <rlepaksh>
Severity: high
Docs Contact: ceph-doc-bot <ceph-doc-bugzilla>
Priority: high
Version: 8.0
CC: acaro, afrahman, aindenba, bdavidov, bkunal, ceph-eng-bugs, cephqe-warriors, gbregman, kjosy, kramaswa, lchernin, linuxkidd, pdhange, rlepaksh, rpollack, tserlin, vumrao
Target Milestone: ---
Keywords: External
Target Release: 8.0z2
Hardware: Unspecified
OS: Unspecified
Whiteboard:
Fixed In Version: ceph-19.2.0-63.el9cp
Doc Type: Bug Fix
Doc Text:
.NVMe-oF gateway alerts are now emitted as expected
Previously, NVMe-oF gateway alerts did not support the multiple gateway configurations or scale limits introduced in IBM Storage Ceph 8.0. As a result, alerts were emitted based on the configurations of older versions, and unexpected alerts were sometimes emitted. With this fix, the NVMe-oF gateway alerts support the current release configurations and all alerts are emitted as expected.
Type: Bug
Bug Blocks: 2317218    

Description Sunil Kumar Nagaraju 2024-11-06 13:49:31 UTC
Description of problem:

In the current build, all available NVMe-oF alerts need to be updated for the new features in 8.0, which are:

- Multiple gateway groups
- Scale numbers
  - 4 gateway groups per cluster
  - 8 gateways (GWs) in a gateway group
  - 1024 namespaces in a gateway group
  - 256 subsystems in a gateway group
- Alerts need to cover all available services and their entities, from gateways to subsystems and namespaces.
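
For illustration only, the limits above can be written as a per-group check. This is a minimal Python sketch, not the actual alert rules shipped with Ceph: `GroupUsage`, `scale_violations`, and the group names are hypothetical, and the choice to flag only counts strictly above a limit is an assumption, since this report does not state the real thresholds.

```python
from dataclasses import dataclass

# Scale numbers quoted in this report for the 8.0 release.
MAX_GROUPS_PER_CLUSTER = 4
MAX_GATEWAYS_PER_GROUP = 8
MAX_NAMESPACES_PER_GROUP = 1024
MAX_SUBSYSTEMS_PER_GROUP = 256


@dataclass
class GroupUsage:
    """Hypothetical per-group counters; not a real Ceph data structure."""
    gateways: int
    namespaces: int
    subsystems: int


def scale_violations(cluster: dict[str, GroupUsage]) -> list[str]:
    """Return one message per exceeded limit, checking every gateway group."""
    problems = []
    if len(cluster) > MAX_GROUPS_PER_CLUSTER:
        problems.append(
            f"cluster: {len(cluster)} gateway groups > {MAX_GROUPS_PER_CLUSTER}"
        )
    # Iterate over all groups rather than a single hard-coded one.
    for name, usage in sorted(cluster.items()):
        checks = [
            ("gateways", usage.gateways, MAX_GATEWAYS_PER_GROUP),
            ("namespaces", usage.namespaces, MAX_NAMESPACES_PER_GROUP),
            ("subsystems", usage.subsystems, MAX_SUBSYSTEMS_PER_GROUP),
        ]
        for label, count, limit in checks:
            if count > limit:  # assumption: only counts above the limit are flagged
                problems.append(f"{name}: {count} {label} > {limit}")
    return problems


if __name__ == "__main__":
    example = {
        "group1": GroupUsage(gateways=8, namespaces=1024, subsystems=256),
        "group2": GroupUsage(gateways=9, namespaces=100, subsystems=300),
    }
    for line in scale_violations(example):
        print(line)
```

In this sketch group1 sits exactly at every limit and is not reported, while group2 is reported for its gateway and subsystem counts. Because the loop visits each group by name, nothing is tied to a single group.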


Currently:
- All NVMe-oF alerts are always bound to a single gateway group (for example, group1).
- All rules still follow the scale numbers of the older release.



Version-Release number of selected component (if applicable):
8.0 19.2.0-52
NVMe 1.3.3-6

Comment 5 Vallari 2025-01-16 11:15:24 UTC
Downstream PR ready: https://gitlab.cee.redhat.com/ceph/ceph/-/merge_requests/893