Bug 1956906

Summary: [RFE][docs] Add dashboard using ceph metrics from collectd
Product: Service Telemetry Framework Reporter: Joanne O'Flynn <joflynn>
Component: DocumentationAssignee: Leif Madsen <lmadsen>
Status: CLOSED DEFERRED QA Contact: Leonid Natapov <lnatapov>
Severity: medium Docs Contact: Joanne O'Flynn <joflynn>
Priority: high    
Version: 1.1CC: lmadsen
Target Milestone: z4Keywords: FutureFeature, Triaged
Target Release: 1.4 (STF)   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard: DFG:Docs DFG:CloudOps Squad:MetMon
Fixed In Version: Doc Type: No Doc Update
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2022-08-18 18:32:25 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On: 1702811    
Bug Blocks:    

Description Joanne O'Flynn 2021-05-04 15:47:27 UTC
Currently Ceph Metrics and the Ceph Dashboard are separate installs and separate "panes of glass" to view compared to all other metrics gathered by Service Assurance into its Prometheus instances.  

This RFE is to integrate Ceph metrics and/or Ceph dashboard into SAF so that a customer can view metrics for an entire OpenStack instance through SAF.

Comment 2 Leif Madsen 2021-12-15 18:46:37 UTC
I need to break this out into two separate issues. One is on the OSP side where the Ceph plugin for collectd is not installed on the Ceph Mons which are (by default) not scheduled on the storage nodes (they are scheduled on the controllers). This results in limited data for monitoring Ceph. Step 1 is to update the puppet in OSP (and perform the backports) so that the ceph plugin is available on the controllers (or wherever the mons are scheduled).

Second to this is the documentation, testing, and verification of the ceph dashboard, which does exist in the dashboards repository already. Maybe need refinement for multi-cloud purposes, but we have a pattern we can follow there.

Leif to break this out as 2 separate issues for tracking and delivery.

Comment 3 Leif Madsen 2022-08-18 18:32:25 UTC
This task is kind of overloaded and will change to something different in the future than what was originally scoped. I'm just going to close this for now, as it will need a rescoping effort as part of STF 2.0 and beyond.