Currently, Ceph Metrics and the Ceph Dashboard are installed separately and are a separate "pane of glass" from all the other metrics that Service Assurance gathers into its Prometheus instances. This RFE is to integrate Ceph metrics and/or the Ceph dashboard into SAF so that a customer can view metrics for an entire OpenStack instance through SAF.
I need to break this out into two separate issues. The first is on the OSP side: the Ceph plugin for collectd is not installed on the nodes running the Ceph mons, which by default are scheduled on the controllers rather than on the storage nodes. This results in limited data for monitoring Ceph. Step 1 is to update the puppet in OSP (and perform the backports) so that the ceph plugin is available on the controllers (or wherever the mons are scheduled). The second is the documentation, testing, and verification of the Ceph dashboard, which already exists in the dashboards repository. It may need refinement for multi-cloud purposes, but we have a pattern we can follow there. Leif to break this out as 2 separate issues for tracking and delivery.
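To make the first item concrete, here is a rough sketch of the kind of collectd stanza the puppet change would need to end up rendering on the nodes that host the mons. The `<Plugin ceph>`, `<Daemon>`, and `SocketPath` directives are the collectd ceph plugin's own. Everything else is an assumption for illustration: the helper name `render_ceph_plugin`, the daemon name `mon.controller-0`, and the admin socket path, which may well differ in a containerized OSP deployment. This is not the actual puppet template.

```python
def render_ceph_plugin(daemon_sockets):
    """Render a collectd ceph plugin stanza (hypothetical helper).

    daemon_sockets maps a daemon name (for example "mon.controller-0")
    to the path of that daemon's admin socket on the local node.
    """
    lines = ["LoadPlugin ceph", "<Plugin ceph>"]
    # Sort by daemon name so the rendered config is stable between runs.
    for name, socket_path in sorted(daemon_sockets.items()):
        lines.append(f'  <Daemon "{name}">')
        lines.append(f'    SocketPath "{socket_path}"')
        lines.append("  </Daemon>")
    lines.append("</Plugin>")
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Assumed daemon name and socket path; adjust to the real deployment.
    print(render_ceph_plugin({
        "mon.controller-0": "/var/run/ceph/ceph-mon.controller-0.asok",
    }))
```

The point of the sketch is only that the plugin has to be configured per daemon on whichever node hosts it, which is why enabling it on the storage nodes alone misses the mons.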
This task is overloaded, and what it turns into will differ from what was originally scoped. I'm going to close this for now, as it will need a rescoping effort as part of STF 2.0 and beyond.