Description of problem: In 4.4 a new mechanism to configure telemetry was introduced, this silently dropped the comments that described what each metric represents. These descriptions should be available for transparency. Version-Release number of selected component (if applicable): How reproducible: Steps to Reproduce: 1. 2. 3. Actual results: Expected results: Additional info:
tested with 4.4.0-0.nightly-2020-02-03-163409, just some defects in the comments. the following 3 metrics are not the same with that in the comments # job:ceph_pools_iops:total is the total iops (reads+writes) value in bytes for all the pools in ceph cluster - '{__name__="job:ceph_pools_iops_bytes:total"}' # cluster:network_attachment_definition_enabled_instance_up informs (1 or 0) if the cluster has # at least max of one instance with k8s.v1.cni.cncf.io/networks annotation, labelled by networks (any or sriov). - '{__name__="cluster:network_attachment_definition_enabled_instance_up:max"}' # insightsclient_request_send tracks the number of metrics sends. - '{__name__="insightsclient_request_send_total"}'
This sounds like the problem with the metrics as we did not touch any of the recording rules, think we should verify to confirm this before these changes landed.
I double checked and while not totally correct as you identified, those are the comments those entries had previously, see: https://github.com/openshift/telemeter/blob/de3c19e9675f55b22e53104094926c584c5a8a60/docs/data-collection.md As this bug was about transferring the comments, I would call this verified, and we can open a bug against each component that is responsible for these metrics, to have them fix the comments.
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2020:0581
The needinfo request[s] on this closed bug have been removed as they have been unresolved for 1000 days