Bug 2003792

Summary: Monitoring metrics query graph flyover panel is useless
Product: OpenShift Container Platform Reporter: Ben England <bengland>
Component: Monitoring Assignee: Andrew Pickering <anpicker>
Status: CLOSED ERRATA QA Contact: Junqi Zhao <juzhao>
Severity: low Docs Contact:
Priority: medium    
Version: 4.8 CC: amuller, anpicker, aos-bugs, erooth, hongyli, pgough
Target Milestone: ---   
Target Release: 4.10.0   
Hardware: Unspecified   
OS: All   
Whiteboard:
Fixed In Version: Doc Type: No Doc Update
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2022-03-10 16:10:01 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Attachments:
Description Flags
screenshot of monitoring metrics GUI with flyover panel none

Description Ben England 2021-09-13 17:49:40 UTC
Description of problem:

(originally reported to me by Matt Currier) 
The flyover panel in the Console -> Monitoring -> Metrics GUI is useless. Consequently, you cannot evaluate a metric's absolute value at any point in time other than the present (the present value is shown in the table below the graph).


Version-Release number of selected component (if applicable):

OCP 4.8


How reproducible:

Every time


Steps to Reproduce:
1. Install OCP 4.8
2. Start a workload that performs I/O to storage devices
3. Try to monitor storage block device throughput

I don't believe this bug is specific to storage devices; this is just the example I'm using. Details are below.


Actual results:

The flyover panel is useless because you cannot determine which device name and host (instance) correspond to a particular number. See the example screenshot below.


Expected results:

I should be able to move my mouse to a point over the x-axis and see all the values in descending order of value, along with the parameters that led to these values.   In this example, the parameters that should be displayed are the ones that are different for each curve, specifically device name and host (instance).
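The behavior requested above can be sketched roughly as follows. This is only an illustration of the idea, not the console's actual implementation: the function names (differing_labels, tooltip_rows), the sample label sets, and the sample values are all hypothetical.

```python
from typing import Dict, List, Tuple

# One series as seen at the hovered timestamp: (labels, value).
Series = Tuple[Dict[str, str], float]


def differing_labels(series: List[Series]) -> List[str]:
    """Return the label names whose values are not identical across all series."""
    names = sorted({name for labels, _ in series for name in labels})
    return [n for n in names if len({labels.get(n) for labels, _ in series}) > 1]


def tooltip_rows(series: List[Series]) -> List[Tuple[str, float]]:
    """Build tooltip rows: only the distinguishing labels, sorted by value descending."""
    keys = differing_labels(series)
    ordered = sorted(series, key=lambda s: s[1], reverse=True)
    return [
        (", ".join(f"{k}={labels.get(k, '')}" for k in keys), value)
        for labels, value in ordered
    ]


# Hypothetical sample data; "job" is the same everywhere, so it is omitted from the rows.
sample: List[Series] = [
    ({"device": "nvme0n1", "instance": "host-a", "job": "node-exporter"}, 120.0),
    ({"device": "nvme1n1", "instance": "host-a", "job": "node-exporter"}, 340.5),
    ({"device": "nvme0n1", "instance": "host-b", "job": "node-exporter"}, 75.25),
]

for text, value in tooltip_rows(sample):
    print(f"{value:>8.2f}  {text}")
```

With this kind of filtering, labels shared by every curve (here "job") are dropped, so the panel would only need space for device and instance.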


Additional info:

I added a link to a doc.

I'll attach a screenshot that illustrates the problem. In this case, the metric queries being displayed were:

irate(node_disk_writes_completed_total{device=~"nvme[0-9]n1"}[1m])

and

irate(node_disk_reads_completed_total{device=~"nvme[0-9]n1"}[1m])
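
For reference, the device=~"nvme[0-9]n1" matcher in both queries selects only whole-device names, because Prometheus regex matchers are fully anchored. The sketch below imitates that with Python's re.fullmatch; Python's regex engine is used here as a stand-in for Prometheus's RE2 syntax (the two agree for this simple pattern), and the candidate device names are made up for illustration.

```python
import re

# Same pattern as in the queries above. Prometheus anchors it at both ends,
# which re.fullmatch reproduces.
DEVICE_PATTERN = re.compile(r"nvme[0-9]n1")


def matches_device(name: str) -> bool:
    """Return True if the device label value would be selected by the matcher."""
    return DEVICE_PATTERN.fullmatch(name) is not None


# Hypothetical device names, not taken from the reported cluster.
candidates = ["nvme0n1", "nvme9n1", "nvme10n1", "nvme0n1p1", "sda"]
matched = [d for d in candidates if matches_device(d)]
print(matched)
```

So partitions such as nvme0n1p1 and two-digit controllers such as nvme10n1 would not appear among the curves.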

Comment 1 Ben England 2021-09-13 17:51:35 UTC
Created attachment 1822831 [details]
screenshot of monitoring metrics GUI with flyover panel

This screenshot just illustrates what I was talking about in the bz.

Comment 8 Junqi Zhao 2021-10-28 08:16:30 UTC
Checked with 4.10.0-0.nightly-2021-10-27-230233 and searched with "irate(node_disk_writes_completed_total{device=~"nvme[0-9]n1"}[1m])". The labels are now ordered intelligently so that those with different values are visible; in the picture, the pod label is shown first.

Comment 12 errata-xmlrpc 2022-03-10 16:10:01 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Moderate: OpenShift Container Platform 4.10.3 security update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2022:0056