Bug 2026144

Summary: Utilization values are misreported and isn't matching
Product: [Red Hat Storage] Red Hat OpenShift Data Foundation Reporter: Aman Agrawal <amagrawa>
Component: ocs-operatorAssignee: arun kumar mohan <amohan>
Status: CLOSED CURRENTRELEASE QA Contact: Tiffany Nguyen <tunguyen>
Severity: high Docs Contact:
Priority: unspecified    
Version: 4.9CC: amohan, badhikar, bkunal, mbukatov, mmuench, muagarwa, musoni, nberry, nigoyal, ocs-bugs, odf-bz-bot, sostapov
Target Milestone: ---   
Target Release: ODF 4.12.0   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: 4.12.0-100 Doc Type: No Doc Update
Doc Text:
Story Points: ---
Clone Of:
: 2027681 (view as bug list) Environment:
Last Closed: 2023-02-08 14:06:28 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 2027681    

Comment 2 Nitin Goyal 2021-11-24 05:21:55 UTC
@badhikar Can you pls take a look?

Comment 3 Bipul Adhikari 2021-11-24 06:13:51 UTC
Utilization card reports the actual utilization of storage(non-replicated, actual storage should be 3 x value + staticUsage) . It's the non-replicated storage, whereas the main dashboard shows the exact raw capacity used of the system. 
One of the reasons we cannot show the logical capacity(non-replicated) in the main Overview(ODF) dashboard is that the total logical capacity is dependent on various factors such as number of pools and their replica counts which can change the total capacity of the system all the time. 
This is a technical limitation and I see no workarounds at this point.

Comment 7 Bipul Adhikari 2021-12-08 04:58:57 UTC
I have sent the findings of the bug on an email thread.

Comment 9 arun kumar mohan 2021-12-20 18:44:32 UTC
Send a PR: https://github.com/red-hat-storage/ocs-operator/pull/1433, in OCS-Operator.
This PR will reduce the collection interval of that particular rule group, so that the metrics collected will be more granular.

Comment 12 Nitin Goyal 2022-03-01 07:34:55 UTC
Moving it to ocs as it is fixed there.

Comment 16 Mudit Agarwal 2022-03-14 03:31:40 UTC
Not a 4.10 blocker, moving it out.

Arun, PTAL.

Comment 20 arun kumar mohan 2022-07-01 19:16:34 UTC
Pointing out some general observations,

ODF Overview page graphs (IOPS, Throughput and Latency) shows the sum of Read+Writes of I/O when compared with the separate Read and Write graphs in StorageSystem's -> Overview Block & File page. So both set of graphs won't have a one to one correlation.

For,
IOPS
      ODF Overview page and StorageSystem Overview page has different time intervals.

Throughput
      Underlying queries are different with different collection interval.

Latency
     Again two totally different queries (where one taking an average and the other taking sum).

For IOPS we can correct the time interval and make it same and for the other two need to check/investigate a bit further on the queries.

Comment 23 arun kumar mohan 2022-08-01 10:08:34 UTC
PR submitted to both ODF-Console and OCS-Operator repos

https://github.com/red-hat-storage/ocs-operator/pull/1761
https://github.com/red-hat-storage/odf-console/pull/352

Comment 24 Tiffany Nguyen 2022-11-21 21:32:23 UTC
Using version 4.12.0-114: IOPS, Throughput, Latency from Performance card and Used Capacity value from System Capacity are very closed to each other.