Bug 2244623
Summary: | [CDI] Metrics are missing default value | ||
---|---|---|---|
Product: | Container Native Virtualization (CNV) | Reporter: | Aviv Litman <alitman> |
Component: | Metrics | Assignee: | Aviv Litman <alitman> |
Status: | CLOSED MIGRATED | QA Contact: | Ahmad <ahafe> |
Severity: | medium | Docs Contact: | |
Priority: | unspecified | ||
Version: | 4.14.0 | CC: | dbasunag, kmajcher, sradco, stirabos |
Target Milestone: | --- | ||
Target Release: | 4.14.2 | ||
Hardware: | All | ||
OS: | All | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | Bug Fix | |
Doc Text: |
Cause:
Some metrics have no default value.
Consequence:
Metrics that have no value are not available in the Prometheus UI.
Fix:
Add a default value to all metrics.
Result:
All metrics are available in the Prometheus UI.
|
Story Points: | --- |
Clone Of: | Environment: | ||
Last Closed: | 2023-12-05 13:41:36 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Description
Aviv Litman
2023-10-17 11:42:54 UTC
Shirly, Debarati believes that this might be by design, can you please take a look and advise?

From what I understand this is indeed a bug and we should report zero as the default value, which is generally a good practice.

Reporting zero as a default accurately reflects the state of the system when it has not experienced any restarts. This is a clear and unambiguous way to indicate that there have been no restarts up to that point.

Prometheus works best with continuous time series data. Having a consistent metric (like a restart count starting at zero and incrementing) makes it easier to write queries and create meaningful visualizations. It helps in understanding trends over time and detecting anomalies.

When we have a consistent baseline (zero in this case), it becomes easier to set up alerts. For example, we might want to be alerted when the restart count exceeds a certain threshold. If the metric is always present, it's simpler to define these alerts.

If we don't report anything until a restart occurs, we might not be able to easily differentiate between a lack of data (due to issues like collection problems or system down) and a situation where there simply haven't been any restarts. Reporting zero eliminates this ambiguity.
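
To illustrate the difference between "no series until the first event" and "series present at zero", here is a minimal sketch. It uses a toy in-memory counter, not the Prometheus client library and not the actual CDI code; the class `ToyCounterVec`, the metric name `toy_restart_count`, and the `pod` label are all hypothetical and exist only for this example.

```python
class ToyCounterVec:
    """Hypothetical stand-in for a labelled counter (not a real client API)."""

    def __init__(self, name, default_labels=()):
        self.name = name
        self._series = {}
        # Eagerly create a zero-valued series for every label value known up front,
        # so the metric is exposed even before the first increment.
        for label in default_labels:
            self._series[label] = 0

    def inc(self, label):
        # Lazily creates the series on first use if it was not initialized.
        self._series[label] = self._series.get(label, 0) + 1

    def expose(self):
        """Return the sample lines a scrape would see, sorted by label."""
        return [
            f'{self.name}{{pod="{label}"}} {value}'
            for label, value in sorted(self._series.items())
        ]


# Without a default value: nothing is exposed until a restart happens.
lazy = ToyCounterVec("toy_restart_count")
print(lazy.expose())   # []

# With a default value: the series exists at zero from the start.
eager = ToyCounterVec("toy_restart_count", default_labels=["importer"])
print(eager.expose())  # ['toy_restart_count{pod="importer"} 0']
```

In the first case an empty result cannot be told apart from a failed scrape, while in the second case a sample with value 0 explicitly says that no restarts have occurred.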