Bug 2244623
| Summary: | [CDI] Metrics are missing default value | ||
|---|---|---|---|
| Product: | Container Native Virtualization (CNV) | Reporter: | Aviv Litman <alitman> |
| Component: | Metrics | Assignee: | Aviv Litman <alitman> |
| Status: | CLOSED MIGRATED | QA Contact: | Ahmad <ahafe> |
| Severity: | medium | Docs Contact: | |
| Priority: | unspecified | ||
| Version: | 4.14.0 | CC: | dbasunag, kmajcher, sradco, stirabos |
| Target Milestone: | --- | ||
| Target Release: | 4.14.2 | ||
| Hardware: | All | ||
| OS: | All | ||
| Whiteboard: | |||
| Fixed In Version: | Doc Type: | Bug Fix | |
| Doc Text: |
Cause:
no deafult value for some metrics
Consequence:
some metrics is not available on Prometheus UI if they have no value.
Fix:
Add default value to all metrics.
Result:
All metrics are available in Prometheus UI.
|
Story Points: | --- |
| Clone Of: | Environment: | ||
| Last Closed: | 2023-12-05 13:41:36 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | Category: | --- | |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | |||
|
Description
Aviv Litman
2023-10-17 11:42:54 UTC
Shirly, Debarati believes that this might be by design, can you please take a look and advise? From what I understand this is indeed a bug and we should report zero as the default value, which is generally a good practice. Reporting zero as a default accurately reflects the state of the system when it has not experienced any restarts. This is a clear and unambiguous way to indicate that there have been no restarts up to that point. Prometheus works best with continuous time series data. Having a consistent metric (like a restart count starting at zero and incrementing) makes it easier to write queries and create meaningful visualizations. It helps in understanding trends over time and detecting anomalies. When we have a consistent baseline (zero in this case), it becomes easier to set up alerts. For example, we might want to be alerted when the restart count exceeds a certain threshold. If the metric is always present, it's simpler to define these alerts. If we don't report anything until a restart occurs, we might not be able to easily differentiate between a lack of data (due to issues like collection problems or system down) and a situation where there simply haven't been any restarts. Reporting zero eliminates this ambiguity. |