Bug 1949123
Summary: | Node_filesystem_usage is not being collected and it's not possible to modify the Operator Object | ||
---|---|---|---|
Product: | OpenShift Container Platform | Reporter: | Odilon Sousa <osousa> |
Component: | Monitoring | Assignee: | Damien Grisonnet <dgrisonn> |
Status: | CLOSED ERRATA | QA Contact: | Junqi Zhao <juzhao> |
Severity: | low | Docs Contact: | |
Priority: | unspecified | ||
Version: | 3.11.0 | CC: | alegrand, anpicker, dgrisonn, erooth, hongyli, hyagi, kakkoyun, lcosic, pkrupa, spasquie |
Target Milestone: | --- | ||
Target Release: | 3.11.z | ||
Hardware: | x86_64 | ||
OS: | Linux | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | No Doc Update | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2021-06-09 17:06:30 UTC | Type: | Bug |
Description
Odilon Sousa
2021-04-13 13:25:09 UTC
It's intended that your changes to the monitoring stack are not persisted, as we don't want any user to break their stack. The only way to customize the stack is by tweaking some predefined Ansible variables during installation, but that wouldn't allow you to modify Prometheus rules.

In your case, this might be because of a regression in the kernel, considering your discovery, but we might still be able to improve the current Prometheus rule. From what I can see, it is not really meaningful to only consider the `/` or `/host/root` mountpoint, as we want to account for all the filesystems. I'll update the recording rule to reflect that.

We suspect that there might be something else to this bug. Could you please provide the list of mountpoints shown by the `sum(node_filesystem_size_bytes) by (mountpoint) > 0` query with both kernel versions?

Checked with ose-cluster-monitoring-operator/images/v3.11.445; the expr for "instance:node_filesystem_usage:sum" is updated:

    - expr: sum((node_filesystem_size{mountpoint="/host/root"} - node_filesystem_free{mountpoint="/host/root"})) BY (instance)
      record: instance:node_filesystem_usage:sum

Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Important: OpenShift Container Platform 3.11.452 bug fix and security update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2021:2150
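To illustrate why a mountpoint filter in the recording rule can leave an instance with no `instance:node_filesystem_usage:sum` value at all, here is a minimal Python sketch that imitates the rule's arithmetic on in-memory data. It is not how Prometheus evaluates rules internally. The node names, mountpoints, and byte values are hypothetical, and `usage_by_instance` is an illustrative helper, not part of any library.

```python
from collections import defaultdict

# Hypothetical samples keyed by (instance, mountpoint); values are bytes.
size = {
    ("node-a", "/host/root"): 100,
    ("node-a", "/host/root/var"): 50,
    ("node-b", "/"): 200,
}
free = {
    ("node-a", "/host/root"): 40,
    ("node-a", "/host/root/var"): 10,
    ("node-b", "/"): 150,
}


def usage_by_instance(size, free, mountpoint=None):
    """Imitate sum((size - free)) BY (instance).

    If mountpoint is given, only series with that exact label value are
    considered, like a {mountpoint="..."} selector.
    """
    totals = defaultdict(int)
    for key, total in size.items():
        instance, mp = key
        if mountpoint is not None and mp != mountpoint:
            continue  # series filtered out by the selector
        if key not in free:
            continue  # no series with the same labels on the right-hand side
        totals[instance] += total - free[key]
    return dict(totals)


# With the filter, node-b reports nothing because it has no "/host/root" series.
filtered = usage_by_instance(size, free, mountpoint="/host/root")
# Without the filter, every mountpoint of every instance is counted.
unfiltered = usage_by_instance(size, free)
print(filtered)
print(unfiltered)
```

In this sketch, an instance whose root filesystem is exposed under a different mountpoint label drops out of the filtered result, which is one way the symptom in this report could arise. The mountpoint listing requested above would help confirm or rule that out.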