tested with the PR, search metrics from prometheus with
count(container_fs_writes_total) by (id)
count(container_blkio_device_usage_total) by (id)
could see the metrics with id
{id="/"}
{id="/system.slice"}
{id="/system.slice/crio.service"}
{id="/kubepods.slice"}
{id="/kubepods.slice/kubepods-besteffort.slice"}
note: if you want to see result for "{id="/kubepods.slice/kubepods-besteffort.slice"}", make sure you have qosClass: BestEffort running pods
# oc -n openshift-monitoring get servicemonitor kubelet -oyaml
...
metricRelabelings:
- action: drop
regex: container_(network_tcp_usage_total|network_udp_usage_total|tasks_state|cpu_load_average_10s)
sourceLabels:
- __name__
- action: drop
regex: (container_spec_.*|container_file_descriptors|container_sockets|container_threads_max|container_threads|container_start_time_seconds|container_last_seen);;
sourceLabels:
- __name__
- pod
- namespace
- action: drop
regex: (container_blkio_device_usage_total);.+
sourceLabels:
- __name__
- container
- action: drop
regex: container_memory_failures_total
sourceLabels:
- __name__
- action: drop
regex: (container_fs_.*);.+
sourceLabels:
- __name__
- container
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.
For information on the advisory (OpenShift Container Platform 4.9.4 bug fix update), and where to find the updated
files, follow the link below.
If the solution does not work for you, open a new bug report.
https://access.redhat.com/errata/RHBA-2021:3935
tested with the PR, search metrics from prometheus with count(container_fs_writes_total) by (id) count(container_blkio_device_usage_total) by (id) could see the metrics with id {id="/"} {id="/system.slice"} {id="/system.slice/crio.service"} {id="/kubepods.slice"} {id="/kubepods.slice/kubepods-besteffort.slice"} note: if you want to see result for "{id="/kubepods.slice/kubepods-besteffort.slice"}", make sure you have qosClass: BestEffort running pods # oc -n openshift-monitoring get servicemonitor kubelet -oyaml ... metricRelabelings: - action: drop regex: container_(network_tcp_usage_total|network_udp_usage_total|tasks_state|cpu_load_average_10s) sourceLabels: - __name__ - action: drop regex: (container_spec_.*|container_file_descriptors|container_sockets|container_threads_max|container_threads|container_start_time_seconds|container_last_seen);; sourceLabels: - __name__ - pod - namespace - action: drop regex: (container_blkio_device_usage_total);.+ sourceLabels: - __name__ - container - action: drop regex: container_memory_failures_total sourceLabels: - __name__ - action: drop regex: (container_fs_.*);.+ sourceLabels: - __name__ - container