1810838 – [3.11] - alert KubePersistentVolumeFullInFourDays is showing even there is enough storage

Keywords:
Status:	CLOSED CURRENTRELEASE
Alias:	None
Product:	OpenShift Container Platform
Classification:	Red Hat
Component:	Monitoring
Sub Component:
Version:	3.11.0
Hardware:	Unspecified
OS:	Unspecified
Priority:	unspecified
Severity:	high
Target Milestone:	---
Target Release:	3.11.z
Assignee:	Pawel Krupa
QA Contact:	Junqi Zhao
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:

Reported:	2020-03-06 01:17 UTC by Vladislav Walek
Modified:	2023-09-07 22:13 UTC
CC List:	8 users
Fixed In Version:
Doc Type:	If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed:	2020-04-02 10:46:28 UTC
Target Upstream Version:
Embargoed:

Description Vladislav Walek 2020-03-06 01:17:33 UTC

Description of problem:

I have a customer for whom the alert "KubePersistentVolumeFullInFourDays" fires, predicting that the logging-es-0 PVC will fill up within 4 days.

However, running the query in Prometheus shows that there is enough storage.

The graph in the attachments shows a brief spike during which the value drops below 0 (the alert condition), but it stabilizes at the correct value after a couple of minutes.
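
To illustrate how a prediction can dip below 0 for a short time while the volume still has plenty of free space, here is a minimal Python sketch. It assumes the alert extrapolates available bytes with a least-squares line, in the spirit of Prometheus's predict_linear(); the exact rule expression shipped in 3.11 is not quoted in this bug. The helper name, the 6-hour window, the 5-minute sample interval, and the dip to 2.0e10 bytes are all hypothetical; only the flat value 4.838066176e+10 comes from the alert annotation.

```python
FOUR_DAYS = 4 * 24 * 3600  # prediction horizon in seconds


def predict_linear(samples, horizon_s):
    """Fit a least-squares line through (timestamp, value) samples and
    evaluate it horizon_s seconds after the last sample.

    Hypothetical helper: a simplified stand-in for linear extrapolation,
    not the actual Prometheus implementation.
    """
    n = len(samples)
    mean_t = sum(t for t, _ in samples) / n
    mean_v = sum(v for _, v in samples) / n
    cov = sum((t - mean_t) * (v - mean_v) for t, v in samples)
    var = sum((t - mean_t) ** 2 for t, _ in samples)
    slope = cov / var
    t_end = samples[-1][0] + horizon_s
    return mean_v + slope * (t_end - mean_t)


# Assumed window: 6 hours of samples, one every 5 minutes (72 samples).
timestamps = [i * 300 for i in range(72)]

# Available bytes stay flat at the value reported in the alert annotation.
steady = [(t, 4.838066176e10) for t in timestamps]

# Same series, but the last three samples (15 minutes) report a made-up
# lower value, standing in for a short-lived anomaly in the metric.
with_dip = steady[:-3] + [(t, 2.0e10) for t in timestamps[-3:]]

print("steady prediction:  ", predict_linear(steady, FOUR_DAYS))
print("with-dip prediction:", predict_linear(with_dip, FOUR_DAYS))
```

With the flat series the prediction stays at the current free space, while the few low samples at the end of the window tilt the fitted line enough that the four-day extrapolation falls below 0. Once such samples age out of the window, the prediction returns to a positive value, which would match a spike that disappears after a short time.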

Labels
alertname = KubePersistentVolumeFullInFourDays
cluster = <cluster>
endpoint = https-metrics
instance = <node-ip>:10250
job = kubelet
namespace = openshift-logging
persistentvolumeclaim = logging-es-0
prometheus = openshift-monitoring/k8s
service = kubelet
severity = critical
Annotations
message = Based on recent sampling, the persistent volume claimed by logging-es-0 in namespace openshift-logging is expected to fill up within four days. Currently 4.838066176e+10 bytes are available.


Version-Release number of selected component (if applicable):
OpenShift Container Platform 3.11

How reproducible:
n/a

Steps to Reproduce:
1.
2.
3.

Actual results:


Expected results:


Additional info:
The problem could be related to bug https://bugzilla.redhat.com/1809375, where the Prometheus pods are constantly restarted.

Comment 6 Pawel Krupa 2020-04-02 10:46:28 UTC

This is a prediction-based alert that should have warning severity. It has no direct impact on a running cluster, and there is a separate `KubePersistentVolumeUsageCritical` alert which, if firing, requires a fast reaction.

KubePersistentVolumeFullInFourDays is greatly improved in the latest OpenShift versions, and we currently have no plans to backport those improvements.