Bug 1980844
Summary: | The SystemMemoryExceedsReserved alert released in 4.6 seems to trigger on many clusters under load (default increase if possible?) | ||
---|---|---|---|
Product: | OpenShift Container Platform | Reporter: | pmoses |
Component: | Node | Assignee: | Harshal Patil <harpatil> |
Node sub component: | Kubelet | QA Contact: | Sunil Choudhary <schoudha> |
Status: | CLOSED ERRATA | Docs Contact: | |
Severity: | medium | ||
Priority: | unspecified | CC: | aos-bugs, ccoleman, harpatil, kgordeev, lmohanty, openshift-bugzilla-robot, pmoses, rphillips, sdodson, snalawad, sramanat, wking |
Version: | 4.6 | Keywords: | Reopened, Upgrades |
Target Milestone: | --- | ||
Target Release: | 4.9.0 | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | Bug Fix | |
Doc Text: |
Cause: Alerts would fire at 90% utilization.
Consequence:
Fix: Alerts will now fire at 95% utilization.
Result:
|
Story Points: | --- |
Clone Of: | Environment: | ||
Last Closed: | 2021-10-18 17:39:11 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: | |||
Bug Depends On: | |||
Bug Blocks: | 2000500 |
Description
pmoses
2021-07-09 16:00:39 UTC
Same error was seen on my customer environment as well. System memory usage of 1.073G on Node <redacted> exceeds 90% of the reservation. Reserved memory ensures system processes can function even when the node is fully allocated and protects against workload out of memory events impacting the proper functioning of the node. The reservation may be increased (https://docs.openshift.com/container-platform/latest/nodes/nodes/nodes-nodes-managing.html) when running nodes with high numbers of pods. Please contact me directly if a list of recent cases opened for this matter if this would be a help. (query on 'System memory usage exceeds 90% of the reservation.' will provide all cases) pmoses *** Bug 1993218 has been marked as a duplicate of this bug. *** Verified on 4.9.0-0.nightly-2021-09-01-193941. $ oc get clusterversion NAME VERSION AVAILABLE PROGRESSING SINCE STATUS version 4.9.0-0.nightly-2021-09-01-193941 True False 125m Cluster version is 4.9.0-0.nightly-2021-09-01-193941 Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Moderate: OpenShift Container Platform 4.9.0 bug fix and security update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHSA-2021:3759 |