Created attachment 1555285 [details]
We need to remove limit blocks from the following containers in the prom pod
As it is, there is needless cpu throttling and likely needless OOM killing (theorized, not observed), especially with the 10Mi limit container.
Created attachment 1555286 [details]
Unfortunately these are hardcoded into the prometheus-operator as of right now. We're going to go with the following strategy:
* Patch the fork we ship in OpenShift to completely remove the limits.
* Upstream since has added flags to configure these requests/limits, in addition we will add that the 0 value will make the limit/request be removed entirely, and this is what we'll switch to in a future OpenShift version.
The PR patch for our fork has been opened: https://github.com/openshift/prometheus-operator/pull/24
And the PR to allow configuring this to be disabled on upstream: https://github.com/coreos/prometheus-operator/pull/2560
The change that fixes this in the immediate situation is merged.
There is not available OCP payload which packages the fix to test, so postpone the testing until we have available payload
resource limits for alertmanager-main/prometheus-k8s statefulset are removed
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.
For information on the advisory, and where to find the updated
files, follow the link below.
If the solution does not work for you, open a new bug report.