Hide Forgot
Created attachment 1555285 [details] throttle.png We need to remove limit blocks from the following containers in the prom pod name: prometheus-config-reloader resources: limits: cpu: 50m memory: 50Mi requests: cpu: 50m memory: 50Mi name: rules-configmap-reloader resources: limits: cpu: 25m memory: 10Mi requests: cpu: 25m memory: 10Mi As it is, there is needless cpu throttling and likely needless OOM killing (theorized, not observed), especially with the 10Mi limit container.
Created attachment 1555286 [details] prom-throttle-graph.png
Unfortunately these are hardcoded into the prometheus-operator as of right now. We're going to go with the following strategy: * Patch the fork we ship in OpenShift to completely remove the limits. * Upstream since has added flags to configure these requests/limits, in addition we will add that the 0 value will make the limit/request be removed entirely, and this is what we'll switch to in a future OpenShift version.
The PR patch for our fork has been opened: https://github.com/openshift/prometheus-operator/pull/24
And the PR to allow configuring this to be disabled on upstream: https://github.com/coreos/prometheus-operator/pull/2560
The change that fixes this in the immediate situation is merged.
There is not available OCP payload which packages the fix to test, so postpone the testing until we have available payload
resource limits for alertmanager-main/prometheus-k8s statefulset are removed payload 4.0.0-0.nightly-2019-04-18-190537
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2019:0758