Bug 1613008
| Field | Value | Field | Value |
|---|---|---|---|
| Summary: | [3.10,3.11] Memory leak on master node | | |
| Product: | OpenShift Container Platform | Reporter: | Vikas Laad <vlaad> |
| Component: | Master | Assignee: | Stefan Schimanski <sttts> |
| Status: | CLOSED DEFERRED | QA Contact: | Xingxing Xia <xxia> |
| Severity: | low | Docs Contact: | |
| Priority: | unspecified | | |
| Version: | 3.10.0 | CC: | aos-bugs, jeder, jokerman, mifiedle, mmccomas, schituku, vlaad |
| Target Milestone: | --- | | |
| Target Release: | 3.10.z | | |
| Hardware: | Unspecified | | |
| OS: | Unspecified | | |
| Whiteboard: | | | |
| Fixed In Version: | | Doc Type: | If docs needed, set a value |
| Doc Text: | | Story Points: | --- |
| Clone Of: | | Environment: | |
| Last Closed: | 2019-11-20 19:09:44 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | | Category: | --- |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | | | |
Description
Vikas Laad
2018-08-06 17:52:39 UTC
Created attachment 1473702 [details]
memory usage
I still have the cluster around; please let me know if you want to look at it. The blue line on the graph is memory usage on master.
Can we get Prometheus metrics from this cluster to see which process is causing the memory growth? Also some object counts (how many images, daemonsets, secrets, etc.).
Created attachment 1474034 [details]
memory on master
Please see the attached Prometheus data covering a few minutes. We were trying to configure Prometheus to retain data for a longer duration and lost the earlier data. I will update this bz again when we have more data. Please let me know if you need anything else.
root@ip-172-31-13-187: ~ # oc get project | wc -l
35
root@ip-172-31-13-187: ~ # oc get images | wc -l
219
root@ip-172-31-13-187: ~ # oc get ds --all-namespaces | wc -l
9
root@ip-172-31-13-187: ~ # oc get secrets --all-namespaces | wc -l
537
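A note on reading the counts above: with the default table output, `oc get` prints a column header as its first line whenever it returns any objects, so `wc -l` reports one more than the number of objects. The sketch below is only an illustration of that adjustment; the helper name `count_rows` and the `REPORTED` dictionary are hypothetical, and the numbers are copied from the commands above.

```python
def count_rows(listing: str) -> int:
    """Count object rows in `oc get` table output.

    Assumes default table output, where the first non-empty line is the
    column header. Empty output counts as zero objects.
    """
    lines = [ln for ln in listing.splitlines() if ln.strip()]
    return max(len(lines) - 1, 0)


# Line counts as reported by `wc -l` in this bug (header line included).
REPORTED = {"projects": 35, "images": 219, "daemonsets": 9, "secrets": 537}

# Object counts after dropping the one header line per listing.
ADJUSTED = {kind: n - 1 for kind, n in REPORTED.items()}

if __name__ == "__main__":
    for kind, n in ADJUSTED.items():
        print(f"{kind}: {n}")
```

Passing `--no-headers` to `oc get` would avoid the off-by-one at the source.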
Created attachment 1474681 [details]
few graphs from prometheus
Do we know which process is using the memory (apiserver, controllers, etc.)?
Created attachment 1476633 [details]
api-memory.png
Created attachment 1476634 [details]
controllers-1-memory.png
Created attachment 1476635 [details]
controller-2-memory.png
Created attachment 1476636 [details]
api-2-memory.png
Created attachment 1476637 [details]
api-3-memory.png
I'm seeing memory growth of ~10MB per day in some components. That amount of growth doesn't seem concerning to me. Are we seeing faster growth under certain tests/workloads?
Created attachment 1483355 [details]
3.11 Memory graph on master
This graph shows 9 days of memory usage.
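A per-day growth figure like the ~10MB per day mentioned earlier in this thread can be estimated from a multi-day series with a least-squares line fit. The following is a minimal sketch under stated assumptions, not the method used to produce the graphs here: the function name `growth_mb_per_day` is hypothetical, samples are assumed to be `(unix_seconds, resident_bytes)` pairs exported from whatever monitoring is in use, and a megabyte is taken as 10^6 bytes.

```python
from typing import Sequence, Tuple

SECONDS_PER_DAY = 86_400
BYTES_PER_MB = 1_000_000  # assumption: decimal megabytes


def growth_mb_per_day(samples: Sequence[Tuple[float, float]]) -> float:
    """Return the least-squares slope of memory samples in MB per day.

    Each sample is (unix_seconds, resident_bytes).
    """
    n = len(samples)
    if n < 2:
        raise ValueError("need at least two samples")
    mean_t = sum(t for t, _ in samples) / n
    mean_b = sum(b for _, b in samples) / n
    var_t = sum((t - mean_t) ** 2 for t, _ in samples)
    if var_t == 0:
        raise ValueError("samples must span more than one timestamp")
    cov = sum((t - mean_t) * (b - mean_b) for t, b in samples)
    bytes_per_second = cov / var_t
    return bytes_per_second * SECONDS_PER_DAY / BYTES_PER_MB


if __name__ == "__main__":
    # Synthetic data for illustration only: 9 daily samples, 500 MB baseline.
    demo = [(d * SECONDS_PER_DAY, 500e6 + d * 10e6) for d in range(9)]
    print(f"{growth_mb_per_day(demo):.1f} MB/day")
```

Fitting a line across the whole window is less sensitive to short spikes (for example garbage-collection sawtooth) than comparing only the first and last sample.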
Created attachment 1483356 [details]
3.11 pods memory graph on master
Created attachment 1486854 [details]
3.11 Memory graph on master based on docker
3.11 docker environment memory consumption graph on master node
Created attachment 1486856 [details]
3.11 pods memory graph on master based on docker
3.11 docker environment memory graphs of the pods that show a memory leak. The graphs cover 11 days.
OCP 3.6-3.10 is no longer on full support [1]. Marking CLOSED DEFERRED. If you have a customer case with a support exception or have reproduced on 3.11+, please reopen and include those details. When reopening, please set the Target Release to the appropriate version where needed.
[1]: https://access.redhat.com/support/policy/updates/openshift