Bug 1121217
| Summary: | watchman takes up gigs of memory, times out on restart | |||
|---|---|---|---|---|
| Product: | OpenShift Online | Reporter: | Sten Turpin <sten> | |
| Component: | Containers | Assignee: | Jhon Honce <jhonce> | |
| Status: | CLOSED CURRENTRELEASE | QA Contact: | libra bugs <libra-bugs> | |
| Severity: | medium | Docs Contact: | ||
| Priority: | medium | |||
| Version: | 2.x | CC: | agrimm, bmeng, jhonce, jokerman, mmccomas | |
| Target Milestone: | --- | |||
| Target Release: | 2.x | |||
| Hardware: | Unspecified | |||
| OS: | Unspecified | |||
| Whiteboard: | ||||
| Fixed In Version: | Doc Type: | Bug Fix | ||
| Doc Text: | Story Points: | --- | ||
| Clone Of: | ||||
| : | 1127714 (view as bug list) | Environment: | ||
| Last Closed: | 2014-10-10 00:49:11 UTC | Type: | Bug | |
| Regression: | --- | Mount Type: | --- | |
| Documentation: | --- | CRM: | ||
| Verified Versions: | Category: | --- | ||
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | ||
| Cloudforms Team: | --- | Target Upstream Version: | ||
| Embargoed: | ||||
| Bug Depends On: | ||||
| Bug Blocks: | 1127714 | |||
|
Description
Sten Turpin
2014-07-18 16:29:58 UTC
Put in some debug messages to print memory information after each watchman plugin is invoked. The messages go in /var/log/messages and the debug mode can be enabled by setting an env var 'WATCHMAN_DEBUG' to true. Hopefully we can narrow it down which plugin causes the leak. https://github.com/openshift/origin-server/pull/5670 Checked on devenv-stage_946, the debug option was added to watchman config. # cat /etc/sysconfig/watchman WATCHMAN_DEBUG=true # tail -f /var/log/messages Aug 12 00:14:58 ip-10-99-163-60 watchman[21483]: Watchman debug is set to true Aug 12 00:14:58 ip-10-99-163-60 watchman[21483]: Memory : 36484, Plugin : JbossPlugin Aug 12 00:14:58 ip-10-99-163-60 watchman[21483]: Memory : 36560, Plugin : OomPlugin Aug 12 00:14:58 ip-10-99-163-60 watchman[21483]: Memory : 36608, Plugin : EnvPlugin Aug 12 00:14:58 ip-10-99-163-60 watchman[21483]: Memory : 36608, Plugin : ThrottlerPlugin Aug 12 00:14:58 ip-10-99-163-60 watchman[21483]: Memory : 36688, Plugin : GearStatePlugin Aug 12 00:14:58 ip-10-99-163-60 watchman[21483]: Memory : 36688, Plugin : MetricsPlugin Commits pushed to master at https://github.com/openshift/origin-server https://github.com/openshift/origin-server/commit/a0149a176f417aee7cc82190b90859158a38c09d Bug 1121217 - Symbol leak in Throttler cgroup code * Enhance debugging output * Remove to_sym in keys https://github.com/openshift/origin-server/commit/e00d653b764334fb5da6c2b301b5dd52629c9234 Bug 1121217 - Symbol leak in Throttler cgroup code * fix tests *** Bug 1096270 has been marked as a duplicate of this bug. *** Checked on devenv-stage_952, with about 80 gears running on a m3.medium node. With following config in sysconfig: # cat /etc/sysconfig/watchman GEAR_RETRIES=3 RETRY_DELAY=30 RETRY_PERIOD=60 STATE_CHANGE_DELAY=10 STATE_CHECK_PERIOD=1 THROTTLER_CHECK_PERIOD=1 OOM_CHECK_PERIOD=1 WATCHMAN_DEBUG=true Wathcman running with about 50% cpu usage and memory usage will not greater than 10%. And watchman can be restarted. Also do regression testing for throttle plugin, gear_state_plugin and oom_plugin. All of them working well. Move bug to verified. |