Bug 1105225
Summary: | Watchman OOM plugin fails to restart gears | ||
---|---|---|---|
Product: | OpenShift Container Platform | Reporter: | Brenton Leanhardt <bleanhar> |
Component: | Containers | Assignee: | Brenton Leanhardt <bleanhar> |
Status: | CLOSED ERRATA | QA Contact: | libra bugs <libra-bugs> |
Severity: | medium | Docs Contact: | |
Priority: | high | ||
Version: | 2.1.0 | CC: | adellape, agrimm, anli, jkeck, jokerman, libra-onpremise-devel, mmccomas, xjia |
Target Milestone: | --- | Keywords: | Upstream |
Target Release: | --- | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | |||
Fixed In Version: | openshift-origin-node-util-1.22.11.1-1.el6op | Doc Type: | Bug Fix |
Doc Text: |
In certain scenarios when using the Watchman OOM plug-in, gears would fail to be restarted after running out of memory. This bug fix addresses several Watchman issues, and Watchman now restarts gears that have run out of memory, as expected.
|
Story Points: | --- |
Clone Of: | 1104902 | Environment: | |
Last Closed: | 2014-08-04 13:27:19 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: | |||
Bug Depends On: | 1096863, 1104902 | ||
Bug Blocks: |
Description
Brenton Leanhardt
2014-06-05 15:43:28 UTC
Upstream commits: commit 115b72f8260e4fea936161d22d6e3dab8407e46e Author: Andy Grimm <agrimm> Date: Thu Jun 5 10:37:34 2014 -0400 Bug 1104902 - Fix several bugs in OOM Plugin app restarts Several code paths had not been properly tested in this code, and various typos and logic errors have been corrected. commit 4a5e999a9561ff0aef01b203f360e6a2b87be0cc Author: Jhon Honce <jhonce> Date: Tue Jun 17 11:55:45 2014 -0700 Bug 1104902 - Fix unit tests Verified and pass on puddle-2-1-2014-07-15 Highlight: in puddle-2-1-2014-07-15,the oom_plugin was imported to OSE2.1z. oo-cgroup-disable/enable must be executed for all containers to enable this plugin. 1. rhc app create jbosseap jbosseap 2. run application to swallow memory until out of memory. 3. Watch /var/log/message, the task was killed by kernel for resource limit. Jul 16 04:07:12 node kernel: Task in /openshift/53c62dd24cfeff6c83000001 killed as a result of limit of /openshift/53c62dd24cfeff6c83000001 Jul 16 04:07:12 node kernel: memory: usage 524224kB, limit 524288kB, failcnt 21135 Jul 16 04:07:12 node kernel: memory+swap: usage 626688kB, limit 626688kB, failcnt 27 Verified on puddle-2-1-2014-07-15,Run same steps as per above. 1)OOM Plugin works. Jul 16 06:04:15 node dhclient[1016]: bound to 192.168.55.38 -- renewal in 51 seconds. Jul 16 06:04:31 node watchman[24846]: OOM Plugin: Found gear 53c64d4e4cfeff1e1b00003f under OOM. Jul 16 06:04:31 node watchman[24846]: OOM Plugin: Increasing memory for gear 53c64d4e4cfeff1e1b00003f to 705901363 and restarting 2)The gears was restarted by watchman. July 16 06:04:41 INFO AdminGearsControl: initialized for gear(s) 53c64d4e4cfeff1e1b00003f AdminGearsControl: initialized with timeout 360s AdminGearsControl: initialized with 1 process per CPU July 16 06:04:42 INFO 53c64d4e4cfeff1e1b00003f start against 'jbosseap' July 16 06:05:07 INFO Shell command '/sbin/runuser -s /bin/sh 53c64d4e4cfeff1e1b00003f ****** Found 127.2.247.129:8080 listening port Found 127.2.247.129:9999 listening port ~/jbosseap/standalone/deployments ~/jbosseap ~/jbosseap Artifacts deployed: ./ROOT.war July 16 06:05:07 INFO (20525) Starting gear 53c64d4e4cfeff1e1b00003f ... [ OK ] Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. http://rhn.redhat.com/errata/RHBA-2014-0999.html |