| Summary: | Broker does not handle mcollective server failover correctly | |||
|---|---|---|---|---|
| Product: | OpenShift Container Platform | Reporter: | Evgheni Dereveanchin <ederevea> | |
| Component: | Unknown | Assignee: | Rory Thrasher <rthrashe> | |
| Status: | CLOSED WONTFIX | QA Contact: | Johnny Liu <jialiu> | |
| Severity: | urgent | Docs Contact: | ||
| Priority: | high | |||
| Version: | 2.2.0 | CC: | aos-bugs, bleanhar, erich, fcami, jkaur, jokerman, mmccomas, rthrashe, saime, trogers | |
| Target Milestone: | --- | Keywords: | Reopened | |
| Target Release: | --- | |||
| Hardware: | Unspecified | |||
| OS: | Unspecified | |||
| Whiteboard: | ||||
| Fixed In Version: | Doc Type: | Bug Fix | ||
| Doc Text: | Story Points: | --- | ||
| Clone Of: | ||||
| : | 1401118 (view as bug list) | Environment: | ||
| Last Closed: | 2018-08-13 15:40:18 UTC | Type: | Bug | |
| Regression: | --- | Mount Type: | --- | |
| Documentation: | --- | CRM: | ||
| Verified Versions: | Category: | --- | ||
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | ||
| Cloudforms Team: | --- | Target Upstream Version: | ||
| Bug Depends On: | ||||
| Bug Blocks: | 1401118 | |||
|
Description
Evgheni Dereveanchin
2016-04-28 13:01:38 UTC
Handing off to Tim for triage. Temporarily moving this to modified to add it to the 2.2.11 errata. Investigation is still ongoing. Going to close this bug as a WONTFIX. This will be documented as a Known Issue in the RHOSE 2.2 asynchronous release notes.
In our testing, we have found that this issue is extremely rare and cannot be avoided with changes to openshift code. Currently, the recommendation is to ensure that activemq servers have adequate available memory with plenty of swap space in case of a spike in usage. OOM conditions on the activemq server are the most likely cause of this issue.
Additionally, it could help to configure stomp to randomly select a server from the pool, rather than always using the first in the list:
In /opt/rh/root/etc/mcollective/server.cfg on nodes and in
/opt/rh/root/etc/mcollective/client.cfg on brokers:
plugin.stomp.pool.randomize = true
|