Bug 891689
| Summary: | New HA regularly shutting down active node | ||||||
|---|---|---|---|---|---|---|---|
| Product: | Red Hat Enterprise MRG | Reporter: | Pavel Moravec <pmoravec> | ||||
| Component: | qpid-cpp | Assignee: | Alan Conway <aconway> | ||||
| Status: | CLOSED CURRENTRELEASE | QA Contact: | mick <mgoulish> | ||||
| Severity: | high | Docs Contact: | |||||
| Priority: | high | ||||||
| Version: | 2.3 | CC: | esammons, freznice, iboverma, jross, mcressma, mgoulish, mtoth | ||||
| Target Milestone: | 3.0 | Keywords: | OtherQA | ||||
| Target Release: | --- | ||||||
| Hardware: | All | ||||||
| OS: | Linux | ||||||
| Whiteboard: | |||||||
| Fixed In Version: | qpid-cpp-0.22 | Doc Type: | Bug Fix | ||||
| Doc Text: | Story Points: | --- | |||||
| Clone Of: | |||||||
| : | 955711 (view as bug list) | Environment: | |||||
| Last Closed: | 2015-01-21 12:56:11 UTC | Type: | Bug | ||||
| Regression: | --- | Mount Type: | --- | ||||
| Documentation: | --- | CRM: | |||||
| Verified Versions: | Category: | --- | |||||
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||
| Cloudforms Team: | --- | Target Upstream Version: | |||||
| Embargoed: | |||||||
| Bug Depends On: | |||||||
| Bug Blocks: | 955711 | ||||||
| Attachments: |
Description
Pavel Moravec
2013-01-03 17:33:31 UTC
Created attachment 672138 [details]
qpid traces
When testing with manually started qpidd / qpidd-primary services (i.e. rgmanager off, cman on), no issue appears and the brokers are stable. But why can rgmanager affect this? If a process / service it manages is running, it should not intervene. Alan, is this expected?

Background: if a broker is started when there is already an active primary, that broker cannot be promoted until it connects and becomes a READY backup; otherwise messages can be lost. If the primary is killed before that and rgmanager tries to promote the unready backup, the backup will die with that error message, so that rgmanager can hopefully promote a broker that is ready. It shouldn't be happening so frequently, however, so this probably bears investigation.

Fixed
http://mrg1.lab.bos.redhat.com/cgit/qpid.git/commit/?h=0.18-mrg-aconway-bz891689&id=d0262927d32bdd043125373a7f3a969e7600713d

commit d0262927d32bdd043125373a7f3a969e7600713d
Author: Alan Conway <aconway>
Commit: Alan Conway <aconway>

Bug 891689 - New HA regularly shutting down active node

qpid-primary script was incorrect and failing on status calls, causing the broker to be restarted by rgmanager.

Trunk checkin svn rev: 1449870

With these qpid pkgs:

qpid-qmf-0.22-24.el6.x86_64
qpid-cpp-client-devel-0.22-29.el6.x86_64
qpid-proton-c-0.5-9.el6.x86_64
qpid-cpp-server-0.22-29.el6.x86_64
python-qpid-0.22-8.el6.noarch
qpid-tools-0.22-7.el6.noarch
qpid-cpp-server-ha-0.22-29.el6.x86_64
qpid-cpp-client-0.22-29.el6.x86_64
python-qpid-qmf-0.22-24.el6.x86_64

with cman and rgmanager running on 3 separate physical boxes (mrg3, mrg25, mrg28): no broker shutdown, no error messages after 30 mins with HA cluster running.

--> verified.
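
For readers unfamiliar with the promotion rule described in the background note above, here is a minimal Python sketch of it. It is only an illustrative model, not qpidd or rgmanager code: the `Role` enum, the `Broker` class, `PromotionError`, and `promote_first_ready` are all hypothetical names, and the state set is simplified to three states.

```python
from enum import Enum


class Role(Enum):
    # Simplified, hypothetical state set; the real broker has more states.
    JOINING = "joining"   # started, not yet caught up with the primary
    READY = "ready"       # backup that is safe to promote
    ACTIVE = "active"     # current primary


class PromotionError(Exception):
    """Raised when a backup that is not READY is asked to become primary."""


class Broker:
    def __init__(self, name, role=Role.JOINING):
        self.name = name
        self.role = role

    def promote(self):
        # Promoting an unready backup could lose messages, so refuse.
        # In the scenario from the report, the broker exits at this point
        # so the cluster manager can try a different node.
        if self.role is not Role.READY:
            raise PromotionError(f"{self.name} is {self.role.value}, not ready")
        self.role = Role.ACTIVE


def promote_first_ready(brokers):
    """Model of a cluster manager trying each node until one accepts promotion."""
    for broker in brokers:
        try:
            broker.promote()
            return broker
        except PromotionError:
            continue  # this node refused; try the next one
    return None  # no node was ready
```

For example, given `[Broker("a"), Broker("b", Role.READY)]`, the sketch skips `a`, which is still joining, and promotes `b`. If no broker is ready, it returns `None`, which corresponds to the case where the service cannot be recovered until a backup catches up.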