Help Desk Ticket Reference: https://na7.salesforce.com/500A0000009elou project_key: JBPAPP6 This is basically same issue reported in EAP 5.1.2: https://issues.jboss.org/browse/JBPAPP-8529 Core bridge doesn't failover in HornetQ live-backup setup. When killed the live server, backup HornetQ comes up live but never deploy a bridge. Configuration and DEBUG logs are attached.
Attachment: Added: failover-core-bridge-eap6.zip
Link: Added: This issue is related to JBPAPP-8529
Docs QE Status: Removed: NEW
The problem looks like org.hornetq.core.server.cluster.impl.ClusterManagerImpl.start() is invoked before org.hornetq.core.server.impl.HornetQServerImpl.initialisePart2() is invoked. When this happens the server tries to deploy the bridge before the queue is available and gives this kind of warning: WARN [org.hornetq.core.server.cluster.impl.ClusterManagerImpl] (Activation for server HornetQServerImpl::serverUUID=6a5f0557-90c7-11e2-bfe4-e56356058f3d) No queue found with name jms.queue.source bridge will not be deployed. See org.hornetq.core.server.impl.HornetQServerImpl.SharedStoreBackupActivation.run() for the problematic code. One thing to note here is that initialisePart2() actually invokes ClusterManagerImpl.start() so I doubt that SharedStoreBackupActivation.run() really needs to call ClusterManagerImpl.start(). What do you think, Clebert?
I think Justin is right. However the ClusterManagerImpl.start() is needed because the backup need to start the cluster connections in order to announce its connectors to the cluster. It seems the bridges need not be deployed at this stage.
Case is fixed, I just have to finish backporting its test case to 2.2 before sending PRs around.
EAP https://github.com/hornetq/hornetq/pull/967 AS7 https://github.com/hornetq/hornetq/pull/966 master https://github.com/hornetq/hornetq/pull/968
This issue was confirmed and fixed in all our 3 maintained branches, see commits master: https://github.com/hornetq/hornetq/commit/6acabc8cb361e27270abcd0a4659da65bf8e2166 AS7/EAP https://github.com/hornetq/hornetq/commit/dd512ad646499a91ef0b37c15f27ba46a42baf28
I'm trying to verify this bugzilla but I'm not able to figure out the test scenario. Could you provide me more info, please? Thanks, Mirek
Hi Mirek, You can try deploy a core bridge to a hornetq node (let's called it nodeA) that has a backup (nodeB) node up and running. Make the core bridge to forward messages from nodeA to some other node (called nodeC). Then kill nodeA and let the nodeB to be activated. Verify if the core-bridge can still work normally. Without the fix the core-bridge won't be able to deployed and started correctly.
Thanks for the quick update. This scenario I successfully tried yesterday. I just need to confirm that configuration of HQ bridge must be also on backup's (not only live's) configuration.
Yes the core-bridge configure needs to be in both live and backup configs.
For the purpose of verification of this bz I wrote new set of tests for testing HA with HQ core bridges and found a new issue - bz#959789 - but it's not this test scenario. Use case described in this bz is ok. Verified in EAP 6.1.0.ER6. Thanks to all involved!
Details inside vpn: https://post-office.corp.redhat.com/mailman/private/eap-pm-list/2013-July/msg00105.html