Bug 970668
Summary: | qpid segfaults in qpid::ha::BrokerReplicator::disconnected | |||
---|---|---|---|---|
Product: | Red Hat Enterprise MRG | Reporter: | Pavel Moravec <pmoravec> | |
Component: | qpid-cpp | Assignee: | Alan Conway <aconway> | |
Status: | CLOSED ERRATA | QA Contact: | Irina Boverman <iboverma> | |
Severity: | high | Docs Contact: | ||
Priority: | high | |||
Version: | 2.3 | CC: | aconway, freznice, iboverma, jross, lzhaldyb, mcressma, mrgqe-bugs, mtoth | |
Target Milestone: | 3.0 | Keywords: | OtherQA, Reopened, TestCaseProvided | |
Target Release: | --- | |||
Hardware: | All | |||
OS: | Linux | |||
Whiteboard: | ||||
Fixed In Version: | qpid-cpp-0.22-30 | Doc Type: | Bug Fix | |
Doc Text: | Story Points: | --- | ||
Clone Of: | ||||
: | 1030608 (view as bug list) | Environment: | ||
Last Closed: | 2014-09-24 15:08:08 UTC | Type: | Bug | |
Regression: | --- | Mount Type: | --- | |
Documentation: | --- | CRM: | ||
Verified Versions: | Category: | --- | ||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | ||
Cloudforms Team: | --- | Target Upstream Version: | ||
Embargoed: | ||||
Bug Depends On: | ||||
Bug Blocks: | 1030608 |
Description
Pavel Moravec
2013-06-04 14:36:47 UTC
Reproducer: - having usual cluster.conf (as per docs) - relocating primary service (this can happen automatically after a primary broker failure / stop): while true; do for i in 1 2 3; do clusvcadm -r service:qpidd-primary-service service:node${i}-qpidd-service; sleep 3; clustat; done; done What's wrong: (gdb) frame 8 #8 qpid::ha::BrokerReplicator::disconnected (this=0x1dba220) at qpid/ha/BrokerReplicator.cpp:878 878 QPID_LOG(info, logPrefix << "Disconnected from " << primary); (gdb) p logPrefix $11 = "\000\000\000", <incomplete sequence \375> (gdb) i.e. logPrefix is null. Why?? Possibly a race where BrokerReplicator is disconnected after being deleted, it might be fixed on trunk I recall a fix along those lines. Can you reproduce easily? (In reply to Alan Conway from comment #4) > Possibly a race where BrokerReplicator is disconnected after being deleted, > it might be fixed on trunk I recall a fix along those lines. Can you > reproduce easily? When I generate lots of clients (dis)connections like using below script, then it's almost certainty to get the segfault in 30 minutes on one node: min=3 while true; do for i in 1 2 3; do if [ $(ps aux | grep qpid-send | grep -c "node${i}") -lt $min ]; then qpid-send -a "amq.fanout" -m 1 -b node${i} --connection-option="{reconnect:true, 'reconnect_limit':'1000', 'reconnect_interval_max':0 }" & fi done done I can't reproduce on the current 0.22-mrg which is at 6006677 bz1019645: QPID-5239: preserve type of 'annotations' i.e. application properties added by broker Please confirm if the bug still exists. (In reply to Alan Conway from comment #6) > I can't reproduce on the current 0.22-mrg which is at > > 6006677 bz1019645: QPID-5239: preserve type of 'annotations' i.e. > application properties added by broker > > Please confirm if the bug still exists. Indeed, I can't replicate it further as well. Closing it. Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. http://rhn.redhat.com/errata/RHEA-2014-1296.html |