Bug 695263
Summary: | qpidd broker crashes under stress test in qpid::broker::SessionHandler::getConnection (this=0x0) | ||
---|---|---|---|
Product: | Red Hat Enterprise MRG | Reporter: | Frantisek Reznicek <freznice> |
Component: | qpid-cpp | Assignee: | Ken Giusti <kgiusti> |
Status: | CLOSED ERRATA | QA Contact: | Frantisek Reznicek <freznice> |
Severity: | urgent | Docs Contact: | |
Priority: | urgent | ||
Version: | Development | CC: | esammons, iboverma, jneedle, kgiusti, tross |
Target Milestone: | 2.0 | ||
Target Release: | --- | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | |||
Fixed In Version: | qpid-cpp-mrg-0.10-4 | Doc Type: | Bug Fix |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2011-06-23 15:44:23 UTC | Type: | --- |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Description
Frantisek Reznicek
2011-04-11 08:22:19 UTC
The issue is confirmed on RHEL 4.9 / 5.6 / 6.1 i[36]86 / x86_64, see below: case RHEL5.6i Thread 1 (Thread 15453): #0 0x008cd9d7 in qpid::broker::SessionHandler::getConnection (this=0xa22ead58) at qpid/broker/SessionHandler.cpp:62 #1 0x008d2ee0 in qpid::broker::SessionState::AsyncCommandCompleter::scheduleMsgCompletion (this=0xa27421d8, cmd=..., requiresAccept=false, requiresSync=false) at qpid/broker/SessionState.cpp:494 #2 0x008d6c1a in qpid::broker::SessionState::IncompleteIngressMsgXfer::completed (this=0xaf71bd18, sync=false) at qpid/broker/SessionState.cpp:4 58 #3 0x0044ced1 in finishCompleter (this=0xaf7bc700) at /usr/src/debug/qpid-cpp-mrg-0.10/cpp/src/qpid/broker/AsyncCompletion.h:110 #4 qpid::broker::PersistableMessage::enqueueComplete (this=0xaf7bc700) at /usr/src/debug/qpid-cpp-mrg-0.10/cpp/src/qpid/broker/PersistableMessage.h:122 #5 0x004412f0 in mrg::msgstore::JournalImpl::wr_aio_cb (this=0xb05444b0, dtokl=std::vector of length 46, capacity 46 = {...}) at JournalImpl.cpp:538 #6 0x004c46bb in mrg::journal::wmgr::get_events (this=0xb05446f4, state=UNUSED, timeout=0x0, flush=false) at jrnl/wmgr.cpp:799 #7 0x004bf940 in mrg::journal::wmgr::write_flush (this=0xb05446f4) at jrnl/wmgr.cpp:650 #8 0x004bfbf5 in mrg::journal::wmgr::flush (this=0xb05446f4) at jrnl/wmgr.cpp:594 #9 0x0049321e in mrg::journal::jcntl::flush (this=0xb05444b4, block_till_aio_cmpl=false) at jrnl/jcntl.cpp:412 #10 0x00440886 in mrg::msgstore::JournalImpl::flush (this=0xb05444b0, block_till_aio_cmpl=false) at JournalImpl.cpp:471 #11 0x00440a5d in mrg::msgstore::JournalImpl::flushFire (this=0xb05444b0) at JournalImpl.cpp:517 #12 0x0044133e in mrg::msgstore::InactivityFireEvent::fire (this=0xb010de10) at JournalImpl.cpp:47 #13 0x002e9fc5 in qpid::sys::TimerTask::fireTask (this=0xb010de10) at qpid/sys/Timer.cpp:57 #14 0x002ea02f in qpid::sys::Timer::fire (this=0x995d548, t=...) at qpid/sys/Timer.cpp:188 #15 0x002eb413 in qpid::sys::Timer::run (this=0x995d548) at qpid/sys/Timer.cpp:123 #16 0x001f00c1 in qpid::sys::(anonymous namespace)::runRunnable (p=0x995d548) at qpid/sys/posix/Thread.cpp:35 #17 0x00586832 in start_thread () from /lib/libpthread.so.0 #18 0x00bb30ae in clone () from /lib/libc.so.6 case RHEL6.1x Core was generated by `qpidd --auth=no -p 5672 --data-dir=/root/qpid_stress_test/dd5672 --log-to-file='. Program terminated with signal 11, Segmentation fault. #0 qpid::broker::SessionHandler::getConnection (this=0x0) at qpid/broker/SessionHandler.cpp:62 62 ConnectionState& SessionHandler::getConnection() { return connection; } (gdb) bt #0 qpid::broker::SessionHandler::getConnection (this=0x0) at qpid/broker/SessionHandler.cpp:62 #1 0x0000003f333f569d in qpid::broker::SessionState::AsyncCommandCompleter::scheduleMsgCompletion (this=0x7fc37c70ce20, cmd=..., requiresAccept=<value optimized out>, requiresSync=<value optimized out>) at qpid/broker/SessionState.cpp:494 #2 0x0000003f333f6878 in qpid::broker::SessionState::IncompleteIngressMsgXfer::completed (this=0x15d1140, sync=<value optimized out>) at qpid/broker/SessionState.cpp:458 #3 0x00007fc39c309a5d in invokeCallback (this=<value optimized out>, dtokl=std::vector of length 38, capacity 38 = {...}) at /usr/src/debug/qpid-cpp-0.10/cpp/src/qpid/broker/AsyncCompletion.h:110 #4 finishCompleter (this=<value optimized out>, dtokl=std::vector of length 38, capacity 38 = {...}) at /usr/src/debug/qpid-cpp-0.10/cpp/src/qpid/broker/AsyncCompletion.h:151 #5 enqueueComplete (this=<value optimized out>, dtokl=std::vector of length 38, capacity 38 = {...}) at /usr/src/debug/qpid-cpp-0.10/cpp/src/qpid/broker/PersistableMessage.h:122 #6 mrg::msgstore::JournalImpl::wr_aio_cb (this=<value optimized out>, dtokl=std::vector of length 38, capacity 38 = {...}) at JournalImpl.cpp:538 #7 0x00007fc39c371077 in mrg::journal::wmgr::get_events (this=0x7fc37c83b740, state=mrg::journal::pmgr::UNUSED, timeout=<value optimized out>, flush=<value optimized out>) at jrnl/wmgr.cpp:799 #8 0x00007fc39c36cf2f in mrg::journal::wmgr::write_flush (this=0x7fc37c83b740) at jrnl/wmgr.cpp:650 #9 0x00007fc39c36d136 in mrg::journal::wmgr::flush (this=0x7fc37c83b740) at jrnl/wmgr.cpp:594 #10 0x00007fc39c347dc7 in mrg::journal::jcntl::flush (this=0x7fc37c83b3b8, block_till_aio_cmpl=false) at jrnl/jcntl.cpp:412 #11 0x00007fc39c30938f in mrg::msgstore::JournalImpl::flush (this=0x7fc37c83b3b0, block_till_aio_cmpl=<value optimized out>) at JournalImpl.cpp:471 #12 0x00007fc39c3094f8 in mrg::msgstore::JournalImpl::flushFire (this=0x7fc37c83b3b0) at JournalImpl.cpp:517 #13 0x00007fc39c30955b in mrg::msgstore::InactivityFireEvent::fire (this=0x7fc37c7e7b90) at JournalImpl.cpp:47 #14 0x0000003f32dfb1b1 in qpid::sys::Timer::run (this=0x10c5370) at qpid/sys/Timer.cpp:123 #15 0x0000003f32d34faa in qpid::sys::(anonymous namespace)::runRunnable (p=<value optimized out>) at qpid/sys/posix/Thread.cpp:35 #16 0x00000032c24077e1 in start_thread () from /lib64/libpthread.so.0 #17 0x00000032c20e678d in clone () from /lib64/libc.so.6 (gdb) quit Upstream JIRA: https://issues.apache.org/jira/browse/QPID-3197 ReviewBoard of proposed fix: https://reviews.apache.org/r/575/ Pushed to upstream trunk: http://svn.apache.org/viewvc?view=revision&revision=1091167 pending port to 0.10 branch. Merged to upstream 0.10 branch: http://svn.apache.org/viewvc?view=revision&revision=1091560 Merged to mgr_2.0.x branch: http://mrg1.lab.bos.redhat.com/git/?p=qpid.git;a=commitdiff;h=132a343e8043d9ba0b077584f8cc8cccd54de465 The issue has been fixed, stress test run over last weekend + Monday went ok on RHEL 4.9 / 5.6 / 6.1snap i386 / x86_64 on packages: python-qpid-0.10-1.el5.noarch python-qpid-qmf-0.10-6.el5.x86_64 qpid-cpp-client-0.10-4.el5.x86_64 qpid-cpp-client-devel-0.10-4.el5.x86_64 qpid-cpp-client-devel-docs-0.10-4.el5.x86_64 qpid-cpp-client-rdma-0.10-4.el5.x86_64 qpid-cpp-client-ssl-0.10-4.el5.x86_64 qpid-cpp-mrg-debuginfo-0.10-4.el5.x86_64 qpid-cpp-server-0.10-4.el5.x86_64 qpid-cpp-server-cluster-0.10-4.el5.x86_64 qpid-cpp-server-devel-0.10-4.el5.x86_64 qpid-cpp-server-rdma-0.10-4.el5.x86_64 qpid-cpp-server-ssl-0.10-4.el5.x86_64 qpid-cpp-server-store-0.10-4.el5.x86_64 qpid-cpp-server-xml-0.10-4.el5.x86_64 qpid-dotnet-0.4.738274-2.el5.x86_64 qpid-java-client-0.10-4.el5.noarch qpid-java-common-0.10-4.el5.noarch qpid-java-example-0.10-4.el5.noarch qpid-qmf-0.10-6.el5.x86_64 qpid-qmf-debuginfo-0.10-6.el5.x86_64 qpid-qmf-devel-0.10-6.el5.x86_64 qpid-tests-0.10-1.el5.noarch qpid-tools-0.10-4.el5.noarch rh-qpid-cpp-tests-0.10-4.el5.x86_64 ruby-qpid-qmf-0.10-6.el5.x86_64 sesame-0.10-1.el5.x86_64 sesame-debuginfo-0.10-1.el5.x86_64 -> VERIFIED An advisory has been issued which should help the problem described in this bug report. This report is therefore being closed with a resolution of ERRATA. For more information on therefore solution and/or where to find the updated files, please follow the link below. You may reopen this bug report if the solution does not work for you. http://rhn.redhat.com/errata/RHEA-2011-0890.html |