Bug 468821
Summary: | SIGSEGV running kolchak (from qpid::sys::SystemInfo::getLocalIpAddresses) | ||
---|---|---|---|
Product: | Red Hat Enterprise MRG | Reporter: | Gordon Sim <gsim> |
Component: | qpid-cpp | Assignee: | Andrew Stitcher <astitcher> |
Status: | CLOSED DUPLICATE | QA Contact: | Kim van der Riet <kim.vdriet> |
Severity: | urgent | Docs Contact: | |
Priority: | urgent | ||
Version: | 1.0 | CC: | astitcher, duck |
Target Milestone: | 1.1 | ||
Target Release: | --- | ||
Hardware: | All | ||
OS: | Linux | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | Bug Fix | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2008-11-18 14:24:25 UTC | Type: | --- |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Description
Gordon Sim
2008-10-28 09:15:51 UTC
Also tracked by https://issues.apache.org/jira/browse/QPID-1415. It's not clear to me that this represents a real bug in qpidd as the end of the log seems to suggest that this might be a bug in valgrind itself: --9801-- VALGRIND INTERNAL ERROR: Valgrind received a signal 11 (SIGSEGV) - exiting --9801-- si_code=80; Faulting address: 0x0; sp: 0x404EA2628 valgrind: m_signals.c:1772 (sync_signalhandler): Assertion 'tid != 0' failed. ==9801== at 0x380176D7: report_and_quit (m_libcassert.c:136) ==9801== by 0xDEADBEEFDEADBEEE: ??? ==9801== by 0xDEADBEEFDEADBEEE: ??? ==9801== by 0xDEADBEEFDEADBEEE: ??? ==9801== by 0xDEADBEEFDEADBEEE: ??? ==9801== by 0xDEADBEEFDEADBEEE: ??? ==9801== by 0xDEADBEEFDEADBEEE: ??? ==9801== by 0xDEADBEEFDEADBEEE: ??? ==9801== by 0xDEADBEEFDEADBEEE: ??? ==9801== by 0xDEADBEEFDEADBEEE: ??? ==9801== by 0xDEADBEEFDEADBEEE: ??? ==9801== by 0xDEADBEEFDEADBEEE: ??? ==9801== by 0xDEADBEEFDEADBEEE: ??? ==9801== by 0xDEADBEEFDEADBEEE: ??? ==9801== by 0xDEADBEEFDEADBEEE: ??? ==9801== by 0xDEADBEEFDEADBEEE: ??? ==9801== by 0xDEADBEEFDEADBEEE: ??? ==9801== by 0xDEADBEEFDEADBEEE: ??? ==9801== by 0xDEADBEEFDEADBEEE: ??? ==9801== by 0xDEADBEEFDEADBEEE: ??? ==9801== by 0xDEADBEEFDEADBEEE: ??? ==9801== by 0xDEADBEEFDEADBEEE: ??? ==9801== by 0xDEADBEEFDEADBEEE: ??? ==9801== by 0xDEADBEEFDEADBEEE: ??? ==9801== by 0xDEADBEEFDEADBEEE: ??? ==9801== by 0xDEADBEEFDEADBEEE: ??? ==9801== by 0xDEADBEEFDEADBEEE: ??? ==9801== by 0xDEADBEEFDEADBEEE: ??? So I'd suggest running the same soak test *without* valgrind and with core file size set to unlimited. Then if we get a similar SIGSEGV and stack trace we should investigate again. The fact that the stack trace (first one) has getKnownBrokersImpl() near the top makes me suspect that the fix to BZ471247 *may* resolve this in practice. I can't quite work out why concurrent calls to qpid::sys::SystemInfo::getLocalIpAddresses would cause problems, so haven't marked this as a dup. qpid::sys::SystemInfo::getLocalIpAddresses contains a call to inet_ntoa() which returns its result in a statically allocated buffer (and this indeed) is where the crash occurs so multiple simultaneous calls to getLocalAddresses will indeed cause problems. *** This bug has been marked as a duplicate of bug 471247 *** |