Bug 1484028
Summary: | qdrouterd segfaults on frequent goferd reconnect | | |
---|---|---|---|
Product: | Red Hat Satellite | Reporter: | Pavel Moravec <pmoravec> |
Component: | Qpid | Assignee: | Mike Cressman <mcressma> |
Status: | CLOSED DUPLICATE | QA Contact: | Katello QA List <katello-qa-list> |
Severity: | high | Docs Contact: | |
Priority: | high | | |
Version: | 6.2.11 | CC: | bbuckingham, pmoravec |
Target Milestone: | Unspecified | | |
Target Release: | Unused | | |
Hardware: | x86_64 | | |
OS: | Linux | | |
Whiteboard: | | | |
Fixed In Version: | | Doc Type: | If docs needed, set a value |
Doc Text: | | Story Points: | --- |
Clone Of: | | Environment: | |
Last Closed: | 2017-09-15 20:19:16 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | | Category: | --- |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: | | | |
Description
Pavel Moravec
2017-08-22 13:26:43 UTC
Sorry for the confusion; in fact there must be two different segfaults, hence two BZs. The second BZ will follow once I reproduce it (its backtrace should match the KCS / ENTMQ JIRA, but the reproducer will differ).

This one follows the reproducer described above but generates a different backtrace / coredump:

..
Program terminated with signal 11, Segmentation fault.
#0  qd_link_close (link=0x0) at /usr/src/debug/qpid-dispatch-0.4/src/container.c:994
994         if (link->pn_link)
(gdb) bt full
#0  qd_link_close (link=0x0) at /usr/src/debug/qpid-dispatch-0.4/src/container.c:994
No locals.
#1  0x00007f8d0f62ea7b in qd_router_detach_routed_link (context=0x7f8cfc0a1ed0, discard=<optimized out>) at /usr/src/debug/qpid-dispatch-0.4/src/router_node.c:1253
        link = 0x0
        pn_link = <optimized out>
        ld = 0x7f8cfc0a1ed0
#2  0x00007f8d0f6320b6 in invoke_deferred_calls (conn=conn@entry=0x7f8cfc00cb30, discard=discard@entry=false) at /usr/src/debug/qpid-dispatch-0.4/src/server.c:304
        calls = {head = 0x0, tail = <optimized out>, scratch = 0x1b3fde0, size = <optimized out>}
        dc = 0x1b3fde0
#3  0x00007f8d0f63293d in process_connector (cxtr=0x7f8cfc010290, qd_server=0x1a9cff0) at /usr/src/debug/qpid-dispatch-0.4/src/server.c:397
        ctx = 0x7f8cfc00cb30
        events = <optimized out>
        passes = <optimized out>
#4  thread_run (arg=<optimized out>) at /usr/src/debug/qpid-dispatch-0.4/src/server.c:626
        work_done = 1
        timer = <optimized out>
        thread = <optimized out>
        work = <optimized out>
        cxtr = 0x7f8cfc010290
        conn = <optimized out>
        ctx = <optimized out>
        error = <optimized out>
        poll_result = <optimized out>
        qd_server = 0x1a9cff0
#5  0x00007f8d0f1a3e25 in start_thread (arg=0x7f8cf9dfa700) at pthread_create.c:308
        __res = <optimized out>
        pd = 0x7f8cf9dfa700
        now = <optimized out>
        unwind_buf = {cancel_jmp_buf = {{jmp_buf = {140243464333056, 6390606335961769466, 0, 140243464333760, 140243464333056, 0, -6361882968312732166, -6361931856609892870}, mask_was_saved = 0}}, priv = {pad = {0x0, 0x0, 0x0, 0x0}, data = {prev = 0x0, cleanup = 0x0, canceltype = 0}}}
        not_first_call = <optimized out>
        pagesize_m1 = <optimized out>
        sp = <optimized out>
        freesize = <optimized out>
#6  0x00007f8d0e6f934d in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:113
No locals.
(gdb)

Note that qd_link_close is called with link=0x0.

(The reproducer triggers the segfault _after_ iterating over the existing queues, once qpidd starts to complain "Error on attach: Node not found: pulp.pmoravec.2.83" or similar. Even after these errors start to appear, you can still hit the segfault.)

The bug is reproducible with qpid-proton-0.9-20.el7 as well. I don't have a system with qpid-proton-0.16.0-6.el7 / qpid-dispatch-0.8.0-9 or -10 now; I will try later on.

Unable to reproduce within Satellite, and the relevant Satellite segfault is tracked under https://bugzilla.redhat.com/show_bug.cgi?id=1491160 . So closing this as WONTFIX (no fix required for Sat).

(Technically, as was just realized, the segfault is fixed via the 0.4-29 build / 1535891.)

*** This bug has been marked as a duplicate of bug 1535891 ***
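
For readers unfamiliar with the failure mode in frame #0 of the backtrace above, here is a minimal C sketch of a NULL dereference of that shape and a defensive guard against it. It is an illustration only: the types fake_link_t and fake_pn_link_t and the function guarded_link_close are hypothetical stand-ins, not the real qpid-dispatch structures, and this is not claimed to be the change that went into the 0.4-29 build.

```c
#include <stddef.h>

/* Hypothetical stand-ins for illustration; NOT the real qpid-dispatch types. */
typedef struct { int closed; } fake_pn_link_t;
typedef struct { fake_pn_link_t *pn_link; } fake_link_t;

/*
 * Close the underlying link if there is one.
 * Returns 1 if a link was closed, 0 if there was nothing to close.
 *
 * Without the first check, evaluating link->pn_link with link == NULL
 * is the same kind of dereference that raised signal 11 in frame #0.
 */
int guarded_link_close(fake_link_t *link)
{
    if (link == NULL)           /* caller passed link=0x0 */
        return 0;
    if (link->pn_link == NULL)  /* wrapper exists but has no underlying link */
        return 0;
    link->pn_link->closed = 1;
    return 1;
}
```

Under these assumptions, calling guarded_link_close(NULL) returns 0 instead of crashing, while a call on a populated fake_link_t marks the underlying link closed and returns 1. Whether the caller (frame #1 here) or the callee should own that check is a design choice this sketch does not settle.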