Bug 1283139
Summary: | glusterd crashed | ||
---|---|---|---|
Product: | [Red Hat Storage] Red Hat Gluster Storage | Reporter: | RajeshReddy <rmekala> |
Component: | core | Assignee: | Bug Updates Notification Mailing List <rhs-bugs> |
Status: | CLOSED DUPLICATE | QA Contact: | Anoop <annair> |
Severity: | unspecified | Docs Contact: | |
Priority: | unspecified | ||
Version: | rhgs-3.1 | CC: | amukherj, anekkunt, mzywusko, rhs-bugs, rmekala, storage-qa-internal |
Target Milestone: | --- | Keywords: | ZStream |
Target Release: | --- | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | Bug Fix | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2015-11-19 10:44:01 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Description
RajeshReddy
2015-11-18 11:13:49 UTC
Could you attach the sosreport? core file is a mandate to debug any crashes. sosreport and core are available @ /home/repo/sosreports/bug.1283139 on rhsqe-repo.lab.eng.blr.redhat.com Bt in description was showed incorrectly due to mismatch between core file and specified executable file. I got set up from Rajesh Reddy , below is the back trash for crash 0x00007f3ab1d9b0ad in rcu_read_lock_bp () from /lib64/liburcu-bp.so.1 #1 0x00007f3ab242f342 in gd_peerinfo_find_from_hostname (hoststr=hoststr@entry=0x7f3aa000cd70 "rhs-client19.lab.eng.blr.redhat.com") at glusterd-peer-utils.c:639 #2 0x00007f3ab242f81d in glusterd_peerinfo_find_by_hostname (hoststr=hoststr@entry=0x7f3aa000cd70 "rhs-client19.lab.eng.blr.redhat.com") at glusterd-peer-utils.c:111 #3 0x00007f3ab242fa09 in glusterd_hostname_to_uuid (hostname=hostname@entry=0x7f3aa000cd70 "rhs-client19.lab.eng.blr.redhat.com", uuid=uuid@entry=0x7f3aaded4ba0 "") at glusterd-peer-utils.c:155 #4 0x00007f3ab2380cdc in glusterd_volume_brickinfo_get (uuid=uuid@entry=0x0, hostname=0x7f3aa000cd70 "rhs-client19.lab.eng.blr.redhat.com", path=0x7f3aa000d170 "/rhs/brick4/afr2x2", volinfo=volinfo@entry=0x7f3abfb11ea0, brickinfo=brickinfo@entry=0x7f3aaded4c88) at glusterd-utils.c:1310 #5 0x00007f3ab2380d68 in glusterd_volume_brickinfo_get_by_brick (brick=brick@entry=0x7f3aa0005b85 "rhs-client19.lab.eng.blr.redhat.com:/rhs/brick4/afr2x2", volinfo=0x7f3abfb11ea0, brickinfo=brickinfo@entry=0x7f3aaded4c88) at glusterd-utils.c:1354 #6 0x00007f3ab235b36f in get_brickinfo_from_brickid (brickinfo=0x7f3aaded4c88, brickid=0x7f3a9c026e90 "dbf7ab58-21a1-4951-b8ae-44e3aaa4c0ea:rhs-client19.lab.eng.blr.redhat.com:/rhs/brick4/afr2x2") at glusterd-handler.c:4816 #7 __glusterd_brick_rpc_notify (rpc=rpc@entry=0x7f3a9c026f40, mydata=mydata@entry=0x7f3a9c026e90, event=event@entry=RPC_CLNT_DISCONNECT, data=data@entry=0x0) at glusterd-handler.c:4842 #8 0x00007f3ab235e46c in glusterd_big_locked_notify (rpc=0x7f3a9c026f40, mydata=0x7f3a9c026e90, event=RPC_CLNT_DISCONNECT, data=0x0, notify_fn=0x7f3ab235b270 <__glusterd_brick_rpc_notify>) at glusterd-handler.c:71 #9 0x00007f3abd5ccc60 in rpc_clnt_notify (trans=<optimized out>, mydata=0x7f3a9c026f70, event=RPC_TRANSPORT_DISCONNECT, data=0x7f3a9c02a0e0) at rpc-clnt.c:874 #10 0x00007f3abd5c8883 in rpc_transport_notify (this=this@entry=0x7f3a9c02a0e0, event=event@entry=RPC_TRANSPORT_DISCONNECT, data=data@entry=0x7f3a9c02a0e0) at rpc-transport.c:545 #11 0x00007f3ab00b13a2 in socket_event_poll_err (this=0x7f3a9c02a0e0) at socket.c:1151 #12 socket_event_handler (fd=fd@entry=24, idx=idx@entry=15, data=0x7f3a9c02a0e0, poll_in=1, poll_out=0, poll_err=<optimized out>) at socket.c:2356 #13 0x00007f3abd85f8ba in event_dispatch_epoll_handler (event=0x7f3aaded4e80, event_pool=0x7f3abfa8bd10) at event-epoll.c:575 #14 event_dispatch_epoll_worker (data=0x7f3abfb078b0) at event-epoll.c:678 #15 0x00007f3abc666df5 in start_thread (arg=0x7f3aaded5700) at pthread_create.c:308 #16 0x00007f3abbfad1ad in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:113 (gdb) t a a bt Thread 7 (Thread 0x7f3aae6d6700 (LWP 25006)): #0 pthread_cond_wait@@GLIBC_2.3.2 () at ../nptl/sysdeps/unix/sysv/linux/x86_64/pthread_cond_wait.S:185 #1 0x00007f3ab2401133 in hooks_worker (args=<optimized out>) at glusterd-hooks.c:534 #2 0x00007f3abc666df5 in start_thread (arg=0x7f3aae6d6700) at pthread_create.c:308 #3 0x00007f3abbfad1ad in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:113 Thread 6 (Thread 0x7f3ab36a4700 (LWP 24879)): #0 pthread_cond_timedwait@@GLIBC_2.3.2 () at ../nptl/sysdeps/unix/sysv/linux/x86_64/pthread_cond_timedwait.S:238 #1 0x00007f3abd841f08 in syncenv_task (proc=proc@entry=0x7f3abfa9acb0) at syncop.c:607 #2 0x00007f3abd842c40 in syncenv_processor (thdata=0x7f3abfa9acb0) at syncop.c:699 #3 0x00007f3abc666df5 in start_thread (arg=0x7f3ab36a4700) at pthread_create.c:308 #4 0x00007f3abbfad1ad in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:113 Thread 5 (Thread 0x7f3ab2ea3700 (LWP 24880)): #0 pthread_cond_timedwait@@GLIBC_2.3.2 () at ../nptl/sysdeps/unix/sysv/linux/x86_64/pthread_cond_timedwait.S:238 #1 0x00007f3abd841f08 in syncenv_task (proc=proc@entry=0x7f3abfa9b070) at syncop.c:607 #2 0x00007f3abd842c40 in syncenv_processor (thdata=0x7f3abfa9b070) at syncop.c:699 #3 0x00007f3abc666df5 in start_thread (arg=0x7f3ab2ea3700) at pthread_create.c:308 #4 0x00007f3abbfad1ad in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:113 Thread 4 (Thread 0x7f3ab3ea5700 (LWP 24878)): #0 0x00007f3ab1b94f04 in _fini () from /lib64/liburcu-cds.so.1 #1 0x00007f3abdab3b78 in _dl_fini () at dl-fini.c:258 #2 0x00007f3abbeefe49 in __run_exit_handlers (status=status@entry=0, listp=0x7f3abc2716c8 <__exit_funcs>, run_list_atexit=run_list_atexit@entry=true) at exit.c:77 #3 0x00007f3abbeefe95 in __GI_exit (status=status@entry=0) at exit.c:99 #4 0x00007f3abdcd1733 in cleanup_and_exit (signum=<optimized out>) at glusterfsd.c:1293 #5 0x00007f3abdcd1855 in glusterfs_sigwaiter (arg=<optimized out>) at glusterfsd.c:2014 #6 0x00007f3abc666df5 in start_thread (arg=0x7f3ab3ea5700) at pthread_create.c:308 #7 0x00007f3abbfad1ad in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:113 Thread 3 (Thread 0x7f3ab46a6700 (LWP 24877)): #0 0x00007f3abc66d99d in nanosleep () at ../sysdeps/unix/syscall-template.S:81 #1 0x00007f3abd81d944 in gf_timer_proc (ctx=0x7f3abfa6d010) at timer.c:205 #2 0x00007f3abc666df5 in start_thread (arg=0x7f3ab46a6700) at pthread_create.c:308 #3 0x00007f3abbfad1ad in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:113 Thread 2 (Thread 0x7f3abdcb4780 (LWP 24876)): #0 0x00007f3abc667f27 in pthread_join (threadid=139890002843392, thread_return=thread_return@entry=0x0) at pthread_join.c:92 #1 0x00007f3abd85fc18 in event_dispatch_epoll (event_pool=0x7f3abfa8bd10) at event-epoll.c:762 #2 0x00007f3abdcce747 in main (argc=5, argv=0x7ffef53b2bd8) at glusterfsd.c:2350 Thread 1 (Thread 0x7f3aaded5700 (LWP 25007)): #0 0x00007f3ab1d9b0ad in rcu_read_lock_bp () from /lib64/liburcu-bp.so.1 #1 0x00007f3ab242f342 in gd_peerinfo_find_from_hostname (hoststr=hoststr@entry=0x7f3aa000cd70 "rhs-client19.lab.eng.blr.redhat.com") at glusterd-peer-utils.c:639 #2 0x00007f3ab242f81d in glusterd_peerinfo_find_by_hostname (hoststr=hoststr@entry=0x7f3aa000cd70 "rhs-client19.lab.eng.blr.redhat.com") at glusterd-peer-utils.c:111 #3 0x00007f3ab242fa09 in glusterd_hostname_to_uuid (hostname=hostname@entry=0x7f3aa000cd70 "rhs-client19.lab.eng.blr.redhat.com", uuid=uuid@entry=0x7f3aaded4ba0 "") at glusterd-peer-utils.c:155 #4 0x00007f3ab2380cdc in glusterd_volume_brickinfo_get (uuid=uuid@entry=0x0, hostname=0x7f3aa000cd70 "rhs-client19.lab.eng.blr.redhat.com", path=0x7f3aa000d170 "/rhs/brick4/afr2x2", volinfo=volinfo@entry=0x7f3abfb11ea0, brickinfo=brickinfo@entry=0x7f3aaded4c88) at glusterd-utils.c:1310 #5 0x00007f3ab2380d68 in glusterd_volume_brickinfo_get_by_brick (brick=brick@entry=0x7f3aa0005b85 "rhs-client19.lab.eng.blr.redhat.com:/rhs/brick4/afr2x2", volinfo=0x7f3abfb11ea0, brickinfo=brickinfo@entry=0x7f3aaded4c88) at glusterd-utils.c:1354 #6 0x00007f3ab235b36f in get_brickinfo_from_brickid (brickinfo=0x7f3aaded4c88, brickid=0x7f3a9c026e90 "dbf7ab58-21a1-4951-b8ae-44e3aaa4c0ea:rhs-client19.lab.eng.blr.redhat.com:/rhs/brick4/afr2x2") at glusterd-handler.c:4816 #7 __glusterd_brick_rpc_notify (rpc=rpc@entry=0x7f3a9c026f40, mydata=mydata@entry=0x7f3a9c026e90, event=event@entry=RPC_CLNT_DISCONNECT, data=data@entry=0x0) at glusterd-handler.c:4842 #8 0x00007f3ab235e46c in glusterd_big_locked_notify (rpc=0x7f3a9c026f40, mydata=0x7f3a9c026e90, event=RPC_CLNT_DISCONNECT, data=0x0, notify_fn=0x7f3ab235b270 <__glusterd_brick_rpc_notify>) at glusterd-handler.c:71 #9 0x00007f3abd5ccc60 in rpc_clnt_notify (trans=<optimized out>, mydata=0x7f3a9c026f70, event=RPC_TRANSPORT_DISCONNECT, data=0x7f3a9c02a0e0) at rpc-clnt.c:874 #10 0x00007f3abd5c8883 in rpc_transport_notify (this=this@entry=0x7f3a9c02a0e0, event=event@entry=RPC_TRANSPORT_DISCONNECT, data=data@entry=0x7f3a9c02a0e0) at rpc-transport.c:545 #11 0x00007f3ab00b13a2 in socket_event_poll_err (this=0x7f3a9c02a0e0) at socket.c:1151 #12 socket_event_handler (fd=fd@entry=24, idx=idx@entry=15, data=0x7f3a9c02a0e0, poll_in=1, poll_out=0, poll_err=<optimized out>) at socket.c:2356 #13 0x00007f3abd85f8ba in event_dispatch_epoll_handler (event=0x7f3aaded4e80, event_pool=0x7f3abfa8bd10) at event-epoll.c:575 #14 event_dispatch_epoll_worker (data=0x7f3abfb078b0) at event-epoll.c:678 #15 0x00007f3abc666df5 in start_thread (arg=0x7f3aaded5700) at pthread_create.c:308 #16 0x00007f3abbfad1ad in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:113 From above bt it similar to https://bugzilla.redhat.com/show_bug.cgi?id=1238067 . crash due to glusterd going down and same time other thread trying to access rcu resource ,this can be confirmed by fini() and rcu_read_lock() function are executing simultaneously (Thread1 and Thread4). *** This bug has been marked as a duplicate of bug 1238067 *** |