Bug 763719 (GLUSTER-1987) - Bring in Graceful shutdown of glusterfs when terminated by sending a Signal.
Summary: Bring in Graceful shutdown of glusterfs when terminated by sending a Signal.
Keywords:
Status: CLOSED EOL
Alias: GLUSTER-1987
Product: GlusterFS
Classification: Community
Component: core
Version: mainline
Hardware: All
OS: Linux
low
low
Target Milestone: ---
Assignee: Raghavendra Bhat
QA Contact:
URL:
Whiteboard:
: GLUSTER-1592 763490 763569 763588 763589 763592 763594 763611 763764 763801 (view as bug list)
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2010-10-20 10:41 UTC by Raghavendra G
Modified: 2015-10-22 15:46 UTC (History)
6 users (show)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed: 2015-10-22 15:46:38 UTC
Regression: RTNR
Mount Type: fuse
Documentation: DNR
CRM:
Verified Versions:
Embargoed:


Attachments (Terms of Use)

Description Raghavendra G 2010-10-20 07:42:46 UTC
it should've been "same structures being accessed by other threads" instead of "same structures are accessed by other threads"

Comment 1 Raghavendra G 2010-10-20 07:43:41 UTC
*** Bug 1856 has been marked as a duplicate of this bug. ***

Comment 2 Raghavendra G 2010-10-20 07:44:08 UTC
*** Bug 1857 has been marked as a duplicate of this bug. ***

Comment 3 Raghavendra G 2010-10-20 07:44:20 UTC
*** Bug 1862 has been marked as a duplicate of this bug. ***

Comment 4 Raghavendra G 2010-10-20 10:41:30 UTC
When glusterfs is terminated by an external command/program using a signal, sometimes glusterfs/d crashes, because of race-conditions between signal handling thread and other threads. Crashes are due to various reasons like private being set to NULL, transport being freed as part of cleanup process and same structures are accessed by other threads.

Comment 5 Anush Shetty 2010-10-22 09:04:39 UTC
Found a crash on rdma too

(gdb) p *this
$1 = {ops = 0xbcc91f0, listener = 0x0, private = 0x0, xl_private = 0x0, xl = 0x2b5146f2a380, mydata = 0xbcc9028, lock = {__data = {__lock = 0, __count = 0,
      __owner = 0, __nusers = 0, __kind = -1, __spins = 0, __list = {__prev = 0x0, __next = 0x0}},
    __size = '\0' <repeats 16 times>, "����", '\0' <repeats 19 times>, __align = 0}, refcount = 2, ctx = 0xbcc6010, options = 0xbcc8c08,
  name = 0xbcc94f8 "glusterf\020\003", dnscache = 0xbcc9cb8, buf = 0x0, init = 0x2aaaaad6eeaf <init>, fini = 0x2aaaaad6edbf <fini>, validate_options = 0,
  reconfigure = 0x2aaaaad6e6b9 <reconfigure>, notify = 0x2b5146f3c4a3 <rpc_clnt_notify>, notify_data = 0x0, peerinfo = {sockaddr = {ss_family = 2,
      __ss_align = 0, __ss_padding = '\0' <repeats 111 times>}, sockaddr_len = 16, identifier = "127.0.0.1:24007", '\0' <repeats 92 times>}, myinfo = {
    sockaddr = {ss_family = 2, __ss_align = 0, __ss_padding = '\0' <repeats 111 times>}, sockaddr_len = 16,
    identifier = "127.0.0.1:1020", '\0' <repeats 93 times>}, total_bytes_read = 2212, total_bytes_write = 828, list = {next = 0x0, prev = 0x0}}
(gdb) p this->ops
$2 = (struct rpc_transport_ops *) 0xbcc91f0
(gdb) p *this->ops
$3 = {submit_request = 0, submit_reply = 0x311, connect = 0x3ece752e68 <main_arena+1160>, listen = 0x3ece752e68 <main_arena+1160>, disconnect = 0xbcc91f0,
  get_peername = 0xbcc91f0, get_peeraddr = 0, get_myname = 0, get_myaddr = 0}
(gdb) p *this->ref
There is no member named ref.
(gdb) p *this->refcount
Cannot access memory at address 0x2
(gdb)

Comment 6 Raghavendra G 2010-10-25 02:30:45 UTC
*** Bug 1758 has been marked as a duplicate of this bug. ***

Comment 7 Raghavendra G 2010-11-09 02:48:19 UTC
*** Bug 2032 has been marked as a duplicate of this bug. ***

Comment 8 Amar Tumballi 2010-11-09 02:52:23 UTC
*** Bug 1860 has been marked as a duplicate of this bug. ***

Comment 9 Amar Tumballi 2010-11-09 02:52:33 UTC
*** Bug 1879 has been marked as a duplicate of this bug. ***

Comment 10 Anand Avati 2010-11-09 09:04:05 UTC
*** Bug 2069 has been marked as a duplicate of this bug. ***

Comment 11 Amar Tumballi 2011-01-20 07:08:27 UTC
*** Bug 1837 has been marked as a duplicate of this bug. ***

Comment 12 Pranith Kumar K 2011-01-21 05:36:43 UTC
*** Bug 1592 has been marked as a duplicate of this bug. ***

Comment 13 Amar Tumballi 2011-04-25 09:33:56 UTC
Please update the status of this bug as its been more than 6months since its filed (bug id < 2000)

Please resolve it with proper resolution if its not valid anymore. If its still valid and not critical, move it to 'enhancement' severity.

Comment 14 Amar Tumballi 2011-09-27 05:50:03 UTC
Planing to keep 3.4.x branch as "internal enhancements" release without any features. So moving these bugs to 3.4.0 target milestone.

Comment 15 Amar Tumballi 2011-09-28 04:27:08 UTC
More of internal development, for future releases.

Comment 16 Kaleb KEITHLEY 2015-10-22 15:46:38 UTC
because of the large number of bugs filed against mainline version\ is ambiguous and about to be removed as a choice.

If you believe this is still a bug, please change the status back to NEW and choose the appropriate, applicable version for it.


Note You need to log in before you can comment on or make changes to this bug.