Bug 1049278
Summary: | [SNAPSHOT]: glusterd crashed while performing IO and taking snapshot at the same time | ||
---|---|---|---|
Product: | [Red Hat Storage] Red Hat Gluster Storage | Reporter: | Rahul Hinduja <rhinduja> |
Component: | snapshot | Assignee: | Avra Sengupta <asengupt> |
Status: | CLOSED ERRATA | QA Contact: | senaik |
Severity: | urgent | Docs Contact: | |
Priority: | urgent | ||
Version: | rhgs-3.0 | CC: | asengupt, nsathyan, rhs-bugs, rjoseph, sdharane, senaik, ssamanta, storage-qa-internal |
Target Milestone: | --- | ||
Target Release: | RHGS 3.0.0 | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | SNAPSHOT | ||
Fixed In Version: | glusterfs-3.4.1.snap.feb05.2014 | Doc Type: | Bug Fix |
Doc Text: | | Story Points: | --- |
Clone Of: | | Environment: | |
Last Closed: | 2014-09-22 19:31:33 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | | Category: | --- |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: | | ||
Bug Depends On: | 1048831 | ||
Bug Blocks: | | ||
Description
Rahul Hinduja
2014-01-07 10:29:36 UTC
Issue is reproducible with build: glusterfs-3.4.1.snap.jan15.2014git-1.el6.x86_64

IO pattern: compile_kernel

bt:
===
#0 gf_store_mkstemp (shandle=0x2d74736f68250000) at store.c:66
#1 0x00007fcaae138e82 in glusterd_store_perform_snap_volume_store (volinfo=0x7fca98001720, snap_volinfo=0x7fcaa84581d0) at glusterd-store.c:1379
#2 0x00007fcaae138f3f in glusterd_store_snap_volume (volinfo=0x7fca98001720, snap=0x7fcaa845afa0) at glusterd-store.c:1430
#3 0x00007fcaae139213 in glusterd_store_perform_snap_store (volinfo=0x7fca98001720) at glusterd-store.c:1534
#4 0x00007fcaae17c680 in glusterd_do_snap (volinfo=0x7fca98001720, snapname=0x7fcaa843fb80 "s45", dict=0x7fcab03c5638, cg=0x0, cg_id=0x0, volcount=1, snap_volid=0x7fca985d7640 "\227\250Kں\355G\215\217V\210\336z\300\245(", cg_name=0x0) at glusterd-snapshot.c:3114
#5 0x00007fcaae17d1ac in glusterd_snapshot_create_commit (dict=<value optimized out>, op_errstr=0x24d7698, rsp_dict=<value optimized out>) at glusterd-snapshot.c:4026
#6 0x00007fcaae17d5c3 in glusterd_snapshot (dict=0x7fcab03c5638, op_errstr=0x24d7698, rsp_dict=0x7fcab03c7eb0) at glusterd-snapshot.c:4404
#7 0x00007fcaae18143e in gd_mgmt_v3_commit_fn (op=GD_OP_SNAP, dict=0x7fcab03c5638, op_errstr=0x24d7698, rsp_dict=0x7fcab03c7eb0) at glusterd-mgmt.c:174
#8 0x00007fcaae181f97 in glusterd_mgmt_v3_commit (conf=0x20a7890, op=GD_OP_SNAP, op_ctx=0x7fcab03c5034, req_dict=0x7fcab03c5638, op_errstr=0x24d7698, npeers=3) at glusterd-mgmt.c:957
#9 0x00007fcaae1845ec in glusterd_mgmt_v3_initiate_snap_phases (req=0x209ae5c, op=GD_OP_SNAP, dict=0x7fcab03c5034) at glusterd-mgmt.c:1578
#10 0x00007fcaae17baab in glusterd_handle_snapshot_fn (req=0x209ae5c) at glusterd-snapshot.c:4656
#11 0x00007fcaae0f148f in glusterd_big_locked_handler (req=0x209ae5c, actor_fn=0x7fcaae17b580 <glusterd_handle_snapshot_fn>) at glusterd-handler.c:78
#12 0x00000033d1c4ce52 in synctask_wrap (old_task=<value optimized out>) at syncop.c:293
#13 0x0000003213043bf0 in ?? () from /lib64/libc.so.6
#14 0x0000000000000000 in ?? ()

Moving back the bug to assigned state. Faced brick crash while verifying this bug, which is tracked by bz 1048831. Marking this bug as dependent on bz 1048831.

bt:
====
(gdb) bt
#0 0x0000003e33032925 in raise () from /lib64/libc.so.6
#1 0x0000003e33034105 in abort () from /lib64/libc.so.6
#2 0x0000003e33070837 in __libc_message () from /lib64/libc.so.6
#3 0x0000003e33076166 in malloc_printerr () from /lib64/libc.so.6
#4 0x0000003e33078c93 in _int_free () from /lib64/libc.so.6
#5 0x00007fd2d06f603a in ltable_delete_locks (ltable=0x7fd2b0000ee0) at posix.c:2559
#6 0x00007fd2d06f6466 in disconnect_cbk (this=<value optimized out>, client=<value optimized out>) at posix.c:2619
#7 0x0000003555a63d9d in gf_client_disconnect (client=0x1cb7b50) at client_t.c:374
#8 0x00007fd2cbbbf608 in server_connection_cleanup (this=0x1c72570, client=0x1cb7b50, flags=<value optimized out>) at server-helpers.c:244
#9 0x00007fd2cbbbae0c in server_rpc_notify (rpc=<value optimized out>, xl=0x1c72570, event=<value optimized out>, data=0x1cb6d50) at server.c:558
#10 0x0000003555e07cc5 in rpcsvc_handle_disconnect (svc=0x1c74490, trans=0x1cb6d50) at rpcsvc.c:682
#11 0x0000003555e09800 in rpcsvc_notify (trans=0x1cb6d50, mydata=<value optimized out>, event=<value optimized out>, data=0x1cb6d50) at rpcsvc.c:720
#12 0x0000003555e0af18 in rpc_transport_notify (this=<value optimized out>, event=<value optimized out>, data=<value optimized out>) at rpc-transport.c:512
#13 0x00007fd2d1d72761 in socket_event_poll_err (fd=<value optimized out>, idx=<value optimized out>, data=0x1cb6d50, poll_in=<value optimized out>, poll_out=0, poll_err=24) at socket.c:1071
#14 socket_event_handler (fd=<value optimized out>, idx=<value optimized out>, data=0x1cb6d50, poll_in=<value optimized out>, poll_out=0, poll_err=24) at socket.c:2239
#15 0x0000003555a66107 in event_dispatch_epoll_handler (event_pool=0x1c44ee0) at event-epoll.c:384
#16 event_dispatch_epoll (event_pool=0x1c44ee0) at event-epoll.c:445
#17 0x000000000040680a in main (argc=19, argv=0x7ffff54e6288) at glusterfsd.c:1964

Fixed with http://review.gluster.org/#/c/6903/

Marking snapshot BZs to RHS 3.0.

Setting flags required to add BZs to RHS 3.0 Errata

Verified with build: glusterfs-3.6.0.12-1.el6rhs.x86_64

No crash observed while taking snapshot of a volume when arequal was in progress from fuse and nfs client. Moving the bug to verified state.

Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report.

http://rhn.redhat.com/errata/RHEA-2014-1278.html