Bug 762706 (GLUSTER-974)

Summary: Spurious self-heal?
Product: [Community] GlusterFS
Component: replicate
Version: 3.0.4
Status: CLOSED CURRENTRELEASE
Severity: high
Priority: low
Reporter: Anush Shetty <anush>
Assignee: Pranith Kumar K <pkarampu>
CC: gluster-bugs, vijay
Hardware: All   
OS: Linux   
Doc Type: Bug Fix

Description Anush Shetty 2010-06-02 07:26:49 UTC
Found this while running ls on the mount point, which contains 250000 files.

[2010-06-02 12:47:58] T [fuse-bridge.c:602:fuse_lookup] glusterfs-fuse: 1603: LOOKUP /tmpfile.000625(4003013)
[2010-06-02 12:47:58] T [fuse-bridge.c:470:fuse_entry_cbk] glusterfs-fuse: 1603: LOOKUP() /tmpfile.000625 => 4003013 (4003013)
[2010-06-02 12:47:58] T [fuse-bridge.c:602:fuse_lookup] glusterfs-fuse: 1604: LOOKUP /tmpfile.000282(4000241)
[2010-06-02 12:47:58] T [fuse-bridge.c:470:fuse_entry_cbk] glusterfs-fuse: 1604: LOOKUP() /tmpfile.000282 => 4000241 (4000241)
[2010-06-02 12:47:58] T [fuse-bridge.c:602:fuse_lookup] glusterfs-fuse: 1605: LOOKUP /tmpfile.000427(4001409)
[2010-06-02 12:47:58] T [fuse-bridge.c:470:fuse_entry_cbk] glusterfs-fuse: 1605: LOOKUP() /tmpfile.000427 => 4001409 (4001409)
[2010-06-02 12:47:58] T [fuse-bridge.c:2252:fuse_readdir] glusterfs-fuse: 1606: READDIR (0x7f8f70007f90, size=4096, offset=4294967295)
[2010-06-02 12:47:58] T [fuse-bridge.c:2198:fuse_readdir_cbk] glusterfs-fuse: 1606: READDIR => 0/4096,4294967295
[2010-06-02 12:47:58] T [fuse-bridge.c:2270:fuse_releasedir] glusterfs-fuse: 1607: RELEASEDIR 0x7f8f70007f90
[2010-06-02 12:48:07] T [afr-self-heal-common.c:1557:afr_self_heal] rep1: performing self heal on / (metadata=0 data=1 entry=0)
[2010-06-02 12:48:07] T [afr-self-heal-common.c:1593:afr_self_heal] rep1: proceeding to metadata check on /
[2010-06-02 12:48:07] T [afr-self-heal-common.c:772:afr_sh_missing_entries_done] rep1: proceeding to metadata check on /
[2010-06-02 12:48:07] T [afr-self-heal-metadata.c:95:afr_sh_metadata_done] rep1: proceeding to entry check on /
[2010-06-02 12:48:07] T [afr-self-heal-entry.c:2551:afr_self_heal_entry] rep1: proceeding to completion on /
[2010-06-02 12:48:07] T [afr-self-heal-entry.c:80:afr_sh_entry_done] rep1: self heal of / completed
[2010-06-02 12:48:07] T [afr-self-heal-common.c:1503:afr_self_heal_completion_cbk] rep1: background self-heal completed

Comment 1 Anand Avati 2010-07-29 03:58:12 UTC
PATCH: http://patches.gluster.com/patch/3952 in master (fixes for spurious entry self-heal in afr)

Comment 2 Anand Avati 2010-08-13 15:25:36 UTC
PATCH: http://patches.gluster.com/patch/4117 in release-3.0 (fixes for spurious entry self-heal in afr)

Comment 3 Pranith Kumar K 2010-08-14 02:59:15 UTC
The present commits should stop entry self-heal from being triggered. To complete the fix we still need to enhance the rmdir functionality, which will be done as part of 1360.

Comment 4 Pranith Kumar K 2010-08-14 03:03:08 UTC
The bug is triggered because the checksum of the entries in the root directory takes the .landfill directory into account, which it should not. Tested with the same case given in the description of the bug.
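
For illustration only, here is a minimal Python sketch of the idea in comment 4. It is not the afr code from the patches above. The function name entries_checksum, the IGNORED_ENTRIES set and the use of SHA-256 are all assumptions made for this example, and the two listings are made up from file names seen in the log. It assumes .landfill is present on only one replica, and shows how a checksum over all entry names then makes two otherwise identical directories look different, while skipping .landfill makes them compare equal.

```python
import hashlib

# Hypothetical set of internal names to leave out of the comparison.
IGNORED_ENTRIES = frozenset({".landfill"})


def entries_checksum(names, ignored=IGNORED_ENTRIES):
    """Return a digest over the sorted entry names, skipping ignored ones.

    SHA-256 is only a stand-in here; the real checksum used by afr is
    not shown in this bug report.
    """
    digest = hashlib.sha256()
    for name in sorted(names):
        if name in ignored:
            continue
        # Terminate each name with NUL so ["ab", "c"] and ["a", "bc"] differ.
        digest.update(name.encode("utf-8") + b"\0")
    return digest.hexdigest()


# Made-up listings of / on two replicas; only the first has .landfill.
brick_a = [".landfill", "tmpfile.000282", "tmpfile.000427", "tmpfile.000625"]
brick_b = ["tmpfile.000625", "tmpfile.000282", "tmpfile.000427"]

# Counting every entry: the digests differ, which would look like a
# directory in need of entry self-heal.
naive_differs = (entries_checksum(brick_a, ignored=frozenset())
                 != entries_checksum(brick_b, ignored=frozenset()))

# Skipping .landfill: the digests agree, so no heal would be triggered.
filtered_matches = entries_checksum(brick_a) == entries_checksum(brick_b)

print("naive checksums differ:", naive_differs)
print("filtered checksums match:", filtered_matches)
```

Sorting the names before hashing keeps the result independent of the order in which each replica returns its entries, so only a real difference in the set of names changes the digest.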