Bug 1028663 - AFR: self-heal daemon crawler not obeying the 'cluster.heal-timeout' time interval
AFR: self-heal daemon crawler not obeying the 'cluster.heal-timeout' time int...
Product: GlusterFS
Classification: Community
Component: core (Show other bugs)
Unspecified Unspecified
unspecified Severity high
: ---
: ---
Assigned To: Ravishankar N
Depends On: 1027559
  Show dependency treegraph
Reported: 2013-11-09 03:15 EST by Ravishankar N
Modified: 2014-07-11 15:17 EDT (History)
2 users (show)

See Also:
Fixed In Version: glusterfs-3.5.1
Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: 1027559
Last Closed: 2014-07-11 15:17:40 EDT
Type: Bug
Regression: ---
Mount Type: ---
Documentation: ---
Verified Versions:
Category: ---
oVirt Team: ---
RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: ---

Attachments (Terms of Use)

  None (edit)
Description Ravishankar N 2013-11-09 03:15:50 EST
+++ This bug was initially created as a clone of Bug #1027559 +++

Description of problem:
The afr_start_crawl() function is called before expiry of "cluster.heal-timeout" seconds (which is 600 seconds by default)

Version-Release number of selected component (if applicable):
RHS 2.1

How reproducible:

Steps to Reproduce:
1.Create and start a 1x2 replica volume using 2 different nodes.
2.gluster v set <VOLNAME> diagnostics.client-log-level DEBUG
3.gluster v set <VOLNAME> cluster.heal-timeout 300
4.tailf /var/log/glusterfs/glustershd.log (on either of the nodes)

[2013-11-07 10:32:06.058154] D [afr-self-heald.c:1233:afr_start_crawl] 0-testvol-replicate-0: starting crawl 1 for testvol-client-0
[2013-11-07 10:32:07.059428] D [afr-self-heald.c:1233:afr_start_crawl] 0-testvol-replicate-0: starting crawl 1 for testvol-client-0

Actual results:

The time interval between 2 successive invocations of afr_start_crawl() is just one second.

Expected results:
The crawler must start only once in "cluster.heal-timeout" seconds.

Additional info:

--- Additional comment from RHEL Product and Program Management on 2013-11-07 00:24:39 EST ---

Since this issue was entered in bugzilla, the release flag has been
set to ? to ensure that it is properly evaluated for this release.
Comment 1 Anand Avati 2013-11-09 03:27:48 EST
REVIEW: http://review.gluster.org/6243 (libglusterfs: fix bug in timespec adjustment) posted (#2) for review on master by Ravishankar N (ravishankar@redhat.com)
Comment 2 Anand Avati 2013-11-10 22:42:05 EST
COMMIT: http://review.gluster.org/6243 committed in master by Anand Avati (avati@redhat.com) 
commit d5335f9e40f6e9533f7812d153b9727bcc04aa4e
Author: Ravishankar N <ravishankar@redhat.com>
Date:   Sat Nov 9 11:56:34 2013 +0000

    libglusterfs: fix bug in timespec adjustment
    The argument to the timespec_adjust_delta() function introudced in
    commit 6836118b21 needs to be passed by reference rather than by value
    for the function to do it's job.
    BUG: 1028663
    Change-Id: I62a3636906e67ed35b7786e9553f6819b48f3626
    Signed-off-by: Ravishankar N <ravishankar@redhat.com>
    Reviewed-on: http://review.gluster.org/6243
    Tested-by: Gluster Build System <jenkins@build.gluster.com>
    Reviewed-by: Anand Avati <avati@redhat.com>

Note You need to log in before you can comment on or make changes to this bug.