Bug 809675 - [FEAT] Asymptotic synchronization is not reached in an unreliable environment
Summary: [FEAT] Asymptotic synchronization is not reached in an unreliable environment
Keywords:
Status: CLOSED CURRENTRELEASE
Alias: None
Product: GlusterFS
Classification: Community
Component: geo-replication
Version: pre-release
Hardware: Unspecified
OS: Unspecified
Priority: unspecified
Severity: medium
Target Milestone: ---
Assignee: Csaba Henk
QA Contact:
URL:
Whiteboard:
Depends On:
Blocks:
 
Reported: 2012-04-04 02:40 UTC by Csaba Henk
Modified: 2013-07-24 17:59 UTC (History)
4 users

Fixed In Version: glusterfs-3.4.0
Clone Of:
Environment:
Last Closed: 2013-07-24 17:59:47 UTC
Regression: ---
Mount Type: ---
Documentation: ---
CRM:
Verified Versions:
Embargoed:



Description Csaba Henk 2012-04-04 02:40:48 UTC
Description of problem:

By "asymptotic synchronization" we mean that any particular change
on master side gets synchronized to slave at some time.

The geo-rep model theoretically delivers asymptotic synchronization,
but it is not robust: if the gsyncd worker is interrupted more
often than the time needed for a complete crawl (e.g. due to
network failures, a panicky slave, or the auxiliary glusterfs
mount leaking memory until it triggers the OOM killer), then
some files will never reach the slave, because the file tree
is walked in a deterministic order.

The solution is to randomize the walk.
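A minimal sketch of the idea, shuffling each directory's entries before descending (illustrative only; function and variable names here are hypothetical, not gsyncd's actual API):

```python
import os
import random

def crawl(path):
    """Recursively walk a directory tree, visiting entries in random
    order, so that repeated interrupted crawls eventually cover all
    files instead of always re-syncing the same deterministic prefix."""
    entries = os.listdir(path)
    random.shuffle(entries)  # the fix: randomize traversal order
    for name in entries:
        full = os.path.join(path, name)
        yield full
        if os.path.isdir(full) and not os.path.islink(full):
            for sub in crawl(full):
                yield sub
```

Each interrupted run then syncs a random subset of the tree, so the probability that any given file remains unsynced shrinks with every restart.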

Version-Release number of selected component (if applicable):


How reproducible:

Readily reproducible, but determining whether the issue
appears is not easy to automate.

Steps to Reproduce:
1. create a file tree in some volume that is too big to sync within a minute
2. start geo-rep with the above volume as master and an empty slave
3. every minute, stop and restart geo-rep
  
Actual results:

Some files of master never appear on slave side.

Expected results:

Eventually all files of master should appear on slave side.

Additional info:

Comment 1 Csaba Henk 2012-04-05 01:51:42 UTC
To add, the above description, while it gives good insight into the issue at hand, is a bit of an oversimplification. Assuming that the worker is always interrupted early:

- If we have a static file tree to sync over (as in the repro instructions),
it will be synced over even with a deterministic traversal.

- If there are ongoing changes in the tree, there might be deep locations in the file tree that the synchronization activity never reaches (or reaches only with very low probability), even with randomized traversal.

Regardless, randomized traversal provides a more even distribution of synchronization, tending toward broader coverage, something like this:

Sync coverage with deterministic traversal:

     /\
    /**\
   /**  \
  /**    \
 /*       \
/*         \

Sync coverage with random traversal:

     /\
    /**\
   /****\
  /** ** \
 /   *  * \
/          \

So in fact, the whole thing can only be investigated
heuristically, for which purpose the actual scenario
where this came into the picture is quite suitable -- i.e.
in a case where the auxiliary glusterfs always leaks
until OOM, untar a bunch of kernel trees under geo-rep.
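The starvation effect, and why shuffling helps, can be illustrated with a toy simulation (purely illustrative; these names are hypothetical and this is not gsyncd code). A crawler is killed after syncing a fixed budget of files per round; with a deterministic order it re-syncs the same prefix forever, while a randomized order eventually covers the tree:

```python
import random

def interrupted_sync(order_fn, files, budget, rounds):
    """Simulate a crawler that is killed after syncing `budget` files
    per round; return the set of files ever synced across `rounds`."""
    synced = set()
    for _ in range(rounds):
        order = order_fn(files)
        synced.update(order[:budget])
    return synced

def deterministic(files):
    # Always the same order, like a fixed directory walk.
    return sorted(files)

def randomized(files):
    # Shuffle before each crawl, like the proposed fix.
    order = list(files)
    random.shuffle(order)
    return order

files = ["file%03d" % i for i in range(100)]
# Deterministic: the same 10 files every round; the rest starve.
det = interrupted_sync(deterministic, files, budget=10, rounds=50)
# Randomized: coverage grows toward the whole set over rounds.
rnd = interrupted_sync(randomized, files, budget=10, rounds=50)
```

Here `det` stays at exactly 10 files no matter how many rounds run, while `rnd` keeps growing, matching the two coverage triangles sketched above.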

Comment 2 Anand Avati 2012-04-05 12:13:06 UTC
CHANGE: http://review.gluster.com/3079 (geo-rep / gsyncd: shuffle directory entries in crawl) merged in master by Vijay Bellur (vijay)

