Bug 987929

Summary: Dist-geo-rep : geo-rep xsync misses some files while doing rebalance and geo-rep syncing files parallely
Product: [Red Hat Storage] Red Hat Gluster Storage Reporter: Vijaykumar Koppad <vkoppad>
Component: geo-replication Assignee: Bug Updates Notification Mailing List <rhs-bugs>
Status: CLOSED EOL QA Contact: shilpa <smanjara>
Severity: high Docs Contact:
Priority: high    
Version: 2.1 CC: avishwan, chrisw, csaba, david.macdonald, nsathyan, rhs-bugs, rwheeler, sdharane, vagarwal, vbellur
Target Milestone: --- Keywords: ZStream
Target Release: ---   
Hardware: x86_64   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2015-11-25 08:49:05 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 957769    

Description Vijaykumar Koppad 2013-07-24 12:29:20 UTC
Description of problem: Running a rebalance while geo-rep is syncing files causes the geo-rep xsync crawl to miss some of the files. Consequently, those files are never synced to the slave.

Observations:
1. All the missing files had entries in the changelog of a single brick.
2. That brick's changelog was present in either the .processing or the .processed directory of the brick's geo-rep working_dir.
3. The missing files had no entries in the XSYNC-CHANGELOG either, which means that after geo-rep was stopped and started, the first xsync crawl failed to pick up those entries.

Version-Release number of selected component (if applicable):3.4.0.12rhs.beta6-1.el6rhs.x86_64


How reproducible: Didn't try reproducing it


Steps to Reproduce:
1. Create and start a geo-rep relationship between the master (DIST_REP) and the slave.
2. Add bricks to the master volume.
3. Start creating files on the master volume and, in parallel, run geo-rep stop && geo-rep start && rebalance start.
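
For anyone retrying steps 2 and 3, the command order can be sketched as below. This is only a dry-run helper that builds and prints the gluster CLI invocations; it does not run them. The volume name, slave URL and brick paths are hypothetical placeholders, and the exact geo-replication syntax should be checked against the installed gluster version. File creation on the master mount is assumed to be running in another shell.

```python
import shlex


def repro_commands(master, slave, new_bricks):
    """Return the CLI invocations for steps 2 and 3, in the order they are run.

    master     -- master volume name (hypothetical placeholder)
    slave      -- slave URL, e.g. "host::volume" (hypothetical placeholder)
    new_bricks -- list of "host:/path" bricks to add (hypothetical placeholders)
    """
    georep = ["gluster", "volume", "geo-replication", master, slave]
    return [
        # Step 2: grow the master volume.
        ["gluster", "volume", "add-brick", master, *new_bricks],
        # Step 3: restart geo-rep, then kick off the rebalance.
        georep + ["stop"],
        georep + ["start"],
        ["gluster", "volume", "rebalance", master, "start"],
    ]


if __name__ == "__main__":
    # All names below are made up for illustration.
    commands = repro_commands(
        "master",
        "slavehost::slave",
        ["node1:/bricks/b3", "node2:/bricks/b3"],
    )
    for cmd in commands:
        print(shlex.join(cmd))
```

Printing rather than executing keeps the sketch safe to run on any machine; the printed lines can be pasted into a shell on a master node once the names are adjusted.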

Actual results: Fails to sync a few files.


Expected results: Should sync all the files.


Additional info:

Comment 2 Vijaykumar Koppad 2013-07-24 12:54:08 UTC
One more observation to add:

- The .processed directory in the geo-rep working directory of the brick that all the missing files came from has entries for the changelogs CHANGELOG.1374662896 and CHANGELOG.1374662936; the one missing is CHANGELOG.1374662916. In the backend changelog directory, all the missing files have entries in CHANGELOG.1374662916, and the xsync crawl changelog processed at that time was XSYNC-CHANGELOG.1374662916.


Hope this might help.
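
The manual comparison in this comment (backend changelogs versus what geo-rep has picked up) could be automated roughly as follows. This is a minimal sketch, assuming the backend changelog directory holds files named CHANGELOG.<timestamp> and that the brick's geo-rep working directory contains .processing and .processed subdirectories with the same file names, as described in this report. The function names and any paths passed in are hypothetical.

```python
import os
import re

# Matches brick changelogs only; XSYNC-CHANGELOG.* and other files are ignored.
CHANGELOG_RE = re.compile(r"^CHANGELOG\.(\d+)$")


def changelog_names(directory):
    """Return the set of CHANGELOG.<timestamp> file names found in directory."""
    return {name for name in os.listdir(directory) if CHANGELOG_RE.match(name)}


def unprocessed_changelogs(backend_dir, working_dir):
    """List backend changelogs absent from both .processing and .processed.

    backend_dir -- brick's backend changelog directory (assumed layout)
    working_dir -- brick's geo-rep working directory (assumed layout)
    """
    seen = set()
    for sub in (".processing", ".processed"):
        path = os.path.join(working_dir, sub)
        if os.path.isdir(path):
            seen |= changelog_names(path)
    missing = changelog_names(backend_dir) - seen
    # Sort by numeric timestamp so gaps show up in chronological order.
    return sorted(missing, key=lambda n: int(CHANGELOG_RE.match(n).group(1)))
```

Called with the affected brick's two directories, this would be expected to report CHANGELOG.1374662916 for the state described above. A changelog that is still being written or has not yet been handed to geo-rep would also be listed, so the result is a starting point for inspection rather than proof of a missed changelog.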

Comment 4 Amar Tumballi 2013-08-07 05:31:45 UTC
As per the discussion in the mailing thread:

>
> Can we consider removing the 'blocker' flag from this and marking the bug
> as 'medium' priority? It is an issue only if geo-replication is stopped
> and started while rebalancing. For now, if geo-replication is
> continuously running, rebalance is handled properly.
>

Sayan: Agree that this is not a blocker.
Amar: Taking this out of the 'blocker' list now.

Comment 11 Aravinda VK 2015-11-25 08:49:05 UTC
Closing this bug since RHGS 2.1 release reached EOL. Required bugs are cloned to RHGS 3.1. Please re-open this issue if found again.

Comment 12 Aravinda VK 2015-11-25 08:50:56 UTC
Closing this bug since RHGS 2.1 release reached EOL. Required bugs are cloned to RHGS 3.1. Please re-open this issue if found again.