Bug 1247536 - Dist-geo-rep : checkpoint doesn't reach even though all the files have been synced through hybrid crawl.
Keywords:
Status: CLOSED CURRENTRELEASE
Alias: None
Product: GlusterFS
Classification: Community
Component: geo-replication
Version: mainline
Hardware: x86_64
OS: Linux
Priority: medium
Severity: high
Target Milestone: ---
Assignee: Aravinda VK
QA Contact:
URL:
Whiteboard: checkpoint
Depends On: 1044645 1064309 1285196
Blocks: 1202842 1223636 1279306
Reported: 2015-07-28 09:20 UTC by Aravinda VK
Modified: 2016-06-16 13:27 UTC (History)

Fixed In Version: glusterfs-3.8rc2
Doc Type: Bug Fix
Doc Text:
Clone Of: 1044645
: 1279306 (view as bug list)
Environment:
Last Closed: 2016-06-16 13:27:16 UTC
Regression: ---
Mount Type: ---
Documentation: ---
CRM:
Verified Versions:
Embargoed:



Description Aravinda VK 2015-07-28 09:20:48 UTC
+++ This bug was initially created as a clone of Bug #1044645 +++

Description of problem: The geo-rep status checkpoint is never reached even though all the files have been synced through hybrid crawl.


Version-Release number of selected component (if applicable): glusterfs-3.4.0.51geo-1

How reproducible: Did not try to reproduce, but it appears to be consistently reproducible.


Steps to Reproduce:
1. Create and start a geo-rep relationship between master and slave.
2. Stop geo-rep.
3. Create some data on the master.
4. Set the checkpoint.
5. Start geo-rep.
6. Wait for geo-rep to sync the data.
7. Check the geo-rep status to see whether the checkpoint has been reached.

Actual results: The checkpoint is never reached.


Expected results: The checkpoint should be reached once all the files are synced.
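
The reproduction steps above can be sketched as a small Python driver. This is an illustrative sketch, not part of the original report: the names `repro_steps` and `run` are hypothetical, `create push-pem` assumes passwordless SSH to the slave has already been set up, and the volume names are taken from the status output later in this bug. Steps that have no single command (creating data, waiting for sync) are left to the tester and carry `None` instead of a command.

```python
import shlex
import subprocess
from typing import List, Optional, Tuple

Step = Tuple[str, Optional[List[str]]]


def repro_steps(master: str, slave: str) -> List[Step]:
    """Return the reproduction steps as (description, argv) pairs.

    argv is None for steps the tester has to perform by hand.
    """
    base = ["gluster", "volume", "geo-replication", master, slave]
    return [
        ("create the geo-rep session", base + ["create", "push-pem"]),
        ("start geo-rep", base + ["start"]),
        ("stop geo-rep", base + ["stop"]),
        ("create some data on the master mount", None),
        ("set the checkpoint", base + ["config", "checkpoint", "now"]),
        ("start geo-rep again", base + ["start"]),
        ("wait for the data to sync", None),
        ("check whether the checkpoint was reached", base + ["status", "detail"]),
    ]


def run(steps: List[Step], dry_run: bool = True) -> None:
    """Print each step; execute the commands only when dry_run is False."""
    for description, argv in steps:
        if argv is None:
            print(f"# manual step: {description}")
            continue
        print(f"# {description}")
        print(shlex.join(argv))
        if not dry_run:
            subprocess.run(argv, check=True)


if __name__ == "__main__":
    # Dry run only: prints the commands without touching any cluster.
    run(repro_steps("master", "10.70.46.101::slave"))
```

Running it as-is only prints the commands. Pass `dry_run=False` on a test cluster to execute them, pausing at the manual steps as needed.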


--- Additional comment from Aravinda VK on 2013-12-20 03:05:42 EST ---

At the start of a hybrid crawl, the crawler stores the master's xtime in memory. After the crawl and sync complete, it sets that same xtime on the slave.

If files are created after the crawler has started, the checkpoint time will be later than the xtime saved in memory, so the status shows the checkpoint as not reached even after the crawl completes.

This is expected behavior. If we updated the slave with the latest xtime instead of the xtime stored in memory, there would be a risk of data loss.
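
The timing issue described in this comment can be illustrated with a toy Python model. This is a sketch under stated assumptions, not gsyncd code: the class `HybridCrawlModel` and its methods are hypothetical, xtimes are plain numbers, and the checkpoint is assumed to count as reached when the xtime recorded for the slave is at or past the checkpoint time.

```python
from typing import Optional


class HybridCrawlModel:
    """Toy model of the behaviour described above; not gsyncd code."""

    def __init__(self) -> None:
        self.slave_xtime: float = 0.0
        self.saved_master_xtime: Optional[float] = None

    def start_crawl(self, master_xtime: float) -> None:
        # The master's xtime is captured once, when the crawl begins.
        self.saved_master_xtime = master_xtime

    def finish_crawl(self) -> None:
        # On completion the slave gets the xtime saved at crawl start,
        # not whatever the master's xtime has become in the meantime.
        if self.saved_master_xtime is None:
            raise RuntimeError("finish_crawl() called before start_crawl()")
        self.slave_xtime = self.saved_master_xtime
        self.saved_master_xtime = None

    def checkpoint_reached(self, checkpoint_time: float) -> bool:
        # Assumption: reached when the slave's xtime is at or past the checkpoint.
        return self.slave_xtime >= checkpoint_time


model = HybridCrawlModel()
model.start_crawl(master_xtime=100.0)
model.finish_crawl()
print(model.checkpoint_reached(90.0))   # True: checkpoint predates the crawl start
print(model.checkpoint_reached(105.0))  # False: checkpoint is later than the saved xtime
```

In the second call the crawl has fully completed, yet the checkpoint is still reported as not reached, which matches the symptom in this report.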

--- Additional comment from Venky Shankar on 2013-12-20 03:41:07 EST ---

Vijaykumar,

Was I/O done on the mount after the checkpoint was set? If yes, then isn't this the expected behaviour?

--- Additional comment from Rahul Hinduja on 2015-07-07 06:58:30 EDT ---

Verified with build: glusterfs-3.7.1-7.el6rhs.x86_64

Tried both of the scenarios below:

a. Have the files present before creating the geo-rep session, so that HYBRID CRAWL is used
b. Change the change_detector to xsync to use HYBRID CRAWL

In both of the above cases, the last sync time is not updated. In the first case, LAST_SYNCED is N/A; in the second case, it shows when the last changelog was synced.

Ultimately, in Hybrid Crawl the CHECKPOINT COMPLETED field always remains No, even when the files have been synced to the slave. Moving this bug to the Assigned state.

--- Additional comment from Rahul Hinduja on 2015-07-07 07:00:25 EDT ---

For Scenario A:
===============

[root@georep1 scripts]# gluster volume geo-replication master 10.70.46.101::slave status detail
 
MASTER NODE    MASTER VOL    MASTER BRICK      SLAVE USER    SLAVE                  SLAVE NODE      STATUS     CRAWL STATUS    LAST_SYNCED    ENTRY    DATA    META    FAILURES    CHECKPOINT TIME        CHECKPOINT COMPLETED    CHECKPOINT COMPLETION TIME   
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
georep1        master        /rhs/brick1/b1    root          10.70.46.101::slave    10.70.46.101    Active     Hybrid Crawl    N/A            0        3567    7797    0           2015-07-07 15:39:13    No                      N/A                          
georep1        master        /rhs/brick2/b2    root          10.70.46.101::slave    10.70.46.101    Active     Hybrid Crawl    N/A            0        3611    7845    0           2015-07-07 15:39:13    No                      N/A                          
georep3        master        /rhs/brick1/b1    root          10.70.46.101::slave    10.70.46.101    Active     Hybrid Crawl    N/A            0        3441    7611    0           2015-07-07 15:39:13    No                      N/A                          
georep3        master        /rhs/brick2/b2    root          10.70.46.101::slave    10.70.46.101    Active     Hybrid Crawl    N/A            0        3550    7726    0           2015-07-07 15:39:13    No                      N/A                          
georep2        master        /rhs/brick1/b1    root          10.70.46.101::slave    10.70.46.103    Passive    N/A             N/A            N/A      N/A     N/A     N/A         N/A                    N/A                     N/A                          
georep2        master        /rhs/brick2/b2    root          10.70.46.101::slave    10.70.46.103    Passive    N/A             N/A            N/A      N/A     N/A     N/A         N/A                    N/A                     N/A                          
georep4        master        /rhs/brick1/b1    root          10.70.46.101::slave    10.70.46.103    Passive    N/A             N/A            N/A      N/A     N/A     N/A         N/A                    N/A                     N/A                          
georep4        master        /rhs/brick2/b2    root          10.70.46.101::slave    10.70.46.103    Passive    N/A             N/A            N/A      N/A     N/A     N/A         N/A                    N/A                     N/A                          
[root@georep1 scripts]# 



[root@georep1 scripts]# gluster volume geo-replication master 10.70.46.101::slave status detail
 
MASTER NODE    MASTER VOL    MASTER BRICK      SLAVE USER    SLAVE                  SLAVE NODE      STATUS     CRAWL STATUS       LAST_SYNCED    ENTRY    DATA    META    FAILURES    CHECKPOINT TIME        CHECKPOINT COMPLETED    CHECKPOINT COMPLETION TIME   
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
georep1        master        /rhs/brick1/b1    root          10.70.46.101::slave    10.70.46.101    Active     Changelog Crawl    N/A            0        7798    0       0           2015-07-07 15:39:13    No                      N/A                          
georep1        master        /rhs/brick2/b2    root          10.70.46.101::slave    10.70.46.101    Active     Changelog Crawl    N/A            0        7847    0       0           2015-07-07 15:39:13    No                      N/A                          
georep3        master        /rhs/brick1/b1    root          10.70.46.101::slave    10.70.46.101    Active     Hybrid Crawl       N/A            0        3441    0       0           2015-07-07 15:39:13    No                      N/A                          
georep3        master        /rhs/brick2/b2    root          10.70.46.101::slave    10.70.46.101    Active     Hybrid Crawl       N/A            0        3550    0       0           2015-07-07 15:39:13    No                      N/A                          
georep2        master        /rhs/brick1/b1    root          10.70.46.101::slave    10.70.46.103    Passive    N/A                N/A            N/A      N/A     N/A     N/A         N/A                    N/A                     N/A                          
georep2        master        /rhs/brick2/b2    root          10.70.46.101::slave    10.70.46.103    Passive    N/A                N/A            N/A      N/A     N/A     N/A         N/A                    N/A                     N/A                          
georep4        master        /rhs/brick1/b1    root          10.70.46.101::slave    10.70.46.103    Passive    N/A                N/A            N/A      N/A     N/A     N/A         N/A                    N/A                     N/A                          
georep4        master        /rhs/brick2/b2    root          10.70.46.101::slave    10.70.46.103    Passive    N/A                N/A            N/A      N/A     N/A     N/A         N/A                    N/A                     N/A                          
[root@georep1 scripts]# 



[root@georep1 scripts]# gluster volume geo-replication master 10.70.46.101::slave status detail
 
MASTER NODE    MASTER VOL    MASTER BRICK      SLAVE USER    SLAVE                  SLAVE NODE      STATUS     CRAWL STATUS       LAST_SYNCED            ENTRY    DATA    META    FAILURES    CHECKPOINT TIME        CHECKPOINT COMPLETED    CHECKPOINT COMPLETION TIME   
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
georep1        master        /rhs/brick1/b1    root          10.70.46.101::slave    10.70.46.101    Active     Changelog Crawl    2015-07-07 15:39:27    0        0       0       0           2015-07-07 15:39:13    Yes                     2015-07-07 15:53:41          
georep1        master        /rhs/brick2/b2    root          10.70.46.101::slave    10.70.46.101    Active     Changelog Crawl    2015-07-07 15:39:27    0        0       0       0           2015-07-07 15:39:13    Yes                     2015-07-07 15:52:54          
georep2        master        /rhs/brick1/b1    root          10.70.46.101::slave    10.70.46.103    Passive    N/A                N/A                    N/A      N/A     N/A     N/A         N/A                    N/A                     N/A                          
georep2        master        /rhs/brick2/b2    root          10.70.46.101::slave    10.70.46.103    Passive    N/A                N/A                    N/A      N/A     N/A     N/A         N/A                    N/A                     N/A                          
georep3        master        /rhs/brick1/b1    root          10.70.46.101::slave    10.70.46.101    Active     Changelog Crawl    2015-07-07 15:39:33    0        0       0       0           2015-07-07 15:39:13    Yes                     2015-07-07 15:53:12          
georep3        master        /rhs/brick2/b2    root          10.70.46.101::slave    10.70.46.101    Active     Changelog Crawl    2015-07-07 15:39:33    0        0       0       0           2015-07-07 15:39:13    Yes                     2015-07-07 15:53:14          
georep4        master        /rhs/brick1/b1    root          10.70.46.101::slave    10.70.46.103    Passive    N/A                N/A                    N/A      N/A     N/A     N/A         N/A                    N/A                     N/A                          
georep4        master        /rhs/brick2/b2    root          10.70.46.101::slave    10.70.46.103    Passive    N/A                N/A                    N/A      N/A     N/A     N/A         N/A                    N/A                     N/A                          
[root@georep1 scripts]# 

[rahinuj@rahul Desktop]$

--- Additional comment from Rahul Hinduja on 2015-07-07 07:02:30 EDT ---

For Scenario B:
===============

[root@georep1 scripts]# gluster volume geo-replication master 10.70.46.101::slave config change_detector
changelog
[root@georep1 scripts]# gluster volume geo-replication master 10.70.46.101::slave config change_detector xsync
geo-replication config updated successfully
[root@georep1 scripts]# gluster volume geo-replication master 10.70.46.101::slave config change_detector
xsync
[root@georep1 scripts]#


[root@georep1 scripts]# gluster volume geo-replication master 10.70.46.101::slave status detail
 
MASTER NODE    MASTER VOL    MASTER BRICK      SLAVE USER    SLAVE                  SLAVE NODE      STATUS     CRAWL STATUS    LAST_SYNCED            ENTRY    DATA    META    FAILURES    CHECKPOINT TIME        CHECKPOINT COMPLETED    CHECKPOINT COMPLETION TIME   
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
georep1        master        /rhs/brick1/b1    root          10.70.46.101::slave    10.70.46.101    Active     Hybrid Crawl    2015-07-07 15:39:27    0        154     0       0           2015-07-07 16:16:07    No                      N/A                          
georep1        master        /rhs/brick2/b2    root          10.70.46.101::slave    10.70.46.101    Active     Hybrid Crawl    2015-07-07 15:39:27    0        160     0       0           2015-07-07 16:16:07    No                      N/A                          
georep3        master        /rhs/brick1/b1    root          10.70.46.101::slave    10.70.46.101    Active     Hybrid Crawl    2015-07-07 15:39:33    0        156     0       0           2015-07-07 16:16:07    No                      N/A                          
georep3        master        /rhs/brick2/b2    root          10.70.46.101::slave    10.70.46.101    Active     Hybrid Crawl    2015-07-07 15:39:33    0        179     0       0           2015-07-07 16:16:07    No                      N/A                          
georep4        master        /rhs/brick1/b1    root          10.70.46.101::slave    10.70.46.103    Passive    N/A             N/A                    N/A      N/A     N/A     N/A         N/A                    N/A                     N/A                          
georep4        master        /rhs/brick2/b2    root          10.70.46.101::slave    10.70.46.103    Passive    N/A             N/A                    N/A      N/A     N/A     N/A         N/A                    N/A                     N/A                          
georep2        master        /rhs/brick1/b1    root          10.70.46.101::slave    10.70.46.103    Passive    N/A             N/A                    N/A      N/A     N/A     N/A         N/A                    N/A                     N/A                          
georep2        master        /rhs/brick2/b2    root          10.70.46.101::slave    10.70.46.103    Passive    N/A             N/A                    N/A      N/A     N/A     N/A         N/A                    N/A                     N/A                          
[root@georep1 scripts]#

[root@georep1 scripts]# gluster volume geo-replication master 10.70.46.101::slave status detail
 
MASTER NODE    MASTER VOL    MASTER BRICK      SLAVE USER    SLAVE                  SLAVE NODE      STATUS     CRAWL STATUS    LAST_SYNCED            ENTRY    DATA    META    FAILURES    CHECKPOINT TIME        CHECKPOINT COMPLETED    CHECKPOINT COMPLETION TIME   
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
georep1        master        /rhs/brick1/b1    root          10.70.46.101::slave    10.70.46.101    Active     Hybrid Crawl    2015-07-07 15:39:27    0        0       0       0           2015-07-07 16:16:07    No                      N/A                          
georep1        master        /rhs/brick2/b2    root          10.70.46.101::slave    10.70.46.101    Active     Hybrid Crawl    2015-07-07 15:39:27    0        0       0       0           2015-07-07 16:16:07    No                      N/A                          
georep2        master        /rhs/brick1/b1    root          10.70.46.101::slave    10.70.46.103    Passive    N/A             N/A                    N/A      N/A     N/A     N/A         N/A                    N/A                     N/A                          
georep2        master        /rhs/brick2/b2    root          10.70.46.101::slave    10.70.46.103    Passive    N/A             N/A                    N/A      N/A     N/A     N/A         N/A                    N/A                     N/A                          
georep3        master        /rhs/brick1/b1    root          10.70.46.101::slave    10.70.46.101    Active     Hybrid Crawl    2015-07-07 15:39:33    0        0       0       0           2015-07-07 16:16:07    No                      N/A                          
georep3        master        /rhs/brick2/b2    root          10.70.46.101::slave    10.70.46.101    Active     Hybrid Crawl    2015-07-07 15:39:33    0        0       0       0           2015-07-07 16:16:07    No                      N/A                          
georep4        master        /rhs/brick1/b1    root          10.70.46.101::slave    10.70.46.103    Passive    N/A             N/A                    N/A      N/A     N/A     N/A         N/A                    N/A                     N/A                          
georep4        master        /rhs/brick2/b2    root          10.70.46.101::slave    10.70.46.103    Passive    N/A             N/A                    N/A      N/A     N/A     N/A         N/A                    N/A                     N/A                          
[root@georep1 scripts]#
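
The `status detail` tables pasted above can be checked programmatically rather than by eye. The following is a minimal Python sketch, assuming the column layout shown in this bug (columns separated by two or more spaces, a dashed rule under the header). The helper names `parse_status_detail` and `checkpoint_complete` are hypothetical and are not part of the gluster CLI.

```python
import re
from typing import Dict, List


def parse_status_detail(text: str) -> List[Dict[str, str]]:
    """Parse `status detail` output into one dict per brick row."""
    lines = [ln.strip() for ln in text.splitlines()]
    # Drop blank lines and the dashed rule under the header.
    lines = [ln for ln in lines if ln and set(ln) != {"-"}]
    header_idx = next(
        i for i, ln in enumerate(lines) if ln.startswith("MASTER NODE")
    )
    # Column names and values may contain single spaces, so split on 2+.
    header = re.split(r"\s{2,}", lines[header_idx])
    rows = []
    for ln in lines[header_idx + 1:]:
        cells = re.split(r"\s{2,}", ln)
        if len(cells) == len(header):  # skips shell prompts and other noise
            rows.append(dict(zip(header, cells)))
    return rows


def checkpoint_complete(rows: List[Dict[str, str]]) -> bool:
    """True only if at least one brick is Active and all Active bricks say Yes."""
    active = [r for r in rows if r["STATUS"] == "Active"]
    return bool(active) and all(
        r["CHECKPOINT COMPLETED"] == "Yes" for r in active
    )
```

Passive bricks report N/A in every field, so only Active rows are considered. Fed the Scenario B output above, `checkpoint_complete` would return False, since every Active brick reports No.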

Comment 1 Anand Avati 2015-07-28 09:26:01 UTC
REVIEW: http://review.gluster.org/11771 (geo-rep: Update last_synced_time in XSync) posted (#1) for review on master by Aravinda VK (avishwan)

Comment 2 Anand Avati 2015-08-05 04:52:30 UTC
REVIEW: http://review.gluster.org/11771 (geo-rep: Update last_synced_time in XSync) posted (#2) for review on master by Aravinda VK (avishwan)

Comment 3 Anand Avati 2015-08-12 09:57:41 UTC
REVIEW: http://review.gluster.org/11771 (geo-rep: Update last_synced_time in XSync) posted (#3) for review on master by Aravinda VK (avishwan)

Comment 4 Anand Avati 2015-08-19 05:59:19 UTC
REVIEW: http://review.gluster.org/11771 (geo-rep: Update last_synced_time in XSync) posted (#4) for review on master by Aravinda VK (avishwan)

Comment 5 Anand Avati 2015-08-26 05:56:59 UTC
REVIEW: http://review.gluster.org/11771 (geo-rep: Update last_synced_time in XSync) posted (#5) for review on master by Aravinda VK (avishwan)

Comment 6 Vijay Bellur 2015-09-08 17:17:45 UTC
REVIEW: http://review.gluster.org/11771 (geo-rep: Update last_synced_time in XSync) posted (#6) for review on master by Aravinda VK (avishwan)

Comment 7 Vijay Bellur 2015-11-03 20:23:31 UTC
REVIEW: http://review.gluster.org/11771 (geo-rep: Update last_synced_time in XSync) posted (#7) for review on master by Jeff Darcy (jdarcy)

Comment 8 Niels de Vos 2016-06-16 13:27:16 UTC
This bug is being closed because a release that should address the reported issue is now available. If the problem is still not fixed with glusterfs-3.8.0, please open a new bug report.

glusterfs-3.8.0 has been announced on the Gluster mailing lists [1], and packages for several distributions should become available in the near future. Keep an eye on the Gluster Users mailing list [2] and the update infrastructure for your distribution.

[1] http://blog.gluster.org/2016/06/glusterfs-3-8-released/
[2] http://thread.gmane.org/gmane.comp.file-systems.gluster.user

