Bug 1348940

Summary: Restart of RBD daemon is again initiating full Sync/Copy of an Image
Product: [Red Hat Storage] Red Hat Ceph Storage Reporter: Tanay Ganguly <tganguly>
Component: RBD Assignee: Jason Dillaman <jdillama>
Status: CLOSED ERRATA QA Contact: Rachana Patel <racpatel>
Severity: high Docs Contact: Bara Ancincova <bancinco>
Priority: high    
Version: 2.0 CC: ceph-eng-bugs, flucifre, hnallurv, jdillama, kdreyer, kurs, uboppana
Target Milestone: rc   
Target Release: 2.1   
Hardware: x86_64   
OS: Linux   
Whiteboard:
Fixed In Version: RHEL: ceph-10.2.3-2.el7cp Ubuntu: ceph_10.2.3-3redhat1xenial Doc Type: Bug Fix
Doc Text:
.Image synchronization no longer starts from the beginning after restarting `rbd-mirror`
When the `rbd-mirror` daemon was restarted during image synchronization, the synchronization started from the beginning. With this update, the sync point object number is updated periodically during the synchronization. As a result, the image synchronization no longer starts from the beginning after restarting `rbd-mirror`.
Story Points: ---
Clone Of: Environment:
Last Closed: 2016-11-22 19:27:00 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 1322504, 1383917    

Description Tanay Ganguly 2016-06-22 11:07:42 UTC
Description of problem:
With respect to bug https://bugzilla.redhat.com/show_bug.cgi?id=1348928: after the RBD mirror daemon is restarted, the image sync starts again from the beginning.

Version-Release number of selected component (if applicable):
10.2.2-5.el7cp.x86_64

Steps to Reproduce:
1. Restart the RBD mirror daemon while an image is still syncing. The sync starts again from the beginning.

A 100 GB image was 70% synced, which took around 4.30 hours, when I hit bug 1348928.
The resync then started again from the beginning, i.e. 0%.

bigimage1:
  global_id:   1ebacfa8-fa2b-4c0a-8d38-56cbbee90507
  state:       up+syncing
  description: bootstrapping, IMAGE_COPY/COPY_OBJECT 21%
  last_update: 2016-06-22 10:51:30
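
To compare progress before and after a daemon restart, the percentage can be pulled out of a status block like the one above. The following is a minimal sketch, assuming the block has exactly this "key: value" layout; parse_status and the progress_percent key are hypothetical names introduced here, not part of any Ceph tool.

```python
import re

# Status text copied from this report.
STATUS = """\
bigimage1:
  global_id:   1ebacfa8-fa2b-4c0a-8d38-56cbbee90507
  state:       up+syncing
  description: bootstrapping, IMAGE_COPY/COPY_OBJECT 21%
  last_update: 2016-06-22 10:51:30
"""


def parse_status(text):
    """Parse one image's status block into a dict (hypothetical helper)."""
    lines = [ln for ln in text.splitlines() if ln.strip()]
    # The first line is the image name followed by a colon.
    info = {"name": lines[0].strip().rstrip(":")}
    for ln in lines[1:]:
        # Split on the first colon only, so timestamps stay intact.
        key, _, value = ln.strip().partition(":")
        info[key.strip()] = value.strip()
    # The copy progress, when present, is the percentage in the description.
    match = re.search(r"(\d+)%", info.get("description", ""))
    info["progress_percent"] = int(match.group(1)) if match else None
    return info
```

Calling parse_status on a status captured before the restart and again afterwards, then comparing the two progress_percent values, would show whether the sync resumed or went back to 0%.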

Actual results:
The sync starts again from the beginning.

Expected results:
Copying/resync should resume from where it left off.
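
The expected behaviour matches the Doc Text for this bug, which says the sync point object number is updated periodically during synchronization. The following is a rough illustration of that idea only, not the rbd-mirror implementation: copy_objects, the checkpoint dict, update_interval and fail_at are all hypothetical names, and a plain dict stands in for whatever persistent state the daemon actually uses.

```python
def copy_objects(source, dest, checkpoint, update_interval=4, fail_at=None):
    """Copy source[i] into dest[i], resuming from the saved object number.

    checkpoint: dict standing in for persistent sync-point state (assumption).
    update_interval: record progress after every N copied objects.
    fail_at: simulate a daemon restart just before copying this object.
    Returns the object number the copy started from.
    """
    start = checkpoint.get("object_number", 0)
    for i in range(start, len(source)):
        if fail_at is not None and i == fail_at:
            raise RuntimeError("simulated daemon restart")
        dest[i] = source[i]
        # Periodically record how far the copy has progressed.
        if (i + 1) % update_interval == 0:
            checkpoint["object_number"] = i + 1
    # Record completion so a later run has nothing left to copy.
    checkpoint["object_number"] = len(source)
    return start
```

With 10 objects, update_interval=4 and a simulated restart at object 7, the saved object number is 4, so the second run recopies objects 4 to 6 and then finishes, instead of starting again at object 0. A smaller interval loses less work on restart at the cost of more frequent checkpoint updates.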

Additional info:

Comment 1 Ken Dreyer (Red Hat) 2016-06-22 16:39:31 UTC
Brett/Jason to make recommendations for the customer here...

Comment 4 Jason Dillaman 2016-08-10 19:13:40 UTC
Upstream pull request: https://github.com/ceph/ceph/pull/9699

Comment 11 Rachana Patel 2016-10-21 01:12:21 UTC
Verified with 10.2.3-8.el7cp.x86_64.
Working as expected, hence moving to Verified.

Comment 15 errata-xmlrpc 2016-11-22 19:27:00 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://rhn.redhat.com/errata/RHSA-2016-2815.html