Bug 2120624

Summary: don't leave an incomplete primary snapshot if the peer who is handling snapshot creation dies
Product: [Red Hat Storage] Red Hat Ceph Storage Reporter: Ilya Dryomov <idryomov>
Component: RBD-Mirror Assignee: Prasanna Kumar Kalever <prasanna.kalever>
Status: CLOSED ERRATA QA Contact: Vasishta <vashastr>
Severity: urgent Docs Contact: Akash Raj <akraj>
Priority: unspecified    
Version: 5.0 CC: akraj, amagrawa, bniver, ceph-eng-bugs, cephqe-warriors, flucifre, idryomov, jdurgin, jespy, kramdoss, kseeger, mmuench, mmurthy, muagarwa, ocs-bugs, olakra, owasserm, prasanna.kalever, prsurve, sagrawal, sostapov, srangana, tserlin, uchapaga, vashastr
Target Milestone: ---   
Target Release: 6.1   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: ceph-17.2.6-26.el9cp Doc Type: Bug Fix
Doc Text:
.Snapshot mirroring no longer halts permanently
Previously, if a primary snapshot creation request was forwarded to the rbd-mirror daemon and the daemon was terminated before marking the snapshot as complete, the primary snapshot would remain permanently incomplete. This occurred because, on retrying the primary snapshot creation request, `librbd` would notice that such a snapshot already existed but would not check whether this "pre-existing" snapshot was complete. As a result, snapshot mirroring was permanently halted. With this fix, the next mirror snapshot creation, including one triggered by a scheduler, checks for incomplete snapshots and deletes them so that mirroring can resume.
Story Points: ---
Clone Of: 2067095
: 2121514 Environment:
Last Closed: 2023-06-15 09:15:36 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 2067095, 2121514, 2192813    
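
The behaviour described in the Doc Text can be illustrated with a small in-memory model. This is only a sketch, not the actual `librbd` or rbd-mirror code: the `MirrorSnapshot` and `Image` classes, the `crash_before_complete` flag, and both function names are hypothetical and exist purely to show the idea of purging incomplete primary snapshots before creating the next one.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class MirrorSnapshot:
    """Hypothetical stand-in for a primary mirror snapshot."""
    snap_id: int
    complete: bool = False


@dataclass
class Image:
    """Hypothetical stand-in for an RBD image and its mirror snapshots."""
    snapshots: List[MirrorSnapshot] = field(default_factory=list)
    next_id: int = 1


def remove_incomplete_snapshots(image: Image) -> List[int]:
    """Delete snapshots never marked complete; return their ids."""
    removed = [s.snap_id for s in image.snapshots if not s.complete]
    image.snapshots = [s for s in image.snapshots if s.complete]
    return removed


def create_primary_snapshot(image: Image,
                            crash_before_complete: bool = False) -> MirrorSnapshot:
    """Create a primary snapshot, first cleaning up any incomplete leftovers.

    crash_before_complete simulates the handling peer dying after the
    snapshot exists but before it is marked complete.
    """
    # Assumed shape of the fix: check for and drop incomplete snapshots
    # as part of every new creation request (manual or scheduled).
    remove_incomplete_snapshots(image)

    snap = MirrorSnapshot(snap_id=image.next_id)
    image.next_id += 1
    image.snapshots.append(snap)

    if crash_before_complete:
        return snap  # left incomplete, as in the original bug scenario

    snap.complete = True
    return snap


if __name__ == "__main__":
    img = Image()
    create_primary_snapshot(img, crash_before_complete=True)
    create_primary_snapshot(img)  # e.g. the next scheduled snapshot
    print([(s.snap_id, s.complete) for s in img.snapshots])
```

In this model the second call removes the leftover incomplete snapshot before creating a new one, so only complete snapshots remain and mirroring is not blocked by the stale entry.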

Comment 24 Scott Ostapovicz 2023-02-06 16:55:32 UTC
Missed the 5.3 z1 window. Moving to 6.1. Please advise if this is a problem.

Comment 38 errata-xmlrpc 2023-06-15 09:15:36 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Moderate: Red Hat Ceph Storage 6.1 security and bug fix update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2023:3623