Note: This bug is displayed in read-only format because the product is no longer active in Red Hat Bugzilla.
This project is now read‑only. Starting Monday, February 2, please use https://ibm-ceph.atlassian.net/ for all bug tracking management.

Bug 1365034

Summary: [rbd-mirror] - failed to commit journal event
Product: [Red Hat Storage] Red Hat Ceph Storage Reporter: Hemanth Kumar <hyelloji>
Component: RBDAssignee: Jason Dillaman <jdillama>
Status: CLOSED ERRATA QA Contact: Parikshith <pbyregow>
Severity: medium Docs Contact:
Priority: medium    
Version: 2.0CC: bniver, ceph-eng-bugs, flucifre, hnallurv, jdillama, kdreyer, nlevine, racpatel
Target Milestone: rc   
Target Release: 3.0   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: RHEL: ceph-12.1.2-1.el7cp Ubuntu: ceph_12.1.2-2redhat1xenial Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2017-12-05 23:31:14 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Comment 2 Jason Dillaman 2016-08-08 14:18:31 UTC
The errors on the "remote" side are actually just errors recorded within the image journals. You had some event on the primary side (unrelated to mirroring) where you witnessed the following errors:

2016-08-05 08:02:00.903776 7fb903fff700 -1 librbd::SnapshotProtectRequest: image must support layering
2016-08-05 08:02:00.903786 7fb903fff700 -1 librbd::SnapshotProtectRequest: encountered error: (38) Function not implemented
2016-08-05 08:02:01.004965 7fcecafcdd80 -1 librbd: parent image must support layering
2016-08-05 08:02:01.053367 7fe6a7e46d80 -1 librbd: rbd image dataset2 already exists
2016-08-05 08:04:28.369797 7ff3fa7fc700 -1 librbd::SnapshotProtectRequest: image must support layering
2016-08-05 08:04:28.369808 7ff3fa7fc700 -1 librbd::SnapshotProtectRequest: encountered error: (38) Function not implemented
2016-08-05 08:04:28.454584 7f1207490d80 -1 librbd: parent image must support layering
2016-08-05 08:04:28.502443 7fc272e36d80 -1 librbd: rbd image dataset3 already exists
2016-08-05 08:06:59.738275 7f0289ffb700 -1 librbd::SnapshotProtectRequest: image must support layering
2016-08-05 08:06:59.738287 7f0289ffb700 -1 librbd::SnapshotProtectRequest: encountered error: (38) Function not implemented
2016-08-05 08:06:59.820645 7fa1e352bd80 -1 librbd: parent image must support layering
2016-08-05 08:06:59.869198 7f9799d11d80 -1 librbd: rbd image dataset4 already exists
2016-08-05 08:09:20.475211 7f28e1ffb700 -1 librbd::SnapshotProtectRequest: image must support layering
2016-08-05 08:09:20.475222 7f28e1ffb700 -1 librbd::SnapshotProtectRequest: encountered error: (38) Function not implemented

Therefore, you have images where you attempted to protect snapshot that didn't support layering. This is unrelated to IPv4 / IPv6 and instead is just a split-brain caused by an unhandled error code.

Comment 3 Neil Levine 2016-08-08 18:38:20 UTC
Not clear on what the customer impact here is?

Comment 4 Jason Dillaman 2016-08-08 19:47:21 UTC
An illegal operation (attempting to protect a snapshot against an image that doesn't support that feature), resulted in a split-brain for the affected image. The customer would need to request a full resync of the affected image to get mirroring started again for the image.

In general, if a maintenance operation fails (except in a few white listed cases), we need to treat the result as a potential delta between the primary and non-primary image. This is an example of a case that should be white listed.

Comment 5 Neil Levine 2016-08-08 20:28:41 UTC
Is this a release note for 2.0 and a re-assign to 2.1?

Comment 8 Jason Dillaman 2017-01-04 21:17:49 UTC
This fix is included in Ceph v10.2.6

Comment 10 Jason Dillaman 2017-02-12 17:42:38 UTC
*** Bug 1421311 has been marked as a duplicate of this bug. ***

Comment 17 errata-xmlrpc 2017-12-05 23:31:14 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2017:3387