Note: This bug is displayed in read-only format because the product is no longer active in Red Hat Bugzilla.
This project is now read‑only. Starting Monday, February 2, please use https://ibm-ceph.atlassian.net/ for all bug tracking management.

Bug 1501374

Summary: Possible infinite loop in journal:get_tag_list class method will result in OSD suicide timeout
Product: [Red Hat Storage] Red Hat Ceph Storage Reporter: Jason Dillaman <jdillama>
Component: RBD-MirrorAssignee: Jason Dillaman <jdillama>
Status: CLOSED ERRATA QA Contact: Parikshith <pbyregow>
Severity: medium Docs Contact:
Priority: medium    
Version: 3.0CC: ceph-eng-bugs, ceph-qe-bugs, hnallurv, kdreyer, mhackett, vakulkar, vashastr
Target Milestone: rc   
Target Release: 3.0   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: RHEL: ceph-12.2.1-32.el7cp Ubuntu: ceph_12.2.1-34redhat1 Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2017-12-05 23:47:58 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Jason Dillaman 2017-10-12 13:15:00 UTC
Description of problem:
There is a possible infinite loop within journal:tag_list class method. If a peer cluster is offline and not replicating journal entries from the primary image and the primary image's exclusive lock is acquired at least 64 times during this outage, when the rbd-mirror in the peer cluster attempts to replay the image, it will cause an infinite loop in the primary cluster's OSD.

Version-Release number of selected component (if applicable):
12.2.x

How reproducible:
100%

Steps to Reproduce:
1. Mirror an image to a secondary cluster
2. Stop the rbd-mirror daemon
3. Run 'rbd bench-write' for a few bytes more than 64 times
4. Restart rbd-mirror daemon

Actual results:
The 'rbd bench-write' or 'rbd-mirror' daemon will cause the OSD to enter an infinite loop.

Expected results:
The OSD does not enter an infinite loop.

Additional info:

Comment 22 errata-xmlrpc 2017-12-05 23:47:58 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2017:3387