Bug 1501374 - Possible infinite loop in journal:get_tag_list class method will result in OSD suicide timeout
Summary: Possible infinite loop in journal:get_tag_list class method will result in OS...
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Ceph Storage
Classification: Red Hat Storage
Component: RBD-Mirror
Version: 3.0
Hardware: Unspecified
OS: Unspecified
medium
medium
Target Milestone: rc
: 3.0
Assignee: Jason Dillaman
QA Contact: Parikshith
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2017-10-12 13:15 UTC by Jason Dillaman
Modified: 2017-12-05 23:47 UTC (History)
7 users (show)

Fixed In Version: RHEL: ceph-12.2.1-32.el7cp Ubuntu: ceph_12.2.1-34redhat1
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2017-12-05 23:47:58 UTC
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Ceph Project Bug Tracker 21771 0 None None None 2017-10-12 13:14:59 UTC
Ceph Project Bug Tracker 21956 0 None None None 2017-10-27 13:17:09 UTC
Red Hat Product Errata RHBA-2017:3387 0 normal SHIPPED_LIVE Red Hat Ceph Storage 3.0 bug fix and enhancement update 2017-12-06 03:03:45 UTC

Description Jason Dillaman 2017-10-12 13:15:00 UTC
Description of problem:
There is a possible infinite loop within journal:tag_list class method. If a peer cluster is offline and not replicating journal entries from the primary image and the primary image's exclusive lock is acquired at least 64 times during this outage, when the rbd-mirror in the peer cluster attempts to replay the image, it will cause an infinite loop in the primary cluster's OSD.

Version-Release number of selected component (if applicable):
12.2.x

How reproducible:
100%

Steps to Reproduce:
1. Mirror an image to a secondary cluster
2. Stop the rbd-mirror daemon
3. Run 'rbd bench-write' for a few bytes more than 64 times
4. Restart rbd-mirror daemon

Actual results:
The 'rbd bench-write' or 'rbd-mirror' daemon will cause the OSD to enter an infinite loop.

Expected results:
The OSD does not enter an infinite loop.

Additional info:

Comment 22 errata-xmlrpc 2017-12-05 23:47:58 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2017:3387


Note You need to log in before you can comment on or make changes to this bug.