Bug 1489060 - backport prune past_intervals capability to 2.y
Summary: backport prune past_intervals capability to 2.y
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Ceph Storage
Classification: Red Hat Storage
Component: RADOS
Version: 2.4
Hardware: All
OS: All
high
high
Target Milestone: rc
: 2.5
Assignee: Josh Durgin
QA Contact: Manohar Murthy
Bara Ancincova
URL:
Whiteboard:
Depends On:
Blocks: 1536401
TreeView+ depends on / blocked
 
Reported: 2017-09-06 15:54 UTC by Sage Weil
Modified: 2020-12-14 09:54 UTC (History)
10 users (show)

Fixed In Version: RHEL: ceph-10.2.10-1.el7cp Ubuntu: ceph_10.2.10-2redhat1xenial
Doc Type: Enhancement
Doc Text:
.The `osd_hack_prune_past_interval` option is now supported The `osd_hack_prune_past_interval` option helps to reduce memory usage for the past intervals entries, which can help with recovery of unhealthy clusters. WARNING: This option can cause data loss, therefore, use it only when instructed by the Red Hat Support Engineers.
Clone Of:
Environment:
Last Closed: 2018-02-21 19:43:32 UTC
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Github ceph ceph pull 17351 0 None None None 2017-09-06 16:18:23 UTC
Red Hat Product Errata RHBA-2018:0340 0 normal SHIPPED_LIVE Red Hat Ceph Storage 2.5 bug fix and enhancement update 2018-02-22 00:50:32 UTC

Description Sage Weil 2017-09-06 15:54:32 UTC
Description of problem:

The past_intervals structure can get very big, consuming memory and ultimately making recovery difficult.

How reproducible:

It has happened several times with customers with unhealthy clusters.

Steps to Reproduce:
1. make cluster unhealthy
2. thrash osds
3. osd memory requirements increase, eventually beyond what the host has available


Now in upstream jewel:
 https://github.com/ceph/ceph/pull/17351

Backport that patch to downstream 2.y.

Note that luminous (and thus 3.y) does not have this problem.

Comment 3 Ken Dreyer (Red Hat) 2018-01-02 21:29:54 UTC
Fix is in Ceph v10.2.10 upstream

Comment 12 errata-xmlrpc 2018-02-21 19:43:32 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2018:0340


Note You need to log in before you can comment on or make changes to this bug.