Bug 2093106

Summary: [RHCS 6] dups.size logging + COT dups trim command + online dups trimming fix
Product: [Red Hat Storage] Red Hat Ceph Storage Reporter: Vikhyat Umrao <vumrao>
Component: RADOSAssignee: Radoslaw Zarzynski <rzarzyns>
Status: CLOSED ERRATA QA Contact: Pawan <pdhiran>
Severity: high Docs Contact: Eliska <ekristov>
Priority: unspecified    
Version: 5.0CC: akupczyk, amathuri, bhubbard, ceph-eng-bugs, choffman, ekristov, ksirivad, lflores, nmordech, nojha, pdhange, pdhiran, rfriedma, rzarzyns, sseshasa, tserlin, vereddy, vumrao
Target Milestone: ---   
Target Release: 6.0   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: ceph-17.2.3-23.el9cp Doc Type: Bug Fix
Doc Text:
.Corrupted dups entries of a PG Log can be removed by off-line and on-line trimming Previously, trimming of PG log dups entries could be prevented during the low-level PG split operation, which is used by the PG autoscaler with far higher frequency than by a human operator. Stalling the trimming of dups resulted in significant memory growth of PG log, leading to OSD crashes as it ran out of memory. Restarting an OSD did not solve the problem as the PG log is stored on disk and reloaded to RAM on startup. With this fix, both off-line, using the `ceph-objectstore-tool` command, and on-line, within OSD, trimming can remove corrupted dups entries of a PG log that jammed the on-line trimming machinery and were responsible for the memory growth. A debug improvement is implemented that prints the number of dups entries to the OSD's log to help future investigations.
Story Points: ---
Clone Of: 2093031 Environment:
Last Closed: 2023-03-20 18:56:39 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 2126050    

Comment 1 Vikhyat Umrao 2022-06-02 23:01:27 UTC
Quincy backport:

https://tracker.ceph.com/issues/55632
https://github.com/ceph/ceph/pull/46251

Comment 30 errata-xmlrpc 2023-03-20 18:56:39 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Red Hat Ceph Storage 6.0 Bug Fix update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2023:1360