Note: This bug is displayed in read-only format because the product is no longer active in Red Hat Bugzilla.
This project is now read‑only. Starting Monday, February 2, please use https://ibm-ceph.atlassian.net/ for all bug tracking management.

Bug 1925344

Summary: Bucket index entries and object names with trailing NUL character on re-uploaded multiparts
Product: [Red Hat Storage] Red Hat Ceph Storage Reporter: Mike Hackett <mhackett>
Component: RGWAssignee: Matt Benjamin (redhat) <mbenjamin>
Status: CLOSED ERRATA QA Contact: Tejas <tchandra>
Severity: medium Docs Contact:
Priority: unspecified    
Version: 3.3CC: cbodley, ceph-eng-bugs, gsitlani, ivancich, jdurgin, kbader, kdreyer, mbenjamin, nojha, rzarzyns, sweil
Target Milestone: ---   
Target Release: 5.0   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: ceph-16.1.0-486.el8cp Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of:
: 1929905 (view as bug list) Environment:
Last Closed: 2021-08-30 08:28:17 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Mike Hackett 2021-02-04 22:27:14 UTC
Description of problem:
## Issue
In object storage with radosgw, accumulating large quantities of objects in the data pool that appear to be orphans. Since this is a large cluster(2PB), customer cannot execute the command "radosgw-admin orphans find". It is taking VERY long time.

## Subscription Details
- As per the SKU --SVCRC12E4000, customer having ceph subscription.

## Environment 
* ceph version 12.2.12 (1436006594665279fe734b4c15d7e08c13ebd777) luminous (stable)


## Root cause
* The issues with the `orphans find` and `bucket check --fix` issues have the same root cause ie reuploaded multiparts.
* The `orphan find` command removes (or is supposed to remove) the leaked reuploaded parts from the bucket data pool, and the `bucket check --fix` command should remove the leaked reuploaded parts from the bucket index.
* Both sets of functionality are required to remediate the issues caused by user reuploading parts in a multipart upload.

## Summary / Tracker /PR
http://tracker.ceph.com/issues/38454
https://tracker.ceph.com/issues/44722
https://tracker.ceph.com/issues/49018

https://github.com/ceph/ceph/pull/34148
https://github.com/ceph/ceph/pull/35005 
https://github.com/ceph/ceph/pull/37260
https://github.com/ceph/ceph/pull/38905

Version-Release number of selected component (if applicable):
3.3

Related tool request: https://bugzilla.redhat.com/show_bug.cgi?id=1925342

Comment 6 errata-xmlrpc 2021-08-30 08:28:17 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Red Hat Ceph Storage 5.0 bug fix and enhancement), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2021:3294