Description of problem:

[RGW] Bucket/object deletion is leaving orphaned RADOS objects behind.

The workload DFG QA cluster that we are using for RHCS 4.1 release criteria
testing (https://bugzilla.redhat.com/show_bug.cgi?id=1824263) has accumulated
a large number of orphaned RADOS objects. The cluster ran a 10-hour COSBench
workload during the RHCS 4.0 to RHCS 4.1 upgrade and almost filled up. The
team decided to free space by deleting some objects from the COSBench client,
but that did not reclaim any space, so we then deleted all the buckets with
"radosgw-admin bucket rm", which removed every bucket. We also ran
"radosgw-admin gc process --include-all", which processed all pending GC
entries (the GC list is now empty), yet the cluster still has not reclaimed
the space. Listing the data pool with "rados ls" shows that the remaining
objects are all shadow objects.

Version-Release number of selected component (if applicable):

[root@f09-h17-b05-5039ms ~]# radosgw-admin gc process --include-all
[root@f09-h17-b05-5039ms ~]# radosgw-admin gc list --include-all
[]
[root@f09-h17-b05-5039ms ~]# ceph df
RAW STORAGE:
    CLASS     SIZE        AVAIL       USED        RAW USED     %RAW USED
    hdd       567 TiB     234 TiB     333 TiB      333 TiB         58.70
    TOTAL     567 TiB     234 TiB     333 TiB      333 TiB         58.70

POOLS:
    POOL                          ID      STORED      OBJECTS     USED        %USED     MAX AVAIL
    .rgw.root                     140     1.2 KiB           4     768 KiB         0        46 TiB
    default.rgw.control           141         0 B           8         0 B         0        46 TiB
    default.rgw.meta              142       578 B           5     768 KiB         0        46 TiB
    default.rgw.log               143     512 MiB         207     1.5 GiB         0        46 TiB
    default.rgw.index             144         0 B           0         0 B         0        46 TiB
    default.rgw.buckets.data      146     192 TiB      50.00M     289 TiB     67.48        93 TiB
    default.rgw.buckets.index     147         0 B           0         0 B         0        46 TiB

[root@f09-h17-b05-5039ms ~]# radosgw-admin bucket list
[]
[root@f09-h17-b05-5039ms ~]# radosgw-admin bucket stats
[]
[root@f09-h17-b05-5039ms ~]#
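As a reference point, Nautilus-era radosgw-admin ships an orphan-scanning
facility that cross-checks RADOS objects in a data pool against bucket index
entries. A minimal sketch follows (the pool name is taken from the "ceph df"
output above; the job id is arbitrary; this was not run on the cluster above):

# Scan the data pool for RADOS objects not referenced by any bucket index.
# On a ~50M-object pool this can take a long time.
radosgw-admin orphans find --pool=default.rgw.buckets.data --job-id=orphan-scan-1

# List known scan jobs and their state.
radosgw-admin orphans list-jobs

# Remove the scan's own intermediate data when done; this does not delete
# the orphans themselves.
radosgw-admin orphans finish --job-id=orphan-scan-1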
During the upgrade, the daemons on one OSD node, along with the mons and
mgrs, were upgraded:

[root@f09-h17-b05-5039ms ~]# ceph versions
{
    "mon": {
        "ceph version 14.2.8-50.el7cp (53387608e81e6aa2487c952a604db06faa5b2cd0) nautilus (stable)": 3
    },
    "mgr": {
        "ceph version 14.2.8-50.el7cp (53387608e81e6aa2487c952a604db06faa5b2cd0) nautilus (stable)": 3
    },
    "osd": {
        "ceph version 14.2.4-51.el7cp (db63624068590e593c47150c7574d08c1ec0d3e4) nautilus (stable)": 264,
        "ceph version 14.2.8-50.el7cp (53387608e81e6aa2487c952a604db06faa5b2cd0) nautilus (stable)": 24
    },
    "mds": {},
    "rgw": {
        "ceph version 14.2.4-51.el7cp (db63624068590e593c47150c7574d08c1ec0d3e4) nautilus (stable)": 11,
        "ceph version 14.2.8-50.el7cp (53387608e81e6aa2487c952a604db06faa5b2cd0) nautilus (stable)": 1
    },
    "overall": {
        "ceph version 14.2.4-51.el7cp (db63624068590e593c47150c7574d08c1ec0d3e4) nautilus (stable)": 275,
        "ceph version 14.2.8-50.el7cp (53387608e81e6aa2487c952a604db06faa5b2cd0) nautilus (stable)": 31
    }
}
We captured a listing of the RADOS data pool:

# rados -p default.rgw.buckets.data ls > rados.list.txt
# du -sh rados.list.txt
4.2G    rados.list.txt

[root@f09-h17-b05-5039ms ~]# cat rados.list.txt | wc -l
50004910
[root@f09-h17-b05-5039ms ~]# cat rados.list.txt | grep shadow | wc -l
50004910
[root@f09-h17-b05-5039ms ~]#

The above confirms that all ~50M objects remaining in the pool are shadow
objects.
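As an additional sanity check (a sketch only; this output was not captured on
the cluster), one can confirm that no bucket instance metadata survived the
bucket removals and inspect the shape of the leftover object names:

# Should print an empty list once every bucket has been removed with
# "radosgw-admin bucket rm".
radosgw-admin metadata list bucket.instance

# Shadow objects carry the originating bucket's marker as a name prefix,
# followed by "__shadow_..."; a few sample names show the pattern.
head -5 rados.list.txt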
Here's some preliminary analysis.

The number of orphans listed in /root/rados.list.txt is 50,004,910.

All of the orphans are "shadow" objects.

It looks like all of those objects came from 5 buckets. Here is the result of
lopping off everything from "_shadow" onwards, sorting what is left, and
running it through "uniq -c" (a pipeline sketch appears after this comment):

16978508 987371de-e3d9-45cf-b9b8-3c1a19cabd59.11841.1_
 3675110 987371de-e3d9-45cf-b9b8-3c1a19cabd59.11856.1_
 6175225 987371de-e3d9-45cf-b9b8-3c1a19cabd59.11862.1_
 6151689 987371de-e3d9-45cf-b9b8-3c1a19cabd59.21284.1_
17024378 987371de-e3d9-45cf-b9b8-3c1a19cabd59.21287.1_

In the narratives above, 5 buckets/containers are mentioned, and three are
mentioned by name -- mycontainers3, mycontainers5, and mycontainers6
(sometimes without the "s" -- mycontainer6).

Some questions...

1. How many buckets were there over the life of this cluster? If only 5, why
is there a bucket named "mycontainers6"?

2. Would it be fair to say that we do not know at this point whether this is
an issue with:
  a) 4.0,
  b) 4.1, or
  c) the upgrade from 4.0 to 4.1 while the workload is running?

If that's not fair, what above answers the question?

Eric
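For reference, a minimal sketch of the pipeline described in the comment
above, assuming GNU sed/sort/uniq and the rados.list.txt capture from earlier
in this thread:

# Drop everything from "_shadow" onward, leaving each object's bucket-marker
# prefix, then count the shadow objects per marker.
sed 's/_shadow.*//' rados.list.txt | sort | uniq -c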
(In reply to J. Eric Ivancich from comment #20)

Thanks Eric. Responses inline.

> Here's some preliminary analysis.
>
> The number of orphans listed in /root/rados.list.txt is 50,004,910.
>
> All of the orphans are "shadow" objects.
>
> It looks like all of those objects came from 5 buckets. Here is the result
> of lopping off everything from "_shadow" onwards, sorting what is left, and
> running it through "uniq -c":
>
> 16978508 987371de-e3d9-45cf-b9b8-3c1a19cabd59.11841.1_
>  3675110 987371de-e3d9-45cf-b9b8-3c1a19cabd59.11856.1_
>  6175225 987371de-e3d9-45cf-b9b8-3c1a19cabd59.11862.1_
>  6151689 987371de-e3d9-45cf-b9b8-3c1a19cabd59.21284.1_
> 17024378 987371de-e3d9-45cf-b9b8-3c1a19cabd59.21287.1_
>
> In the narratives above, 5 buckets/containers are mentioned, and three are
> mentioned by name -- mycontainers3, mycontainers5, and mycontainers6
> (sometimes without the "s" -- mycontainer6).
>
> Some questions...
>
> 1. How many buckets were there over the life of this cluster? If only 5,
> why is there a bucket named "mycontainers6"?

Yes -- mycontainers6 is from the RHCS 4.1 cluster, on which Rachana
reproduced the issue; the details are in comment#15. Everything before
comment#15 is from the RHCS 4.0 cluster, which had 5 containers,
mycontainers1 through mycontainers5.

> 2. Would it be fair to say that we do not know at this point whether this
> is an issue with:
>   a) 4.0,
>   b) 4.1, or
>   c) the upgrade from 4.0 to 4.1 while the workload is running?
>
> If that's not fair, what above answers the question?

It is reproducible on both RHCS 4.0 and RHCS 4.1; comment#15 covers the
RHCS 4.1 mycontainers6 bucket.

-- vikhyat
Thank you, Vikhyat, for clarifying.

I believe I've reproduced the issue on 4.1 with a much simpler test case
(i.e., no COSBench), which will allow me to trace what's going on. I'll keep
all of you posted.

Eric
(In reply to J. Eric Ivancich from comment #22)
> Thank you, Vikhyat, for clarifying.
>
> I believe I've reproduced the issue on 4.1 with a much simpler test case
> (i.e., no COSBench), which will allow me to trace what's going on. I'll
> keep all of you posted.
>
> Eric

Thank you, Eric.
I apologize. What I thought was a reproducer is not reproducing the issue.
Back to the drawing board....

Eric
Thank you, Thomas, for getting the build out so quickly!

Eric