Bug 1785080

Summary: reshard list contains entries of buckets that are not present
Product: [Red Hat Storage] Red Hat Ceph Storage
Reporter: Vidushi Mishra <vimishra>
Component: RGW
Assignee: J. Eric Ivancich <ivancich>
Status: CLOSED ERRATA
QA Contact: Uday kurundwade <ukurundw>
Severity: high
Docs Contact:
Priority: unspecified
Version: 4.0
CC: cbodley, ceph-eng-bugs, ceph-qe-bugs, ivancich, kbader, mbenjamin, sweil, tserlin, ukurundw
Target Milestone: rc
Keywords: Regression
Target Release: 4.0
Hardware: Unspecified
OS: Unspecified
Whiteboard:
Fixed In Version: ceph-14.2.4-106.el8cp, ceph-14.2.4-40.el7cp
Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of:
: 1833497
Environment:
Last Closed: 2020-01-31 12:48:25 UTC
Type: Bug
Regression: ---
Mount Type: ---
Documentation: ---
CRM:
Verified Versions:
Category: ---
oVirt Team: ---
RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: ---
Target Upstream Version:
Embargoed:
Bug Depends On:
Bug Blocks: 1833497, 1835777

Comment 4 Yaniv Kaul 2020-01-08 14:03:05 UTC
Can this be assigned and triaged?

Comment 8 J. Eric Ivancich 2020-01-13 15:49:40 UTC
First, although likely rare, it is not a bug for a bucket that does not exist to be on a reshard list. This can happen when the bucket is added to the reshard list and then removed before the reshard occurs.
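
As an illustration only, the sequence described above can be sketched with a toy in-memory model. This is not Ceph code: the names `buckets`, `reshard_queue`, `schedule_reshard`, `remove_bucket`, and `stale_entries` are hypothetical, and the real reshard list is stored by RGW rather than in a Python list.

```python
# Toy model of the race; every name here is hypothetical, not a Ceph API.
buckets = {"photos", "logs"}   # buckets that currently exist
reshard_queue = []             # pending reshard entries, by bucket name


def schedule_reshard(bucket):
    """Queue an existing bucket for resharding."""
    if bucket not in buckets:
        raise KeyError(f"no such bucket: {bucket}")
    reshard_queue.append(bucket)


def remove_bucket(bucket):
    """Delete a bucket without touching the reshard queue."""
    buckets.discard(bucket)


def stale_entries():
    """Return queue entries whose bucket no longer exists."""
    return [b for b in reshard_queue if b not in buckets]


schedule_reshard("photos")
schedule_reshard("logs")
remove_bucket("photos")        # removed before the reshard ran
```

After these calls, `stale_entries()` reports the "photos" entry, which is the kind of entry this bug report is about.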

However, two PRs addressed this issue in different ways, and both are missing from ceph-4.0-rhel-patches (ceph-14.2.4-104).

The two PRs are:

https://github.com/ceph/ceph/pull/31271
    backport to mimic still pending
    backport to nautilus merged 4 days ago

https://github.com/ceph/ceph/pull/31323
    backport to mimic still pending
    backport to nautilus merged 4 days ago

So we should cherry-pick them into ceph-4.0-rhel-patches.

Eric

Comment 9 J. Eric Ivancich 2020-01-13 16:39:05 UTC
I should also add the following.

The first of the two PRs -- https://github.com/ceph/ceph/pull/31271 -- allows the user to remove those entries. That did make it into 3.3 (ceph-12.2.12-81). So that does represent a regression.

The second PR -- https://github.com/ceph/ceph/pull/31323 -- auto-cleans those entries during reshard queue processing. That is not in 3.3, but would be nice to get into 4.0 as it removes the need for user/admin intervention.
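
To make the difference between the two approaches concrete, here is a minimal sketch under stated assumptions. It is not the code from either PR: `cancel_entry` and `process_queue` are hypothetical helpers, and the queue is modelled as a plain list of bucket names.

```python
# Hypothetical sketch of the two remedies; not the actual RGW implementation.

def cancel_entry(queue, bucket):
    """Manual remedy: drop a queue entry whether or not the bucket exists."""
    if bucket in queue:
        queue.remove(bucket)
        return True
    return False


def process_queue(queue, existing, reshard):
    """Automatic remedy: reshard live buckets and drop stale entries."""
    dropped = []
    for bucket in list(queue):   # iterate over a copy because queue is mutated
        if bucket in existing:
            reshard(bucket)
        else:
            dropped.append(bucket)   # bucket is gone; nothing to reshard
        queue.remove(bucket)
    return dropped


queue = ["photos", "logs", "tmp"]
existing = {"logs"}              # only "logs" still exists

cancel_entry(queue, "tmp")       # an admin removes one stale entry by hand
done = []
dropped = process_queue(queue, existing, done.append)
```

In this model the manual path needs someone to notice and cancel each stale entry, while the processing loop clears the remaining ones as a side effect of normal queue handling.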

Eric

Comment 10 J. Eric Ivancich 2020-01-13 16:50:04 UTC
Matt,

Requesting an OK to cherry-pick one or both of the fixes into ceph-4.0-rhel-patches. Details in the previous two comments (#8 and #9).

Eric

Comment 11 J. Eric Ivancich 2020-01-13 19:03:56 UTC
Thanks, Matt. I've cherry-picked them both.

Comment 18 errata-xmlrpc 2020-01-31 12:48:25 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2020:0312