Bug 2131932 - [RGW-MS] RGW multisite sync is stuck after 4.3z1 to 5.3 upgrade [NEEDINFO]
Summary: [RGW-MS] RGW multisite sync is stuck after 4.3z1 to 5.3 upgrade
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Ceph Storage
Classification: Red Hat Storage
Component: RGW-Multisite
Version: 5.3
Hardware: Unspecified
OS: Unspecified
Priority: unspecified
Severity: urgent
Target Milestone: ---
Target Release: 5.3
Assignee: Matt Benjamin (redhat)
QA Contact: Hemanth Sai
Docs Contact: Akash Raj
URL:
Whiteboard:
Depends On:
Blocks: 2126049
Reported: 2022-10-04 07:12 UTC by Ameena Suhani S H
Modified: 2023-01-11 17:42 UTC

Fixed In Version: ceph-16.2.10-92.el8cp
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2023-01-11 17:41:26 UTC
Embargoed:
akraj: needinfo? (mbenjamin)


Links
System | ID | Private | Priority | Status | Summary | Last Updated
Red Hat Issue Tracker | RHCEPH-5407 | 0 | None | None | None | 2022-10-05 21:20:39 UTC
Red Hat Product Errata | RHSA-2023:0076 | 0 | None | None | None | 2023-01-11 17:42:18 UTC

Description Ameena Suhani S H 2022-10-04 07:12:06 UTC
Description of problem:
RGW multisite sync is stuck after 4.3z1 to 5.3 upgrade

Version-Release number of selected component (if applicable):
16.2.10-48.el8cp

How reproducible:
1/1

Steps to Reproduce:
Install 4.3z1
Fill: fill the cluster to 7-10% full
Measure: 1 hr hybrid workload
Upgrade: 10 hr hybrid workload, with the cluster upgrade to 5.3 running in parallel
Measure post-upgrade: 1 hr hybrid workload before enabling resharding
Measure with resharding: enable resharding and start a 1 hr hybrid workload immediately after enabling
Aging: 2 hr hybrid workload
Measure: 1 hr hybrid workload
48-72 hrs with no I/O


Actual results:
Data is still not synced around 12 hours after the last run completed

Expected results:
Data should be synced as soon as the workload completes
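
One way to check whether a zone has caught up is to scan the text printed by `radosgw-admin sync status`. The sketch below is not taken from this bug report. It assumes the output contains lines such as "data is caught up with source" when sync is complete and "behind on N shards" or "recovering" when it is not. The helper names (`run_sync_status`, `lagging_lines`, `is_caught_up`) are hypothetical, and the exact wording of the output may differ between releases.

```python
import re
import subprocess
from typing import List


def run_sync_status() -> str:
    """Return the text printed by `radosgw-admin sync status` on this node."""
    result = subprocess.run(
        ["radosgw-admin", "sync", "status"],
        check=True,
        capture_output=True,
        text=True,
    )
    return result.stdout


def lagging_lines(status_text: str) -> List[str]:
    """Return the lines that report shards behind or recovering.

    Assumption: lagging sync is reported with the words "behind on"
    or "recovering" somewhere on the line.
    """
    pattern = re.compile(r"\b(behind on|recovering)\b")
    return [
        line.strip()
        for line in status_text.splitlines()
        if pattern.search(line)
    ]


def is_caught_up(status_text: str) -> bool:
    """True if data sync reports caught up and no line reports lag."""
    caught_up = "data is caught up with source" in status_text
    return caught_up and not lagging_lines(status_text)
```

Calling `is_caught_up(run_sync_status())` on a node of each site would give a rough yes/no answer. In a stuck case like the one reported here, `lagging_lines` would be expected to list the shards that remain behind.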

Additional info:

COSBench URL:
Site1: http://dell-r630-002.dsal.lab.eng.rdu2.redhat.com:19088/controller/index.html
Site2: http://dell-r640-072.dsal.lab.eng.tlv2.redhat.com:19088/controller/index.html

For example, the buckets "bucketpri1" and "bucketsec1" do not have consistent objects across sites.
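
The inconsistency described above can be checked by comparing per-bucket object counts from both sites. The following is a minimal sketch, assuming that `radosgw-admin bucket stats --bucket=<name>` prints JSON with a `usage` section whose entries (for example `rgw.main`) carry a `num_objects` field. The helper names (`bucket_stats`, `object_counts`, `count_mismatches`) are hypothetical and are not part of this bug report.

```python
import json
import subprocess
from typing import Dict, Tuple


def bucket_stats(bucket: str) -> dict:
    """Run `radosgw-admin bucket stats` on this node and parse the JSON."""
    result = subprocess.run(
        ["radosgw-admin", "bucket", "stats", f"--bucket={bucket}"],
        check=True,
        capture_output=True,
        text=True,
    )
    return json.loads(result.stdout)


def object_counts(stats: dict) -> Dict[str, int]:
    """Map each usage category (assumed layout) to its object count."""
    usage = stats.get("usage", {})
    return {
        category: entry.get("num_objects", 0)
        for category, entry in usage.items()
    }


def count_mismatches(site1: dict, site2: dict) -> Dict[str, Tuple[int, int]]:
    """Return categories whose object counts differ, as (site1, site2)."""
    a, b = object_counts(site1), object_counts(site2)
    return {
        category: (a.get(category, 0), b.get(category, 0))
        for category in sorted(set(a) | set(b))
        if a.get(category, 0) != b.get(category, 0)
    }
```

Because `bucket_stats` only sees the local zone, it would be run once on a node of each site, and the two results passed to `count_mismatches`. An empty result means the counts agree. Equal counts do not prove the objects are identical, so this is only a first-pass check.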

Comment 30 errata-xmlrpc 2023-01-11 17:41:26 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Moderate: Red Hat Ceph Storage 5.3 security update and Bug Fix), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2023:0076

