Bug 2177220 - [CEE/sd][RGW-Multisite][Ceph Upgrade] RGW crashes with Segmentation fault on s3:copy_obj post RHCS 5.3.1 upgrade
Summary: [CEE/sd][RGW-Multisite][Ceph Upgrade] RGW crashes with Segmentation fault on ...
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Ceph Storage
Classification: Red Hat Storage
Component: RGW-Multisite
Version: 5.3
Hardware: x86_64
OS: Linux
urgent
urgent
Target Milestone: ---
: 5.3z2
Assignee: Mark Kogan
QA Contact: Vidushi Mishra
Akash Raj
URL:
Whiteboard:
Depends On:
Blocks: 2185621
TreeView+ depends on / blocked
 
Reported: 2023-03-10 13:06 UTC by Tridibesh Chakraborty
Modified: 2023-10-08 04:25 UTC (History)
16 users (show)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
.Segmentation fault no longer occurs in the Ceph Object gateway process Previously, a segmentation fault would occur in the Ceph Object Gateway process when an admin user performed the below operations: - Copying a non-existing object. - Copying an existing object over itself. With this fix, with admin or system privileges, you can initialize objects that were not initialized and the segmentation fault no longer occurs.
Clone Of:
Environment:
Last Closed: 2023-04-11 20:07:59 UTC
Embargoed:
aemerson: needinfo-
aemerson: needinfo-


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Ceph Project Bug Tracker 58035 0 None None None 2023-03-10 17:19:15 UTC
Red Hat Issue Tracker RHCEPH-6251 0 None None None 2023-03-10 13:10:05 UTC
Red Hat Knowledge Base (Solution) 7017201 0 None None None 2023-06-09 19:22:33 UTC
Red Hat Product Errata RHBA-2023:1732 0 None None None 2023-04-11 20:08:56 UTC

Description Tridibesh Chakraborty 2023-03-10 13:06:47 UTC
Description of problem:
Customer last night upgraded primary site of RGW multisite from RHCS 5.3 to RHCS 5.3.1 and they observe RGW crashes with Segmentation fault on s3:copy_obj 

Version-Release number of selected component (if applicable):
RHCS 5.3z1 (16.2.10-138.el8cp)

How reproducible:
Customer environment specific

Steps to Reproduce:
1. Have a RGW multisite running on version RHCS 5.3 with testfix
2. Upgrade primary site to 5.3.1
3. Enable the RGW sync
4. RGW daemon crashes on primary site due to segmentation fault on s3:copy_obj
5. If customer stops the secondary site, they are able to bring up the primary site RGW daemons and it is running for last 15 hours 

Actual results:
RGW daemons are crashing due to segmentation fault

Expected results:
Customer should be able to start the RGW daemons


Additional info:

Comment 52 errata-xmlrpc 2023-04-11 20:07:59 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Red Hat Ceph Storage 5.3 Bug Fix update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2023:1732

Comment 53 Manny 2023-06-09 19:21:51 UTC
Added a link to another impacted customer, SFDC #03530266

Also wrote KCS #7017201 for this issue, (https://access.redhat.com/solutions/7017201)

Best regards,
Manny

Comment 54 Red Hat Bugzilla 2023-10-08 04:25:04 UTC
The needinfo request[s] on this closed bug have been removed as they have been unresolved for 120 days


Note You need to log in before you can comment on or make changes to this bug.