Bug 2322498 - Coredumps are not getting auto-generated when osd crash is reported
Summary: Coredumps are not getting auto-generated when osd crash is reported
Keywords:
Status: NEW
Alias: None
Product: Red Hat OpenShift Data Foundation
Classification: Red Hat Storage
Component: ceph
Version: 4.17
Hardware: Unspecified
OS: Unspecified
unspecified
unspecified
Target Milestone: ---
: ---
Assignee: Radoslaw Zarzynski
QA Contact: Elad
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2024-10-29 16:40 UTC by Pratik Surve
Modified: 2024-10-29 16:42 UTC (History)
5 users (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed:
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Red Hat Issue Tracker OCSBZM-9453 0 None None None 2024-10-29 16:42:55 UTC

Description Pratik Surve 2024-10-29 16:40:35 UTC
Description of problem (please be detailed as possible and provide log
snippests):

Coredumps are not getting auto-generated when osd crash is reported

Version of all relevant components (if applicable):

OCP version:- 4.17.0-0.nightly-2024-10-20-231827
ODF version:- 4.17.0-126
CEPH version:- ceph version 18.2.1-229.el9cp (ef652b206f2487adfc86613646a4cac946f6b4e0) reef (stable)

Does this issue impact your ability to continue to work with the product
(please explain in detail what is the user impact)?


Is there any workaround available to the best of your knowledge?


Rate from 1 - 5 the complexity of the scenario you performed that caused this
bug (1 - very simple, 5 - very complex)?


Can this issue reproducible?
yes

Can this issue reproduce from the UI?


If this is a regression, please provide more details to justify this:


Steps to Reproduce:
1.Deploy ODF cluster
2.debug into node and kill ceph-osd process using kill -11 <pid>
3.check for coredump list


Actual results:
[root@compute-1 coredump]# coredumpctl list
No coredumps found.

Expected results:
coredump should be generated 


Additional info:

When checked /var/lib/rook/openshift-storage/crash/posted/ we can see the logs here


bash-5.1$ ceph crash ls
ID                                                                ENTITY  NEW
2024-10-29T14:05:58.473757Z_53a7e5f3-e20f-4556-8025-d8d9cc0a8ee0  mon.f    *
2024-10-29T14:06:49.937796Z_851de819-c85b-40e9-a965-631ac9f34259  osd.2    *
2024-10-29T14:07:01.269548Z_9278fe64-da00-481c-a549-5246c9e02ff7  osd.2    *
2024-10-29T14:49:42.131778Z_08adcb38-7223-4336-9618-ccab02a828c6  osd.2    *
2024-10-29T15:25:14.199019Z_4f4e828d-2e96-4ccb-ae31-a4478687dd46  osd.2    *
2024-10-29T16:14:13.763145Z_ad5bc691-1788-4ff6-8637-5a5fa949c344  osd.1    *
2024-10-29T16:33:20.357187Z_ef062472-8703-4e68-bff4-49823b067efb  osd.2    *
2024-10-29T16:33:31.070116Z_b19acc18-f270-496b-be70-78282c938f4d  mon.f    *


Note You need to log in before you can comment on or make changes to this bug.