Bug 2033581

Summary: [rados] osd crashing continuously after `submit_common error: Corruption: block checksum mismatch` in void BlueStore::_txc_apply_kv(BlueStore::TransContext*, bool)'
Product: [Red Hat Storage] Red Hat Ceph Storage Reporter: Vasishta <vashastr>
Component: RADOSAssignee: Adam Kupczyk <akupczyk>
Status: CLOSED INSUFFICIENT_DATA QA Contact: Pawan <pdhiran>
Severity: high Docs Contact:
Priority: unspecified    
Version: 5.1CC: akupczyk, amathuri, bhubbard, ceph-eng-bugs, choffman, ksirivad, lflores, nojha, pdhange, rfriedma, rzarzyns, skanta, sseshasa, vereddy, vumrao
Target Milestone: ---Flags: nojha: needinfo? (akupczyk)
Target Release: 7.0   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2023-07-06 17:53:44 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Vasishta 2021-12-17 10:57:53 UTC
Description of problem:
Tried upgrading a cluster from 4.x to 5.1
One of the OSD continously crashing (6016 and continuing)
Checked some of the crashes. All of them were in assert_func": "void BlueStore::_txc_apply_kv(BlueStore::TransContext*, bool)",

Version-Release number of selected component (if applicable):
16.2.6

How reproducible:
Tried once

Steps to Reproduce:
1. Configure 4.x cluster
2. Upgrade it to 5.1

Actual results:
OSD is crashing continuously

Expected results:
OSD should not crash or OSD should join back surviving crashes

Additional info: