Bug 2142148

Summary: [cee][ceph-osd] Bluestore errors and corruption on many OSDs after OSP/RHCS minor update
Product: Red Hat OpenStack Reporter: Steve Baldwin <sbaldwin>
Component: cephAssignee: Giulio Fidente <gfidente>
Status: CLOSED CURRENTRELEASE QA Contact: Yogev Rabl <yrabl>
Severity: urgent Docs Contact:
Priority: unspecified    
Version: 13.0 (Queens)CC: akupczyk, amathuri, bhubbard, bhull, ceph-eng-bugs, cephqe-warriors, choffman, eolivare, fpantano, gfidente, gjose, jdurgin, johfulto, jpretori, kelwhite, ksirivad, lema, lflores, lhh, mcaldeir, mhicks, nojha, pdhange, peli, pgrist, rcernin, rfriedma, rzarzyns, skanta, sseshasa, tkajinam, tpetr, vcojot, vkoul, vumrao
Target Milestone: ---   
Target Release: ---   
Hardware: x86_64   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2023-04-24 12:06:43 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Comment 7 lema 2022-11-12 11:00:48 UTC
Hi, Adam,

Are you available to check, all logs in the case now. 

3 - Last left two options we have

- Add more disks manually then do rebalancing,
- Also, think to bring up the most OSD ( host PG down OSD)!

https://access.redhat.com/solutions/4539671 is for replacing do we have more to add manually?

Cheers
Ray

Comment 62 kelwhite 2023-01-02 16:14:48 UTC
Any update on this BZ? We haven't had any feedback from c#46 almost 1.5 months ago.

Comment 63 Manny 2023-01-02 16:44:27 UTC
Hello RHCS Engineering,

We are looking for an update this week. I am the new case owner.

Best regards,
Manny

Comment 65 Giulio Fidente 2023-01-09 13:46:55 UTC
I belive comment #46 confirms for the workaround to be effective; hence the KCS [1] seems good.

1. https://access.redhat.com/solutions/6987369

Comment 75 Red Hat Bugzilla 2023-09-19 04:29:54 UTC
The needinfo request[s] on this closed bug have been removed as they have been unresolved for 120 days