Bug 2223380
| Summary: | [IBM Z] ODF deployed on IBM Z with DASD ( OSD CLBO failed to load OSD map for epoch ) | ||
|---|---|---|---|
| Product: | [Red Hat Storage] Red Hat OpenShift Data Foundation | Reporter: | khover |
| Component: | ceph | Assignee: | Radoslaw Zarzynski <rzarzyns> |
| ceph sub component: | RADOS | QA Contact: | Elad <ebenahar> |
| Status: | CLOSED NOTABUG | Docs Contact: | |
| Severity: | high | ||
| Priority: | high | CC: | bniver, glaw, hnallurv, jquinn, mhackett, muagarwa, nojha, ocs-bugs, odf-bz-bot, rzarzyns, sostapov, tstober |
| Version: | 4.12 | Flags: | khover: needinfo? (tstober) khover: needinfo? (rzarzyns) |
| Target Milestone: | --- | ||
| Target Release: | --- | ||
| Hardware: | Unspecified | ||
| OS: | Unspecified | ||
| Whiteboard: | |||
| Fixed In Version: | Doc Type: | If docs needed, set a value | |
| Doc Text: | Story Points: | --- | |
| Clone Of: | Environment: | ||
| Last Closed: | 2023-07-31 11:53:22 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | Category: | --- | |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | |||
|
Description
khover
2023-07-17 14:05:04 UTC
Must-gather uploaded to supportshell, /cases/03562792:

0010-odf-mustgather-c2nwedi01.tgz
namespaces/openshift-storage/pods/rook-ceph-osd-0-754b566b47-d578b/osd/osd/logs/previous.log
namespaces/openshift-storage/pods/rook-ceph-osd-0-754b566b47-d578b/osd/osd/logs/current.log

These are two different clusters belonging to the same customer. The error in BZ 2222728 is:

- inferring bluefs devices from bluestore path
2023-07-12T09:21:44.004+0000 3ff8ab68800 -1 rocksdb: Corruption: SST file is ahead of WALs

Adam may have added that HINT: to the wrong BZ.

==========================================================

The question from the customer remains:

The customer is asking whether there is a way to target the exact area on disk where the bad CRC occurred, based on:

2023-07-17T08:06:34.180481051Z debug -6> 2023-07-17T08:06:34.146+0000 3ff8926a500 -1 bluestore(/var/lib/ceph/osd/ceph-0) _verify_csum bad crc32c/0x1000 checksum at blob offset 0x1000, got 0x6706be76, expected 0x7b34f36b, device location [0x471000~1000], logical extent 0x1000~1000, object #-1:3b30e826:::osdmap.14:0#
2023-07-17T08:06:34.180481051Z debug -5> 2023-07-17T08:06:34.146+0000 3ff8926a500 -1 bluestore(/var/lib/ceph/osd/ceph-0) _verify_csum bad crc32c/0x1000 checksum at blob offset 0x1000, got 0x6706be76, expected 0x7b34f36b, device location [0x471000~1000], logical extent 0x1000~1000, object #-1:3b30e826:::osdmap.14:0#
2023-07-17T08:06:34.180507921Z debug -4> 2023-07-17T08:06:34.146+0000 3ff8926a500 -1 bluestore(/var/lib/ceph/osd/ceph-0) _verify_csum bad crc32c/0x1000 checksum at blob offset 0x1000, got 0x6706be76, expected 0x7b34f36b, device location [0x471000~1000], logical extent 0x1000~1000, object #-1:3b30e826:::osdmap.14:0#
2023-07-17T08:06:34.180507921Z debug -3> 2023-07-17T08:06:34.146+0000 3ff8926a500 -1 bluestore(/var/lib/ceph/osd/ceph-0) _verify_csum bad crc32c/0x1000 checksum at blob offset 0x1000, got 0x6706be76, expected 0x7b34f36b, device location [0x471000~1000], logical extent 0x1000~1000, object #-1:3b30e826:::osdmap.14:0#
2023-07-17T08:06:34.180507921Z debug -2> 2023-07-17T08:06:34.146+0000 3ff8926a500 -1 osd.0 0 failed to load OSD map for epoch 14, got 0 bytes

Or is there additional info we could collect to achieve this? |
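As a starting point for the customer's question, the `_verify_csum` message already carries the location fields. The following is a minimal sketch, not an official Ceph tool, that parses such a line and converts the hexadecimal `offset~length` extents to decimal bytes. The names `CSUM_RE`, `parse_csum_error`, and `SAMPLE` are hypothetical, and the sketch assumes that "device location" is a byte offset into the OSD's BlueStore block device; how that maps onto the underlying DASD partition would still need to be confirmed by the Ceph engineers.

```python
import re

# Matches BlueStore's "_verify_csum bad ..." message in the form quoted in this report.
# All numeric fields are hexadecimal; extents are printed as offset~length.
CSUM_RE = re.compile(
    r"_verify_csum bad (?P<algo>\w+)/0x(?P<csum_block>[0-9a-f]+) "
    r"checksum at blob offset 0x(?P<blob_off>[0-9a-f]+), "
    r"got 0x(?P<got>[0-9a-f]+), expected 0x(?P<expected>[0-9a-f]+), "
    r"device location \[0x(?P<dev_off>[0-9a-f]+)~(?P<dev_len>[0-9a-f]+)\], "
    r"logical extent 0x(?P<log_off>[0-9a-f]+)~(?P<log_len>[0-9a-f]+), "
    r"object (?P<obj>\S+)"
)


def parse_csum_error(line):
    """Return the location fields of a _verify_csum line, or None if it does not match."""
    m = CSUM_RE.search(line)
    if m is None:
        return None
    csum_block = int(m.group("csum_block"), 16)
    dev_off = int(m.group("dev_off"), 16)
    return {
        "algorithm": m.group("algo"),
        "csum_block_size": csum_block,
        "device_offset": dev_off,                        # bytes (assumed: into the block device)
        "device_length": int(m.group("dev_len"), 16),    # bytes
        "first_block": dev_off // csum_block,            # index in checksum-block units
        "logical_offset": int(m.group("log_off"), 16),
        "logical_length": int(m.group("log_len"), 16),
        "got": m.group("got"),
        "expected": m.group("expected"),
        "object": m.group("obj"),
    }


# One of the log lines from the description above.
SAMPLE = (
    "2023-07-17T08:06:34.146+0000 3ff8926a500 -1 bluestore(/var/lib/ceph/osd/ceph-0) "
    "_verify_csum bad crc32c/0x1000 checksum at blob offset 0x1000, got 0x6706be76, "
    "expected 0x7b34f36b, device location [0x471000~1000], logical extent 0x1000~1000, "
    "object #-1:3b30e826:::osdmap.14:0#"
)

if __name__ == "__main__":
    info = parse_csum_error(SAMPLE)
    for key, value in info.items():
        print(f"{key}: {value}")
```

The sketch only decodes the log message; it does not read the device. With the decoded byte offset and length, the affected range could then be inspected with whatever read-only tooling is appropriate for the environment.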