Bug 158424
Summary: | slab corruption during x86_64 install using Firewire/USB disks | ||||||
---|---|---|---|---|---|---|---|
Product: | [Fedora] Fedora | Reporter: | Alexandre Oliva <oliva> | ||||
Component: | kernel | Assignee: | Dave Jones <davej> | ||||
Status: | CLOSED RAWHIDE | QA Contact: | Brian Brock <bbrock> | ||||
Severity: | high | Docs Contact: | |||||
Priority: | medium | ||||||
Version: | rawhide | CC: | pfrields, wtogami | ||||
Target Milestone: | --- | ||||||
Target Release: | --- | ||||||
Hardware: | x86_64 | ||||||
OS: | Linux | ||||||
Whiteboard: | |||||||
Fixed In Version: | Doc Type: | Bug Fix | |||||
Doc Text: | Story Points: | --- | |||||
Clone Of: | Environment: | ||||||
Last Closed: | 2005-05-30 21:37:18 UTC | Type: | --- | ||||
Regression: | --- | Mount Type: | --- | ||||
Documentation: | --- | CRM: | |||||
Verified Versions: | Category: | --- | |||||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||
Cloudforms Team: | --- | Target Upstream Version: | |||||
Embargoed: | |||||||
Bug Depends On: | |||||||
Bug Blocks: | 136450 | ||||||
Attachments: |
|
Description
Alexandre Oliva
2005-05-22 05:30:25 UTC
Unsurprisingly. the errors also show up post-install, with kernel 1.1337_FC4 as well. is it possible you can try and repeat this with just firewire, or just usb, so we can narrow the scope a little ? Bug #158468 indicates firewire screwing up x86_64 install too. I've done a new install today with Firewire only, and still got the same errors. Unfortunately, USB doesn't seem to be very stable under very high I/O loads, I get constant freezes while rsyncing isos for new test trees primed from my local mirror of rawhide if I have one of the raid 1 members on Firewire and the other on USB. If everything is on Firewire, it works fine. Except for the slab corruption, that is. I haven't even tried both on USB, but I could if you think that would help. Ugh, 1.1355_FC4 just won't boot if my disks are connected to firewire, printing far too many oops/panic messages to fit an 80x60 screen before coming to a complete halt :-( I suspect this is not a new bug, just a side effect of turning slab debugging off. But this sucks, just before the release :-( It means Firewire-only people will need custom install disks again. And, to make matters worse, I'm leaving on a trip in the next few minutes, and won't be back before Sat evening. Oh well ;-( Created attachment 114959 [details]
fix slab corruption error
This patch makes sure we allocate at least one unsigned long for hostdata.
Without it, we write past the memory block allocated by scsi_host_alloc,
triggering slab corruption detection. It seems like this has been broken
forever, and I can't figure out how come it didn't run into a problem before,
but I've verified that this fixes it on 2.6.11-1.1353_FC4. I'm yet to build
1.1363_FC4 with the fix to see whether the init crashes at sbp2 load time are
gone. The info I'm getting from the stack trace doesn't make it obvious
whether it's related.
It was the same bug, after all. I'm now running 1.1363_FC4 with the patch above, and it boots and works just fine. Please oh please add it to FC4 final. It's very narrow in scope, can't possibly break anything that isn't already broken and will fix a memory corruption error. This will be davej and Sopwith's decision. Last week I heard davej say "final" fc4 kernel a few times so it may already be too late. Accepted into dist-fc4 Fix confirmed in FC4-re0530.1 (kernel-2.6.11-1.1366_FC4). |