Bug 408521
| Summary: | Kernel-2.6.23 hosing my Fedora ext3 / Partition | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Product: | [Fedora] Fedora | Reporter: | Declan Moriarty <declan.moriarty> | ||||||||||||
| Component: | kernel | Assignee: | Kernel Maintainer List <kernel-maint> | ||||||||||||
| Status: | CLOSED NOTABUG | QA Contact: | Fedora Extras Quality Assurance <extras-qa> | ||||||||||||
| Severity: | high | Docs Contact: | |||||||||||||
| Priority: | low | ||||||||||||||
| Version: | 7 | CC: | chris.brown | ||||||||||||
| Target Milestone: | --- | ||||||||||||||
| Target Release: | --- | ||||||||||||||
| Hardware: | i386 | ||||||||||||||
| OS: | Linux | ||||||||||||||
| Whiteboard: | |||||||||||||||
| Fixed In Version: | Doc Type: | Bug Fix | |||||||||||||
| Doc Text: | Story Points: | --- | |||||||||||||
| Clone Of: | Environment: | ||||||||||||||
| Last Closed: | 2008-01-17 16:33:38 UTC | Type: | --- | ||||||||||||
| Regression: | --- | Mount Type: | --- | ||||||||||||
| Documentation: | --- | CRM: | |||||||||||||
| Verified Versions: | Category: | --- | |||||||||||||
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||||||||||
| Cloudforms Team: | --- | Target Upstream Version: | |||||||||||||
| Embargoed: | |||||||||||||||
| Attachments: |
|
||||||||||||||
|
Description
Declan Moriarty
2007-12-03 10:22:59 UTC
Created attachment 275601 [details]
error from syslog
Hmmmph!
Nobody has even read this, by the look of it. Nobody cares that F7 is about as
reliable here as windoze. Here's "/usr/lib/libwine.so.1" from the offending
partition:
#!/bin/sh
for i in $HOME/.config/menus/applications-merged/*menu ; do
sed -i -e 's:<Name>wine-wine</Name>:<Name>Wine</Name>:g' \
-e ':<Directory>wine-wine</Directory>:<Directory>wine</Directory>:g' "$i"
done
Somebody tell me that's normal? Oh, I forgot - nobody's reading this stuff
/goes off to selectively reinstall - windows style :-(.
Can you post the entire dmesg from bootup until it errors? Created attachment 280811 [details]
Tree of unusual lost+found stuff
Created attachment 281301 [details]
Dmesg output
Good to hear from you. I am not able to reproduce the error, as I reinstalled,
and it now doesn't run, it limps along :-(. The tree output shows unusual &
complete files in lost+found. dmesg also attached, but no error today.
This is an ext3 filesystem problem affecting X programs on a new disk. Mozilla,
wine (minesweeper and Watchtower Library, both aok in wine) Openoffice, python
or yum and some init symlinks. It also picked up stray files from some
compiling going on on a separate partition. The zips there (mozilla bits
afaict) expand without error. A copy of the MIT license from python is readable
and perfect. There's a disproportionate amount of xml, xul, and java there.
Much of this stuff may have been read, but should _never_ have been written to.
I certainly didn't. But huge toolchain compiles were behaving faultlessly on a
different partition of the same hard disk. What I can't do is give you the
other missing ingredient on a plate. I will also attach the errors from
/var/log/messages with the context.
Created attachment 281341 [details]
grep -C10 frozen /var/log/messages > errors.out
the word 'frozen' is on the 1st line of error output. In this, when they are
less then 20 lines apart, the error is continuing. Watch UDMA speed.
Footnote to this: Nobody here paid any attention. It eventually got picked up through nabble.com by the kernel guys. If you have a similar bug, post it on bugzilla.kernel.org and reference this one. Hello, I'm reviewing this bug as part of the kernel bug triage project, an attempt to isolate current bugs in the Fedora kernel. http://fedoraproject.org/wiki/KernelBugTriage Thank you for filing the bug - I am CC'ing myself to this bug and will try and assist you in resolving it if I can. There hasn't been much activity on this for a while. Could you tell me if you are still having problems with the latest kernel? Does comment #7 indicate this has been resolved or is there any upstream bug filed at the kernel.org bugzilla which can be referenced? Comment #7 was the outcome of my discussions with Theodore Ts'o on the kernel list as he wants to run this down. The current situation is. I gather the via section of the PATA ide driver has bugs which Theodore wants to sort if he can. I failed to reproduce the problem after a reinstall, and Fedora 7 failed to reinstall properly. Tired of the deafening silence here, and such a dodgy system, I formatted and overwrote the F-7 partition with a Slackware 12.0. Slack and I are making friends. My comment #4 has a tree of the 41 Megs of stuff in lost+found. That is now your best information. This tree shows X apps were hammered (/usr/lib/libwine.so.1 became a script, /usr/lib/mozilla/ went awol altogether) while at the same time heavy compiling of an lfs-like distribution behaved faultlessly in a console. Mozilla, openoffice, evolution were always open, and wine a good bit of the time. Fedora had sda5 as /home, but that was untouched. This is odd considering ~/.wine & ~/.evolution were base dirs. Many other partitions were mounted, but only / got borked. The chipset is Via KT400-333 Mhx (=166 Mhz) which is a relative of the famous MPV3 southbridges with the hardware problem. Only read-only stuff got done in. Data was unharmed. I would get a disk crash, a reboot would follow, the system wouldn't mount the disk, and froze. I would boot into another system and run e2fsck -cvy or somesuch. Lost+found would fill up some more, and we'd be going again. Hello Declan, Okay, thanks for the update. Is the Slackware system exhibiting the same issues? Are you still able to test a Fedora system? I can assign this over to the PATA kernel maintainers but only if you are still able to test with a Fedora system - the latest release would be good. In the first instance they would need the following as separate text/plain attachments to this bug: # dmidecode lspci -vvxxx dmesg Cheers Chris Created attachment 291974 [details]
dmidecode, dmesg, and lspci outputs
dmidecode, lspci and dmesg outputs - attached (from slackware). As for your
questions:
No other system (I have a few) displays these symptoms. Fedora 7 before the
2.6.23.1 update didn't either. The old driver (hda) is fine, it was the new
one(sda). I can clear a partition and install any version of Fedora. Testing
for this is another matter, as I was simply living in the system, doing basic
stuff while compiling in a console (usually not an xterm). I now believe that
something in the few hundred megabytes of updates I pulled in with the
2.6.23.1 kernel made the system unstable, and that this, along with whatever
bugs exist in the new driver caused the loss of binaries. I sat here with the
system hosed for a week waiting for you guys to wake up, but it never happened.
If you have something automatic to test for such a problem it's worth doing.
Hello Declan, I appreciate its frustrating (and thank you for providing the requested information) but the fact is that the Fedora project is run by people who mostly volunteer their time (me included) and if you persist in being rude I'm happy to focus my time elsewhere. Now then, as few things to test: 1) It would be better to try with a Fedora 8 install or even an updated spin from the fedora unity project: http://spins.fedoraunity.org/spins as the latter provide critical updates to the original released Fedora 8 which resolve some installation issues. 2) http://fedoraproject.org/wiki/KernelCommonProblems This is a website which has numerous suggestions on how to resolve install and boot problems - I'm afraid there always will be some. 3) If an attempted install is still failing for you then try adding: libata.dma=0 to the boot options Let me know how you get on. Also in future, please attach information as I indicated as separate text/plain attachments to this bug. tgz files just create several more steps for us to access the debugging information. I apologise if I'm seen as rude, which was not my intention. I suppose I was trying to say that if notice had been taken of the bug report, I would have been able to avoid the reinstall, and this would have been easier. If nobody else is having this issue, I feel we should let it die, and close this bug. Time is not particularly on my side ATM. I have a peculiar early revision of the Via 82c686 Southbridge in a 2004 board. It has 6 usb ports (vs the usual 4) and two of these have log spam on them because they are non compliant. That resulted in a revision to ehci-hcd adding the ignore_oc=1 option to the module around 2.6.20. Via's response was to disable 2 of the ports quietly. The APIC is broken and every device gets the same 2 halfassed IRQs so nothing works unless I disable it on bootup. ACPI is dodgy at best. So it's pretty clear where the problem really is - MY HARDWARE. OTOH, if there's some issue you know is there and you just need a box like mine showing the symptoms, (Like the ehci-hcd maintainer did), I'm game. Whatever you want installed or tried I will do. In that case, the code was patched to read the vital registers into syslog, and I would boot and plug/unplug into usb. I added notes on what I was doing and sent it all off to the maintainer. I personally liked Fedora. I feel sure that I could install F-8 like I installed FC5 & F7 and they would work fine for some months. Then after some update, I'd be in a mess again, exactly as happened with F7 and FC5. The F7 one was spectacular - hence the bug report. Okay, as you feel your hardware is at issue I am closing NOTABUG. I would recommend checking for a BIOS upgrade if one is available and also testing what I suggested above. Cheers Chris |