Bug 66143
Summary: | System hang after 5-12 h IO stress - flushtlb problem? | ||||||||
---|---|---|---|---|---|---|---|---|---|
Product: | [Retired] Red Hat Linux | Reporter: | Martin Wilck <martin.wilck> | ||||||
Component: | kernel | Assignee: | Arjan van de Ven <arjanv> | ||||||
Status: | CLOSED ERRATA | QA Contact: | Brian Brock <bbrock> | ||||||
Severity: | high | Docs Contact: | |||||||
Priority: | medium | ||||||||
Version: | 7.3 | CC: | nbock, paulw | ||||||
Target Milestone: | --- | ||||||||
Target Release: | --- | ||||||||
Hardware: | i386 | ||||||||
OS: | Linux | ||||||||
Whiteboard: | |||||||||
Fixed In Version: | Doc Type: | Bug Fix | |||||||
Doc Text: | Story Points: | --- | |||||||
Clone Of: | Environment: | ||||||||
Last Closed: | 2002-06-11 16:03:28 UTC | Type: | --- | ||||||
Regression: | --- | Mount Type: | --- | ||||||
Documentation: | --- | CRM: | |||||||
Verified Versions: | Category: | --- | |||||||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||||
Cloudforms Team: | --- | Target Upstream Version: | |||||||
Embargoed: | |||||||||
Attachments: |
|
Description
Martin Wilck
2002-06-05 12:20:42 UTC
That patch only affects SMP Pentium IV systems. Is the system in question also a pentium IV one ? Yes, it is a System GC LE chipset an Dual Prestonia. Yes, it is a System GC LE chipset an Dual Prestonia. In that case the yesterday released 2.4.9-34 kernel should fix this for 7.1/7.2; for 7.3 a fix is in the works. So you think that this is related to the PGE handling you mention in the advisory? In any case I have looked at the new kernel source - it is missing two small Patches that were included in the 7.3 2.4.18 series. Both are very important for our newer machines. I'll atttach them here although they are not directly related to the problem itself. Arjan, please have a look at them. Concerning the original problem - we'll test the new kernel and see what happens. Martin Created attachment 59844 [details]
One-line patch for ServerWorks CSB5 IDE DMA
Created attachment 59845 [details]
4-line patch that fixes DMA address calculation >4GB (important!!)
Yes the PGE fix the the problem Sunil found. As for the patches; I've added the 4Gb one to the tree in case we ever do a 2.4.9 erratum again. I hope you do because that is really a nasty one if you have >4GB machines (we had an Adaptec SCSI controller happily DMA'ing to and from the kernel core memory). Meanwhile, we'll advise our >4GB customers to upgrade to the 7.3 kernel. The CSB5-Patch may seem ridiculous - however, IDE load may cause such a heavy interrupt load that timer and local APIC interrupts don't get through, causing the LOC interupt counts to differ heavily, and (if the same CPU servers timer and IDE IRQs) cause system time to go awry. I am taking back what I said about the CSB5 patch. It should *not* be applied, and probably even reverted in 2.4.18, until bug 66054 is resolved. p4 bug is fixed in the 2.4.18-5 kernel |