Red Hat Bugzilla – Bug 45612
kernel-2.4.3-12 does not boot on AST P5/90
Last modified: 2005-10-31 17:00:50 EST
2.4.3-12 (errata) doesn't make it very far along in the boot process on my
P5/90. It crashes after printing maybe the first five or so lines of
kernel bootup, and hangs displaying a few screenfuls of what looks like a
protection fault. I don't have a capture of the error message, but the
error code is 0x0000000D.
Exact error will be attached shortly. Processor is a Pentium Classic,
90MHz, and it works fine with 2.4.2-2.
The error message was ...
CPU#0: Machine Check Exception: 0x 235EEC (type 0x D).
Correcting my earlier report, the kernel prints about half to two-thirds of a
screen of messages before blowing up (I assume the above fault could happen at
any point, and knowing what the kernel is trying to init would be useful? It
scrolls by way too fast to see.)
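For anyone comparing captures of this message across machines, the two hex fields can be pulled out mechanically. This is a minimal sketch, assuming the line has exactly the shape quoted above and that the spaces after "0x" are padding from a fixed-width field. The names `MCE_LINE` and `parse_mce_line` are made up for illustration.

```python
import re

# Pattern for the message as quoted in this report. The whitespace after
# "0x" is assumed to be field padding, so it is treated as optional.
MCE_LINE = re.compile(
    r"CPU#(?P<cpu>\d+): Machine Check Exception:\s*"
    r"0x\s*(?P<addr>[0-9A-Fa-f]+)\s*\(type 0x\s*(?P<type>[0-9A-Fa-f]+)\)"
)


def parse_mce_line(line):
    """Return (cpu, first_value, type_value) as integers, or None if no match."""
    m = MCE_LINE.search(line)
    if m is None:
        return None
    return (
        int(m.group("cpu")),
        int(m.group("addr"), 16),
        int(m.group("type"), 16),
    )
```

Applied to the line above, the type field comes out as 0xD, which agrees with the 0x0000000D error code given in the first report.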
A Machine Check Exception is a hardware fault...
By 'hardware fault' do you mean broken hardware or a triple fault in software?
This machine has never to my knowledge failed to boot any other linux kernel in
A machine check exception is raised when the board or processor decides
something bad might be happening. In some cases the machine will run fine
because the threshold on the check seems to be tighter than an actual crash
(which I guess makes sense). This can also include overheat/fan faults but in
your case the fault is
Arjan - I am curious why it then blew up rather than continuing. I would be very
interested to know what a kernel patched in
(remove the pushl $0)
does on this problem box.
Removing the push instruction causes the kernel to panic with a traceback that
happily fills the screen. "Aiee, killing interrupt handler!" seems to be the
favorite dying words for this particular kernel.
With 2.4.3-12 pristine, an actual panic occurs only occasionally. I compiled
with gcc 2.96-85.
Ok, so the fault not recovering is real. I suspect your CPU is absolutely
borderline if it wasn't showing faults before the machine check. You can
certainly disable the check (mcheck_init in arch/i386/kernel/bluesmoke.c) but I
am not sure that is wise given that it is an integrity check for the system
So, to recap and resolve this report, a CPU machine check was added sometime
circa 2.4.3, and it may crash on borderline machines that formerly booted
I also notice there's no explicit check for the AMD K6 in that code. Are such
processors treated as Pentium-compatible, or do they not support MCE at all?
The arch/i386/kernel/setup.c code only calls the mcheck init for processors it
knows about. Currently that is:
AMD Athlon/Duron (basically intel compatible)
VIA/Cyrix MIII / VIA C3 (limited features)
Intel (Pentium or higher)
Winchip/WinchipII/WinchipIII (limited features)
The older Cyrix processors and the K5/K6 apparently don't have the functionality
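The list above amounts to a lookup keyed on processor family, with everything unlisted skipped. The sketch below only models that list. It is not the kernel's code, and the table name, key strings, and the labels "enabled", "limited", and "skipped" are all hypothetical.

```python
# Hypothetical model of the list in this comment. Key strings and labels
# are invented for illustration and do not come from setup.c.
MCE_INIT_BY_FAMILY = {
    "AMD Athlon/Duron": "enabled",
    "VIA/Cyrix MIII / VIA C3": "limited",
    "Intel Pentium or higher": "enabled",
    "Winchip/WinchipII/WinchipIII": "limited",
}


def mce_init_level(family):
    """Return how the machine check init is treated for a family name."""
    # Anything not in the table (older Cyrix, K5/K6) gets no init call.
    return MCE_INIT_BY_FAMILY.get(family, "skipped")
```

Under this model the reporter's Pentium Classic falls under "Intel Pentium or higher", while a K6 would come back as "skipped".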
I recently picked up an IBM ThinkPad 755CX, an older model (1995):
Pentium-75, 40M RAM, 3.2G.
I also have a "Dock II" docking station with a built-in SCSI
controller, to which I added a 2G Fireball drive and a NEC 24X
SCSI CD-ROM.
I ran the on-board diags and the floppy-based diags; both came
back fine.
It works fine under DOS and W95. I installed W98SE to test it
(OK), then formatted back to DOS with CD support.
When I try to install RH7.1 (or Roswell-1) it
I tried both CDROM (via autoboot from dos), and a
It gets to the point of "running /sbin/loader",
then dies with a black screen:
"CPU#0: Machine Check Exception: 0x 1234 (type 0
This scrolls forever and you have to power off.
The Dock II has an Adaptec AIC-6360 in it. What is
the correct boot line to use it?
I have tried various combinations of "linux dd text
and "linux dd text aic6x60="0x340.11,7,1".
None of which worked.
It uses the "Adaptec 620/6360/6370" driver for DOS.
Both the 7.1 and Roswell CD sets are fine. ( it
same thing for RH 7.0, and RH 6.0)
I searched for info on the web and found out how to
set up the MWAVE and power management,
but could not find the install info.
NOTE: I tried Tom's rootdisk and it worked fine!
Toms rootdisk (1.7.218)
output of dmesg
Intel Pentium with F0 0F bug - workaround enabled.
alias mapping IDT readonly ... ... done
Linux version 2.0.37 (root@6M) (gcc version
#13 Fri Oct 15
scsi : 0 hosts.
scsi : detected total.
aha152x: BIOS test: passed, auto configuration: ok,
detected 1 controller(s)
aha152x0: vital data: PORTBASE=0x340, IRQ=11, SCSI
parity=enabled, synchronous=disabled, delay=100,
aha152x: trying software interrupt, ok.
scsi0 : Adaptec 152x SCSI driver; $Revision: 1.18 $
scsi : 1 host.
Vendor: QUANTUM Model: FIREBALL_TM2110S Rev:
Type: Direct-Access ANSI
SCSI revision: 02
Detected scsi disk sda at scsi0, channel 0, id 0,
Vendor: NEC Model: CD-ROM DRIVE:464 Rev:
Type: CD-ROM ANSI
SCSI revision: 02
Detected scsi CD-ROM sr0 at scsi0, channel 0, id 6,
SCSI device sda: hdwr sector= 512 bytes. Sectors=
4124736 [2014 MB] [2.0 GB]
scsi : 1 host.
contents of /etc/mtab
/dev/sda1 /mnt/vfat vfat rw 0 0
/dev/sr0 /mnt/cdrom iso9660 ro 0 0
Tom's found the controller, hard drive and cdrom.
I was able to mount and move files around. (
and vfat ).
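The dmesg output above shows the controller being driven by aha152x at PORTBASE=0x340, IRQ=11. The aha152x driver documentation describes its option as a comma-separated list under the name `aha152x=`, starting with I/O base and IRQ, followed by optional SCSI ID and reconnect fields. The attempt quoted earlier puts a period between 0x340 and 11. The sketch below builds the string in the documented shape. The helper name is made up, the values 7 and 1 are simply the ones from the quoted attempt, and whether the RH 7.1 installer passes this through to the driver is not confirmed here.

```python
def aha152x_option(io_base, irq, scsi_id=None, reconnect=None):
    """Build an 'aha152x=' option string from comma-separated fields."""
    fields = [hex(io_base), str(irq)]
    # Trailing fields are positional, so stop at the first one left out.
    for value in (scsi_id, reconnect):
        if value is None:
            break
        fields.append(str(value))
    return "aha152x=" + ",".join(fields)
```

With the values reported in this comment, `aha152x_option(0x340, 11, 7, 1)` gives a string that could be appended to the boot line in place of the `aic6x60=` form.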
So what do I have to do to get RH7+ onto this
Also, does anyone know what the video chipset is?
W98 just says 'Digital'.
It's an SVGA 1M 800x600x16bit TFT.
Chris Cloiber suggested 'linux nomce'; I will try it later.
This bug is obsoleted by bug 55097 and the errata 2.4.9-13 kernel.