Red Hat Bugzilla – Bug 501441
Kernel will not boot (kernel panic)
Last modified: 2009-05-22 07:39:44 EDT
Description of problem:
When working on Bug #499999, I tried the patch kernel "kernel-2.6.18-148.el5.bz499999test.i686.rpm" as posted by Michal Schmidt.
The kernel would not boot on my Compaq CQ60-206US (nForce MCP78S chipset), I was then asked to make a new bug report by Michal Schmidt.
Version-Release number of selected component (if applicable):
Steps to Reproduce:
1. Start laptop
kernel panic referencing cpu_freq_governor
Kernel loads and system boots
CPU: Athlon X2 64 Bit
Mem: 2 GB
Chipset: nVidia MCP78S
Drive Type: Sata
Here are the results I found (as posted at https://bugzilla.redhat.com/show_bug.cgi?id=499999):
I just installed and tried the i686 rpm, cannot get this kernel to boot. I get
kernel panic at CPU Scaling immediately after Checking for new hardware line.
The kernel panic mentions cpu_freq_governor, cpu_set_policy, etc... in the call
Kernel panic - not syncing: Fatal exception
I tried "pci=nomsi" (for my sata drives) and "nomce"
(since I have a Compaq).
OLD WORKING KERNEL:
On the running kernel, which I am using now, I have to have both on the kernel
line or my laptop will not boot. I see this kernel panic without the
"pci=nomsi" on kernel line with running kernel. I get the "Machine Check
Exception" if I do not have "nomce".
NEW TEST KERNEL:
I tried them seperately then together and then without any. The system either
hangs with no error or gives kernel panic but always at the same place in boot.
Almost seems that the "pci=nomsi" option is not taking as I get the same error
on old kernel without it.
To clarify, the "old working kernel" above is "2.6.18-128.1.10.el5" and it runs fine as long as I have "pci=nomsi" and "nomce" on the boot line.
Only the test patch kernel is failing to boot.
Can you try booting your machine with 'nmi_watchdog=2' on the kernel command line and remove the 'quiet' command from that line. Hopefully the kernel will panic after a while and the stack will match the one in bugzilla 501178.
Yes, it does panic with stack trace, but they are completely different.
The stack in bug #501178 mentions clock ticks and scheduler. Mine has many references to cpu_freq_governor, cpu_freq_<this>, cpu_freq_<that> etc...
How can I grab all that in a file so that I can copy and paste after reboot?
You can't really, unless you have a serial console port attached to your laptop.
Ok, your problem still maybe similar. Prarit is working on a creating an rpm with a patch that we will like you to try. Hopefully he will have something in the next couple of hours.
Created attachment 344822 [details]
RHEL5 RPM with fix from BZ 501178
Please test with this RPM. It only contains a fix for this issue.
(In reply to comment #3)
> Yes, it does panic with stack trace, but they are completely different.
> The stack in bug #501178 mentions clock ticks and scheduler. Mine has many
> references to cpu_freq_governor, cpu_freq_<this>, cpu_freq_<that> etc...
> How can I grab all that in a file so that I can copy and paste after reboot?
... boot with "vga=791".
Pull out your cellphone. Take a picture. Attach it to this BZ.
Actually I did take a picture with my cel the first time. Unfortunately, my cel has no way to upload to computer (does not have USB port) and I do not have a data plan either so I cannot email them either :(
(In reply to comment #8)
> Actually I did take a picture with my cel the first time. Unfortunately, my cel
> has no way to upload to computer (does not have USB port) and I do not have a
> data plan either so I cannot email them either :(
:) Okay, then can you at least type it out so we can see where the panic is?
I have a digital camera now that has a usb cable. Problem is that as I said before, you can not be sure when it will hang up (always right after "Checking for hardware changes") or when it will decide to give the kernel panic messaage.
I have rebooted 4 or 5 times already and every boot into that kernel so far tonight has just hung the system. I will keep on this until I get the panic again I guess.
I will post the image back later (as soon as I can get picture).
Created attachment 345051 [details]
This is the kernel panic I get when booting.
I get this kernel panic randomly. Usually with this kernel it just hangs after "Checking for hardware changes". Once in a while (with no change to kernel options) it gives this kernel panic message.
In bug #499999 I tried a new kernel (kernel-2.6.18-150.el5) and this kernel boots properly.
(In reply to comment #12)
> In bug #499999 I tried a new kernel (kernel-2.6.18-150.el5) and this kernel
> boots properly.
Okay -- CLOSED as NOTABUG.
Just for the record. Prarit investigated a similar problem on a laptop in our office. It was determined the patches from bz 297731 were causing the problems. In -150.el5 those patches were reverted which is probably why your issue was fixed.