Hide Forgot
Created attachment 486086 [details] cpuinfo file from bigi system Description of problem: The bigi testbed which runs SPECsfs NFS worload is receiving messages of the form ... Message from syslogd@bigi at Mar 16 17:17:21 ... kernel:Uhhuh. NMI received for unknown reason 31 on CPU 0. Message from syslogd@bigi at Mar 16 17:17:21 ... kernel:Do you have a strange power saving mode enabled? Message from syslogd@bigi at Mar 16 17:17:21 ... kernel:Dazed and confused, but trying to continue Message from syslogd@bigi at Mar 16 17:17:21 ... kernel:Uhhuh. NMI received for unknown reason 00 on CPU 1. Message from syslogd@bigi at Mar 16 17:17:21 ... kernel:Do you have a strange power saving mode enabled? Message from syslogd@bigi at Mar 16 17:17:21 ... kernel:Dazed and confused, but trying to continue ... in bursts with the 2.6.32-122 kernel. Tests with the -118 kernel show no problem. The cpuinfo for this server, an HP DL580g2 (Intel) has been attached. I have tried disabling hyper threads at the BIOS but we still get errors. Disabling nmi_watchdog in /proc/sys/kernel makes them go away. I have talked to dzickus about this ... Version-Release number of selected component (if applicable): 2.6.32-122 How reproducible: everytime Steps to Reproduce: 1. run specsfs on bigi testbed for sure 2. 3. Actual results: Expected results: Additional info:
bmarson, we need more logs than what you have above. P.
Barry, I thought you were the first one to notice this yesterday, turns out you are already the third bz filed for it. :-) I hate p4 boxes. *** This bug has been marked as a duplicate of bug 688547 ***
I had see kernel:Uhhuh. NMI received for unknown reason 31 on CPU 0. kernel:Uhhuh. NMI received for unknown reason 21 on CPU 0. kernel:Do you have a strange power saving mode enabled? kernel:Dazed and confused, but trying to continue I had try : add ‘nmi_watchdog=0 pcie_aspm=off nohpet’ to kernel param change a older kernel Result: Use a older kernel 2.6.32-131.21.1 (default is 2.6.32-358.23.2)。