Bug 633196
Summary: | testing NMI watchdog ... <4>WARNING: CPU#0: NMI appears to be stuck (62->62)! | ||
---|---|---|---|
Product: | Red Hat Enterprise Linux 5 | Reporter: | Eryu Guan <eguan> |
Component: | kernel | Assignee: | Don Zickus <dzickus> |
Status: | CLOSED ERRATA | QA Contact: | Han Pingtian <phan> |
Severity: | medium | Docs Contact: | |
Priority: | low | ||
Version: | 5.5.z | CC: | jhunt, jwilson, pvn, qcai |
Target Milestone: | rc | ||
Target Release: | --- | ||
Hardware: | All | ||
OS: | Linux | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | Bug Fix | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2011-07-21 10:25:05 UTC | Type: | --- |
Description
Eryu Guan
2010-09-13 08:11:29 UTC
System with the same issues. I get the following when the server boots up:

AMD Opteron(tm) Processor 6174 stepping 01
Brought up 24 CPUs
testing NMI watchdog ... <4>WARNING: CPU#0: NMI appears to be stuck (177->177)!
time.c: Using 14.318180 MHz WALL HPET GTOD HPET/TSC timer.
time.c: Detected 2200.011 MHz processor.

Information:
RHEL 5.4, kernel 2.6.18-164.el5
HP ProLiant BL465c G7

This may be due to the BIOS using the same performance counters that the NMI watchdog uses. HP has suggested disabling some monitoring so that the NMI watchdog can work. [This only affects AMD G7s AFAIK]

(when the BIOS loads during a restart)
- Press "F9" during POST to go into RBSU
- Hit "control-a"
- You will then see a new "service options" menu
- Go into it, and disable the following:
  1) memory pre-failure notification
  2) processor power utilization monitoring

If this works, I will dup this bug over to another bug I am working on to address this issue.

Cheers,
Don

This request was evaluated by Red Hat Product Management for inclusion in a Red Hat Enterprise Linux maintenance release. Product Management has requested further review of this request by Red Hat Engineering, for potential inclusion in a Red Hat Enterprise Linux Update release for currently deployed products. This request is not yet committed for inclusion in an Update release.

Patch(es) available in kernel-2.6.18-252.el5
You can download this test kernel (or newer) from http://people.redhat.com/jwilson/el5
Detailed testing feedback is always welcomed.

Verified with 2.6.18-256.el5PAE.

An advisory has been issued which should help the problem described in this bug report. This report is therefore being closed with a resolution of ERRATA. For more information on the solution and/or where to find the updated files, please follow the link below. You may reopen this bug report if the solution does not work for you.

http://rhn.redhat.com/errata/RHSA-2011-1065.html
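
For anyone who wants to check a running system after trying the BIOS change or the updated kernel: the boot-time warning quoted above prints a CPU's NMI count before and after a short wait (177->177 means it did not move). A rough userspace equivalent is to sample the `NMI:` line of `/proc/interrupts` twice and compare. The sketch below is only an illustration, not what the kernel or the erratum does. The helper names (`parse_nmi_counts`, `stuck_cpus`, `sample_live`), the 10-second interval and the `min_delta` threshold are my own assumptions. With a performance-counter based watchdog an idle CPU may tick slowly, so treat a "stuck" result as a hint rather than proof.

```python
import time


def parse_nmi_counts(interrupts_text):
    """Return the per-CPU NMI counts found in the text of /proc/interrupts.

    Looks for the line whose first field is "NMI:" and collects the numeric
    fields that follow it (newer kernels append a text description, which
    is skipped). Returns an empty list if there is no NMI line.
    """
    for line in interrupts_text.splitlines():
        fields = line.split()
        if fields and fields[0] == "NMI:":
            counts = []
            for token in fields[1:]:
                if not token.isdigit():
                    break  # reached the trailing description, if any
                counts.append(int(token))
            return counts
    return []


def stuck_cpus(before, after, min_delta=1):
    """Return indexes of CPUs whose NMI count grew by less than min_delta.

    min_delta is an arbitrary threshold chosen for this sketch.
    """
    return [
        cpu
        for cpu, (old, new) in enumerate(zip(before, after))
        if new - old < min_delta
    ]


def sample_live(interval=10, path="/proc/interrupts"):
    """Sample the live system twice, `interval` seconds apart."""
    with open(path) as f:
        before = parse_nmi_counts(f.read())
    time.sleep(interval)
    with open(path) as f:
        after = parse_nmi_counts(f.read())
    return stuck_cpus(before, after)
```

Calling `sample_live()` on the affected machine returns the list of CPU indexes whose count did not advance during the interval. An empty list suggests the watchdog NMIs are being delivered.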