Bug 16748 - CPQ Proliant 1600 freezes after ~7 days
CPQ Proliant 1600 freezes after ~7 days
Status: CLOSED NOTABUG
Product: Red Hat Linux
Classification: Retired
Component: kernel (Show other bugs)
6.2
i386 Linux
high Severity high
: ---
: ---
Assigned To: Alan Cox
:
Depends On:
Blocks:
  Show dependency treegraph
 
Reported: 2000-08-22 16:21 EDT by James Ringland
Modified: 2008-05-01 11:37 EDT (History)
1 user (show)

See Also:
Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of:
Environment:
Last Closed: 2002-12-14 20:20:11 EST
Type: ---
Regression: ---
Mount Type: ---
Documentation: ---
CRM:
Verified Versions:
Category: ---
oVirt Team: ---
RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: ---


Attachments (Terms of Use)

  None (edit)
Description James Ringland 2000-08-22 16:21:01 EDT
CPQ1600R will completely freeze after several days of operation. System 
temp. always within norms, Internal Rack Temp 70 deg. maintained. Two 
seperate systems have been tried, both 1600R's but of two different 
varieties the new 1" UltraWide model and the older 1.6 Ultra2 model. The 
system console will be black and register as "Powered" on the KVM, but the 
machine must be restarted by a power cycle. RT Clock also does not update 
during these outages, no errors are logged in the EISA logs.
Comment 1 Michael K. Johnson 2000-08-22 16:30:17 EDT
Are you running the 2.2.16-3 errata kernel?
Comment 2 James Ringland 2000-09-11 10:02:22 EDT
Yes I am. I finally was able to trap the error using the playback on the Remote 
Insight server management board. It reads as follows: 

       Uhhuh. NMI received for unknown reason 20
       Dazed and confused, but trying to continue
       Do you have a strange power saving mode enabled?
Comment 3 Alan Cox 2000-09-15 14:01:00 EDT
 Uhhuh. NMI received for unknown reason 20

NMI is normally issued for things like ECC memory errors or bus errors. 20 is a
compaq specific error code so I don't know what it means. It certainly looks to
me like the hardware waved the white flag and surrendered rather than a Linux
crash.

If you can find out from compaq what NMI error code 20 is on these boxes I'd
love to know and can then try and help further.
Comment 4 James Ringland 2000-09-15 14:15:35 EDT
Thanks. I have placed a call to technical support. Also, I had another 
<SARCASM>Graceful Shutdown</SARCASM> with an NMI 21 this morning.
Comment 5 Eugene Kanter 2005-05-30 01:44:01 EDT
(In reply to comment #4)
> Thanks. I have placed a call to technical support. Also, I had another 
> <SARCASM>Graceful Shutdown</SARCASM> with an NMI 21 this morning.

James,

just wondering if you figured out what Coompaq NMI errors mean. I have seen
similar issues.
Comment 6 James Ringland 2005-05-31 16:36:18 EDT
That system is now out of service, but IIRC, the NMI's stopped for no apparent
reason. We never found a definitive cause for the problem.

Note You need to log in before you can comment on or make changes to this bug.