Bug 235336 - qla3200 HBA Card hangs with a memory error
qla3200 HBA Card hangs with a memory error
Status: CLOSED NOTABUG
Product: Red Hat Enterprise Linux 4
Classification: Red Hat
Component: kernel (Show other bugs)
4.4
x86_64 Linux
medium Severity medium
: ---
: ---
Assigned To: Red Hat Kernel Manager
Martin Jenner
:
Depends On:
Blocks:
  Show dependency treegraph
 
Reported: 2007-04-05 05:24 EDT by Qian Shen
Modified: 2015-10-25 21:54 EDT (History)
2 users (show)

See Also:
Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of:
Environment:
Last Closed: 2007-09-27 18:12:34 EDT
Type: ---
Regression: ---
Mount Type: ---
Documentation: ---
CRM:
Verified Versions:
Category: ---
oVirt Team: ---
RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: ---


Attachments (Terms of Use)

  None (edit)
Description Qian Shen 2007-04-05 05:24:32 EDT
Description of problem:

The customer uses a qla2300 HBA card to connect to a storage box. when one of
memory chips becomes fault, the qla2300 card reports waring to /var/log/message
and doesn't work. After replaced a new memory chip, the card become work then.

The customer think that it's a big issue that HBA card doesn't work, so it's not
enough to report warning message only one time. He think it will be good that
HBA card can report warning when each i/o request arrived.

The error message like this:

> Uhhuh. NMI received. Dazed and confused, but trying to continue

> > cpqphp: power fault interrupt

> > cpqphp: power fault bit 1 set

> > You probably have a hardware problem with your RAM chips qla2300 

> > 0000:06:02.0: qla2xxx_eh_abort scsi(1:0:0:26): cmd_timeout_in_sec=0x1e.

> > qla2300 0000:06:02.0: Parity error -- HCCR=ffff.

> > qla2300 0000:06:02.0: Mailbox command timeout occured. Issuing ISP abort.

> > qla2300 0000:06:02.0: Performing ISP error recovery - ha= f6f9c258.

> > cpqphp: power fault interrupt

> > cpqphp: power fault bit 0 set

> > qla2300 0000:06:01.0: qla2xxx_eh_abort scsi(0:0:0:26): cmd_timeout_in_sec=0x1e.

> > qla2300 0000:06:01.0: Parity error -- HCCR=ffff.

> > qla2300 0000:06:01.0: Mailbox command timeout occured. Issuing ISP abort.

> > qla2300 0000:06:01.0: Performing ISP error recovery - ha= f7e0c258.

Version-Release number of selected component (if applicable):
RHEL AS 4.4

How reproducible:


Steps to Reproduce:
1.
2.
3.
  
Actual results:


Expected results:


Additional info:
Comment 1 Ernie Petrides 2007-09-27 18:12:34 EDT
This is a hardware problem.  The operating system cannot be expected
to continue to function properly when there are disk controller errors.

Note You need to log in before you can comment on or make changes to this bug.