Bug 220984

Summary: Kernel hang on Supermicro P4SCT with sata_mv
Product: Red Hat Enterprise Linux 5 Reporter: Milan Kerslager <milan.kerslager>
Component: kernelAssignee: Red Hat Kernel Manager <kernel-mgr>
Status: CLOSED NOTABUG QA Contact: Brian Brock <bbrock>
Severity: medium Docs Contact:
Priority: medium    
Version: 5.0   
Target Milestone: ---   
Target Release: ---   
Hardware: i386   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2007-09-05 20:47:30 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Attachments:
Description Flags
dmesg output (kernel-2.6.18-1.2913.2.1.el5.gtest.3)
none
lsmod
none
lspci none

Description Milan Kerslager 2006-12-29 22:41:20 UTC
This is Supermicro P4SCT+ rev. 1.11 with latest BIOS update (1.2b, 06/12/2006).

The system has been reported to me as stable with FC3 in the past. Now with
RHEL4 or RHEL5b2 I see random hard hangs (after minutes to several hours of
uptime). There are 7 discs (SATA WD4000KD-00N) connected to the MV88SX5081
8-port SATA I PCI-X Controller. I tryed binary kernel module instead of sata_mv
too with no luck (there is RAID5 array but with no dmraid now).

I tryed to disable hyperthreading, update BIOS, boot with noapic kernel
parameter with no luck. The system hanged during startup sequence on hotplud and
smartd daemon start so I disabled these. Other hangs was after some time with no
traces or OOPSes in the logs or on the screen.

I still have no clue about what going on (maybe SATA driver/card?). I moved the
system to my home to be able to do more testing so please ask me for what I have
to try.

The system is now running the latest test kernel (2.6.18-1.2913.2.1.el5.gtest.3)
for RHEL5 from http://people.redhat.com/agospoda/rhel5/gtest/.

Comment 1 Milan Kerslager 2006-12-29 22:47:17 UTC
Created attachment 144565 [details]
dmesg output (kernel-2.6.18-1.2913.2.1.el5.gtest.3)

Comment 2 Milan Kerslager 2006-12-29 22:47:54 UTC
Created attachment 144566 [details]
lsmod

Comment 3 Milan Kerslager 2006-12-29 22:48:24 UTC
Created attachment 144567 [details]
lspci

Comment 4 Milan Kerslager 2007-01-10 08:28:44 UTC
I'm using kernel 2.6.18-1.2962.2.1.el5.gtest.4.i686 (HT off) from
http://people.redhat.com/agospoda/#rhel5 and the system seems to be stable last
few days (it survives rebuilding RAID5 arrays). Also I put off console blanking
by "setterm -blank 0 > /dev/tty[1-8]".

I'm going to do some stress tests to make sure it will not hang again.

Comment 5 Milan Kerslager 2007-03-20 22:41:58 UTC
It seems like HW bug on motherboard (the last piece of HW that has been claimed).
I'm not able to close bug as NOTABUG even I'm a submiter. Please close. Thank you.

Comment 6 Ernie Petrides 2007-09-05 20:47:30 UTC
Closing at the request of reporter.