Bug 185724
Summary: | lockup on accessing second sata drive (Maxtor 6V250F0) | ||
---|---|---|---|
Product: | Red Hat Enterprise Linux 3 | Reporter: | Samuel Masham <samuel.masham> |
Component: | kernel | Assignee: | Jeff Garzik <jgarzik> |
Status: | CLOSED WONTFIX | QA Contact: | Brian Brock <bbrock> |
Severity: | high | Docs Contact: | |
Priority: | medium | ||
Version: | 3.0 | CC: | peterm, petrides, rwahl |
Target Milestone: | --- | ||
Target Release: | --- | ||
Hardware: | All | ||
OS: | Linux | ||
URL: | http://lkml.org/lkml/2006/3/16/369 | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | Bug Fix | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2007-10-19 18:46:04 UTC | Type: | --- |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Description
Samuel Masham
2006-03-17 05:52:03 UTC
Original info was at http://lkml.org/lkml/2006/3/16/369 In addition: I have now talked with Maxtor and found that the drives do have the latest (nvida4 bugfix) firmware. I have retested with shorter new cables with the same result. I would like to confirm that in the Device Configuration Overlay the NCQ is disabled but no idea how to read that... Any ideas how we can work around or fix this issue? I reported this update to the http://lkml.org/lkml/2006/3/22/122 but as yet no ideas what to try next. The issue also occurs when: 1, only the cdrom and the 6V250F0 are present 2, a 2.6.12 kernel is used ie I just unplugged the first sata disk and rebooted from an old knoppix disk that I had around and reproduced this issue its *really* 100% reproducible! Are there any news on this topic? The related thread on LKML seems death since end of march. Today I had similar issue (parity error on the second of my two 6V300F0 (Firmware VA111630) with FC5 with the 2.6.16-1.2096_FC5smp kernel. Now is the second morning where this bug occures (during/after the cron.daily scripts are run shortly 4:00 AM - I guess updatedb). Yesterday a kernel crash happened because of a NULL pointer dereference. There was even an interesting SELinux denial message before: May 2 04:05:11 rohan kernel: audit(1146535511.571:30): avc: denied { execheap } for pid=11632 comm="ld-linux.so.2" scontext=user_u:system_r:unconfined_t:s0 tcontext=user_u:system_r:unconfined_t:s0 tclass=process execheap sound strange ... rather like some bad thing happend already. The i/o errors started 2:30h later. Then this appeared multiple times: May 2 06:49:33 rohan kernel: Assertion failed! qc != NULL,drivers/scsi/libata-core.c,ata_pio_poll,line=2897 followed by the oops with this backtrace: May 2 06:49:55 rohan kernel: Process ata/1 (pid: 356, threadinfo=f7f4e000 task=f7cce850) May 2 06:49:55 rohan kernel: Stack: <0>f7cce978 f7e81050 f7d1f310 c241d160 64a95700 003d27ea 00000000 00000001 May 2 06:49:55 rohan kernel: 00200082 00200282 f7d1f8cc f7d1f8d0 c2648cc0 00200282 c0131394 f88a7334 May 2 06:49:55 rohan kernel: f7d1f310 c2648cd8 c2648cc0 c2648ce0 c0131b81 c0131c67 00000000 00000000 May 2 06:49:55 rohan kernel: Call Trace: May 2 06:49:55 rohan kernel: [<c0131394>] run_workqueue+0x7f/0xba [<f88a7334>] ata_pio_task+0x0/0x677 [libata] May 2 06:49:56 rohan kernel: [<c0131b81>] worker_thread+0x0/0x117 [<c0131c67>] worker_thread+0xe6/0x117 May 2 06:49:56 rohan kernel: [<c011da89>] default_wake_function+0x0/0xc [<c0134499>] kthread+0x9d/0xc9 May 2 06:49:56 rohan kernel: [<c01343fc>] kthread+0x0/0xc9 [<c0102005>] kernel_thread_helper+0x5/0xb The board is an ASUS P4P800-E Deluxe with Intel ICH5. Do you need more Info? Shall I file a seperate (more detailed) bug report against FC5 for this or is it ok to track it here in this report? Hi. After much searching on the internet to solve our problems we too came across this bug but on Fedora Core 5, except that we don't seem to get any disk error output, HOWEVER we DO have 2 Maxtor drives set up to use software raid. Also, i'd like to mention that i found this in the intel docs for linux and RH9: ftp://download.intel.com/design/motherbd/linux/RedHat9_info.pdf -- Currently Red Hat 9.0 does not natively support Serial ATA disk drive configurations running in "Enhanced" mode. You must set the Intel(R) desktop board to run in "Legacy" mode to install the operating system on a SATA disk drive. NOTE: When in "Legacy" mode you are limited to a combination of 4 storage devices (for example, 2 SATA and/or 2 PATA disk drives, or 4 PATA disk drives). I havn't had a chance to try this yet though to see if it fixes our problems. Below are some details that may/may not help. It certainly is an interesting bug. Here's the details: [root@ log]# uname -a Linux 2.6.15-1.2054_FC5 #1 SMP Tue Mar 14 15:48:20 EST 2006 x86_64 x86_64 x86_64 GNU/Linux [root@ log]# lspci 00:00.0 Host bridge: Intel Corporation 82865G/PE/P DRAM Controller/Host-Hub Interface (rev 02) 00:02.0 VGA compatible controller: Intel Corporation 82865G Integrated Graphics Controller (rev 02) 00:1d.0 USB Controller: Intel Corporation 82801EB/ER (ICH5/ICH5R) USB UHCI Controller #1 (rev 02) 00:1d.1 USB Controller: Intel Corporation 82801EB/ER (ICH5/ICH5R) USB UHCI Controller #2 (rev 02) 00:1d.2 USB Controller: Intel Corporation 82801EB/ER (ICH5/ICH5R) USB UHCI Controller #3 (rev 02) 00:1d.3 USB Controller: Intel Corporation 82801EB/ER (ICH5/ICH5R) USB UHCI Controller #4 (rev 02) 00:1d.7 USB Controller: Intel Corporation 82801EB/ER (ICH5/ICH5R) USB2 EHCI Controller (rev 02) 00:1e.0 PCI bridge: Intel Corporation 82801 PCI Bridge (rev c2) 00:1f.0 ISA bridge: Intel Corporation 82801EB/ER (ICH5/ICH5R) LPC Interface Bridge (rev 02) 00:1f.2 IDE interface: Intel Corporation 82801EB (ICH5) SATA Controller (rev 02) 00:1f.3 SMBus: Intel Corporation 82801EB/ER (ICH5/ICH5R) SMBus Controller (rev 02) 00:1f.5 Multimedia audio controller: Intel Corporation 82801EB/ER (ICH5/ICH5R) AC'97 Audio Controller (rev 02) 01:08.0 Ethernet controller: Intel Corporation 82801EB/ER (ICH5/ICH5R) integrated LAN Controller (rev 02) -------------------------- SCSI subsystem initialized libata version 1.20 loaded. ata_piix 0000:00:1f.2: version 1.05 ata_piix 0000:00:1f.2: combined mode detected (p=1, s=0) GSI 16 sharing vector 0xA9 and IRQ 16 ACPI: PCI Interrupt 0000:00:1f.2[A] -> GSI 18 (level, low) -> IRQ 16 PCI: Setting latency timer of device 0000:00:1f.2 to 64 ata1: SATA max UDMA/133 cmd 0x1F0 ctl 0x3F6 bmdma 0xF000 irq 14 ata1: dev 0 cfg 49:2f00 82:7c6b 83:7f09 84:4773 85:7c68 86:3e01 87:4763 88:207f ata1: dev 0 ATA-7, max UDMA/133, 312579695 sectors: LBA48 ata1: dev 1 cfg 49:2f00 82:7c6b 83:7f09 84:4773 85:7c68 86:3e01 87:4763 88:207f ata1: dev 1 ATA-7, max UDMA/133, 312581808 sectors: LBA48 ata1: dev 0 configured for UDMA/133 ata1: dev 1 configured for UDMA/133 scsi0 : ata_piix Vendor: ATA Model: Maxtor 6V160E0 Rev: VA11 Type: Direct-Access ANSI SCSI revision: 05 SCSI device sda: 312579695 512-byte hdwr sectors (160041 MB) sda: Write Protect is off sda: Mode Sense: 00 3a 00 00 SCSI device sda: drive cache: write back SCSI device sda: 312579695 512-byte hdwr sectors (160041 MB) sda: Write Protect is off sda: Mode Sense: 00 3a 00 00 SCSI device sda: drive cache: write back sda: sda1 sda2 sd 0:0:0:0: Attached scsi disk sda Vendor: ATA Model: Maxtor 6V160E0 Rev: VA11 Type: Direct-Access ANSI SCSI revision: 05 SCSI device sdb: 312581808 512-byte hdwr sectors (160042 MB) sdb: Write Protect is off sdb: Mode Sense: 00 3a 00 00 SCSI device sdb: drive cache: write back SCSI device sdb: 312581808 512-byte hdwr sectors (160042 MB) sdb: Write Protect is off sdb: Mode Sense: 00 3a 00 00 SCSI device sdb: drive cache: write back sdb: sdb1 sd 0:0:1:0: Attached scsi disk sdb ata2: PATA max UDMA/100 cmd 0x170 ctl 0x376 bmdma 0xF008 irq 15 ata2: disabling port scsi1 : ata_piix and lastly [root@ ~]lshw description: Desktop Computer width: 32 bits capabilities: smbios-2.3 dmi-2.3 configuration: boot=normal chassis=desktop *-core description: Motherboard product: 8I865GVMK-775 vendor: Gigabyte Technology Co., Ltd. physical id: 0 version: x.x *-firmware description: BIOS vendor: Award Software International, Inc. physical id: 0 version: F2 (09/16/2005) size: 128KB capacity: 320KB capabilities: pci pnp apm upgrade shadowing cdboot bootselect edd int13floppy360 int13floppy1200 int13floppy720 int13floppy2880 int5printscreen int9keyboard int14serial int17printer int10video acpi usb ls120boot zipboot biosbootspecification *-cpu:0 description: CPU product: Intel(R) Pentium(R) 4 CPU 3.00GHz vendor: Intel Corp. physical id: 4 bus info: cpu@0 version: Intel(R) Pentium(R) 4 CPU slot: Socket 775 size: 3GHz capacity: 3600MHz width: 64 bits clock: 200MHz capabilities: fpu fpu_exception wp vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm syscall nx x86-64 constant_tsc pni monitor ds_cpl est cid cx16 xtpr *-cache:0 description: L1 cache physical id: b slot: Internal Cache size: 16KB capacity: 16KB capabilities: synchronous internal write-back *-cache:1 description: L2 cache physical id: c slot: External Cache size: 2MB capabilities: synchronous internal write-back *-cpu:1 description: CPU product: ( ) vendor: Intel physical id: 5 bus info: cpu@1 version: Intel(R) Pentium(R) 4 CPU slot: Socket 775 size: 3GHz capacity: 3600MHz clock: 200MHz *-memory description: System Memory physical id: 1b slot: System board or motherboard size: 1GB *-bank:0 description: DIMM 400 MHz (2.5 ns) product: None vendor: None physical id: 0 serial: None slot: A0 size: 512MB width: 64 bits clock: 400MHz (2.5ns) *-bank:1 description: DIMM [empty] product: None vendor: None physical id: 1 serial: None slot: A1 *-bank:2 description: DIMM 400 MHz (2.5 ns) product: None vendor: None physical id: 2 serial: None slot: A2 size: 512MB width: 64 bits clock: 400MHz (2.5ns) *-bank:3 description: DIMM [empty] product: None vendor: None physical id: 3 serial: None slot: A3 *-pci description: Host bridge product: 82865G/PE/P DRAM Controller/Host-Hub Interface vendor: Intel Corporation physical id: e0000000 bus info: pci@00:00.0 version: 02 width: 32 bits clock: 33MHz resources: iomemory:e0000000-efffffff *-display description: VGA compatible controller product: 82865G Integrated Graphics Controller vendor: Intel Corporation physical id: 2 bus info: pci@00:02.0 version: 02 size: 128MB width: 32 bits clock: 33MHz capabilities: vga bus_master cap_list resources: iomemory:f0000000-f7ffffff iomemory:f8100000-f817ffff ioport:c000-c007 irq:19 *-usb:0 description: USB Controller product: 82801EB/ER (ICH5/ICH5R) USB UHCI Controller #1 vendor: Intel Corporation physical id: 1d bus info: pci@00:1d.0 version: 02 width: 32 bits clock: 33MHz capabilities: uhci bus_master configuration: driver=uhci_hcd resources: ioport:b000-b01f irq:19 *-usbhost product: UHCI Host Controller vendor: Linux 2.6.15-1.2054_FC5 uhci_hcd physical id: 1 bus info: usb@1 logical name: usb1 version: 2.06 capabilities: usb-1.10 configuration: driver=hub maxpower=0mA slots=2 speed=12.0MB/s *-usb:1 description: USB Controller product: 82801EB/ER (ICH5/ICH5R) USB UHCI Controller #2 vendor: Intel Corporation physical id: 1d.1 bus info: pci@00:1d.1 version: 02 width: 32 bits clock: 33MHz capabilities: uhci bus_master configuration: driver=uhci_hcd resources: ioport:b400-b41f irq:20 *-usbhost product: UHCI Host Controller vendor: Linux 2.6.15-1.2054_FC5 uhci_hcd physical id: 1 bus info: usb@2 logical name: usb2 version: 2.06 capabilities: usb-1.10 configuration: driver=hub maxpower=0mA slots=2 speed=12.0MB/s *-usb:2 description: USB Controller product: 82801EB/ER (ICH5/ICH5R) USB UHCI Controller #3 vendor: Intel Corporation physical id: 1d.2 bus info: pci@00:1d.2 version: 02 width: 32 bits clock: 33MHz capabilities: uhci bus_master configuration: driver=uhci_hcd resources: ioport:b800-b81f irq:16 *-usbhost product: UHCI Host Controller vendor: Linux 2.6.15-1.2054_FC5 uhci_hcd physical id: 1 bus info: usb@3 logical name: usb3 version: 2.06 capabilities: usb-1.10 configuration: driver=hub maxpower=0mA slots=2 speed=12.0MB/s *-usb:3 description: USB Controller product: 82801EB/ER (ICH5/ICH5R) USB UHCI Controller #4 vendor: Intel Corporation physical id: 1d.3 bus info: pci@00:1d.3 version: 02 width: 32 bits clock: 33MHz capabilities: uhci bus_master configuration: driver=uhci_hcd resources: ioport:bc00-bc1f irq:19 *-usbhost product: UHCI Host Controller vendor: Linux 2.6.15-1.2054_FC5 uhci_hcd physical id: 1 bus info: usb@4 logical name: usb4 version: 2.06 capabilities: usb-1.10 configuration: driver=hub maxpower=0mA slots=2 speed=12.0MB/s *-usb:4 description: USB Controller product: 82801EB/ER (ICH5/ICH5R) USB2 EHCI Controller vendor: Intel Corporation physical id: 1d.7 bus info: pci@00:1d.7 version: 02 width: 32 bits clock: 33MHz capabilities: ehci bus_master cap_list configuration: driver=ehci_hcd resources: iomemory:f8180000-f81803ff irq:21 *-usbhost product: EHCI Host Controller vendor: Linux 2.6.15-1.2054_FC5 ehci_hcd physical id: 1 bus info: usb@5 logical name: usb5 version: 2.06 capabilities: usb-2.00 configuration: driver=hub maxpower=0mA slots=8 speed=480.0MB/s *-pci description: PCI bridge product: 82801 PCI Bridge vendor: Intel Corporation physical id: 1e bus info: pci@00:1e.0 version: c2 width: 32 bits clock: 33MHz capabilities: pci normal_decode bus_master *-network description: Ethernet interface product: 82801EB/ER (ICH5/ICH5R) integrated LAN Controller vendor: Intel Corporation physical id: 8 bus info: pci@01:08.0 logical name: eth0 version: 02 serial: 00:14:85:c5:4c:98 size: 100MB/s capacity: 100MB/s width: 32 bits clock: 33MHz capabilities: bus_master cap_list ethernet physical tp mii 10bt 10bt-fd 100bt 100bt-fd autonegotiation configuration: autonegotiation=on broadcast=yes driver=e100 driverversion=3.5.10-k2-NAPI duplex=full firmware=N/A ip=192.168.0.14 link=yes multicast=yes port=MII speed=100MB/s resources: iomemory:f8000000-f8000fff ioport:a000-a03f irq:17 *-isa UNCLAIMED description: ISA bridge product: 82801EB/ER (ICH5/ICH5R) LPC Interface Bridge vendor: Intel Corporation physical id: 1f bus info: pci@00:1f.0 version: 02 width: 32 bits clock: 33MHz capabilities: isa bus_master *-ide description: IDE interface product: 82801EB (ICH5) SATA Controller vendor: Intel Corporation physical id: 1f.2 bus info: pci@00:1f.2 logical name: scsi0 version: 02 width: 32 bits clock: 66MHz capabilities: ide bus_master emulated configuration: driver=ata_piix resources: ioport:f000-f00f irq:16 *-disk:0 description: SCSI Disk product: Maxtor 6V160E0 vendor: ATA physical id: 0.0.0 bus info: scsi@0:0.0.0 logical name: /dev/sda version: VA11 serial: V39AEFGG size: 149GB capabilities: partitioned partitioned:dos configuration: ansiversion=5 *-volume:0 description: Linux filesystem partition physical id: 1 bus info: scsi@0:0.0.0,1 logical name: /dev/sda1 capacity: 101MB capabilities: primary bootable *-volume:1 description: Linux LVM partition physical id: 2 bus info: scsi@0:0.0.0,2 logical name: /dev/sda2 capacity: 148GB capabilities: primary multi *-disk:1 description: SCSI Disk product: Maxtor 6V160E0 vendor: ATA physical id: 0.1.0 bus info: scsi@0:0.1.0 logical name: /dev/sdb version: VA11 serial: V395DFEG size: 149GB capabilities: partitioned partitioned:dos configuration: ansiversion=5 *-volume description: Linux LVM partition physical id: 1 bus info: scsi@0:0.1.0,1 logical name: /dev/sdb1 capacity: 149GB capabilities: primary bootable multi *-serial description: SMBus product: 82801EB/ER (ICH5/ICH5R) SMBus Controller vendor: Intel Corporation physical id: 1f.3 bus info: pci@00:1f.3 version: 02 width: 32 bits clock: 33MHz configuration: driver=i801_smbus resources: ioport:1400-141f irq:9 *-multimedia description: Multimedia audio controller product: 82801EB/ER (ICH5/ICH5R) AC'97 Audio Controller vendor: Intel Corporation physical id: 1f.5 bus info: pci@00:1f.5 version: 02 width: 32 bits clock: 33MHz capabilities: bus_master cap_list configuration: driver=Intel ICH resources: ioport:dc00-dcff ioport:e000-e03f iomemory:f8181000- f81811ff iomemory:f8182000-f81820ff irq:18 For me the problem _seems_ to be gone since I switched the irq balancer off (this is a system daemon). On a P4 with Hyperthreading (my platform) this makes not much sense I think. Some other problems are gone too with this change - e.g.the jiffi counters in /proc/stat run factor 10x slower than usual sometimes. Dunno if there is a direct relation but at least I havn't seen any anomalies since then. Unfortunately the bug occured again - after more than a month of working. The second disc - respective my whole system - was freezed again this morning. ... Jun 14 03:34:40 rohan kernel: ata2: command 0xb0 timeout, stat 0xd0 host_stat 0x0 Jun 14 03:34:40 rohan kernel: ata2: status=0xd0 { Busy } ... Jun 14 03:34:50 rohan kernel: Assertion failed! qc != NULL,drivers/scsi/libata-core.c,ata_pio_poll,line=2897 Jun 14 03:35:10 rohan last message repeated 1241 times Jun 14 03:35:10 rohan kernel: Unable to handle kernel NULL pointer dereference at virtual address 00000094 ... This bug is filed against RHEL 3, which is in maintenance phase. During the maintenance phase, only security errata and select mission critical bug fixes will be released for enterprise products. Since this bug does not meet that criteria, it is now being closed. For more information of the RHEL errata support policy, please visit: http://www.redhat.com/security/updates/errata/ If you feel this bug is indeed mission critical, please contact your support representative. You may be asked to provide detailed information on how this bug is affecting you. |