Bug 471756 - "kernel:Disabling IRQ #16" on terminals, followed quickly by system death....
Status: CLOSED DUPLICATE of bug 471162
Alias: None
Product: Fedora
Classification: Fedora
Component: kernel
Version: rawhide
Hardware: All
OS: Linux
Priority: medium
Severity: high
Target Milestone: ---
Assignee: Kernel Maintainer List
QA Contact: Fedora Extras Quality Assurance
Reported: 2008-11-15 21:16 UTC by Tom London
Modified: 2008-11-18 15:38 UTC (History)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed: 2008-11-18 15:38:06 UTC
Type: ---
Embargoed:


Attachments
"camera shot" of screen showing kernel trace (133.57 KB, image/jpeg)
2008-11-16 00:25 UTC, Tom London

Description Tom London 2008-11-15 21:16:21 UTC
Description of problem:
Running kernel-2.6.27.5-109.fc10.x86_64 on Thinkpad X61 (lspci below), after 1-2 hours of use, I get rsyslogd scribbling on my console windows:
 rsyslogd:kernel:Disabling IRQ #16

The system then becomes sluggish (e.g., mouse clicks take seconds to register).

When I finally activate a console window and try to run "tail /var/log/messages", I get an immediate fault (sorry, I don't remember the exact text; possibly "Bus error").

This has happened twice today.

Not sure it's relevant, but both times I was running "padsp audacity".

When I reboot (by power cycling), /var/log/messages shows nothing.

On boot, journal recovery runs on /boot, and / is reported as clean.

Rebooted system appears to run just fine.

Version-Release number of selected component (if applicable):
kernel-2.6.27.5-109.fc10.x86_64

How reproducible:
Twice today

Steps to Reproduce:
1.
2.
3.
  
Actual results:


Expected results:


Additional info:

Comment 1 Tom London 2008-11-15 21:16:39 UTC
[root@tlondon ~]# lspci
00:00.0 Host bridge: Intel Corporation Mobile PM965/GM965/GL960 Memory Controller Hub (rev 0c)
00:02.0 VGA compatible controller: Intel Corporation Mobile GM965/GL960 Integrated Graphics Controller (rev 0c)
00:02.1 Display controller: Intel Corporation Mobile GM965/GL960 Integrated Graphics Controller (rev 0c)
00:19.0 Ethernet controller: Intel Corporation 82566MM Gigabit Network Connection (rev 03)
00:1a.0 USB Controller: Intel Corporation 82801H (ICH8 Family) USB UHCI Controller #4 (rev 03)
00:1a.1 USB Controller: Intel Corporation 82801H (ICH8 Family) USB UHCI Controller #5 (rev 03)
00:1a.7 USB Controller: Intel Corporation 82801H (ICH8 Family) USB2 EHCI Controller #2 (rev 03)
00:1b.0 Audio device: Intel Corporation 82801H (ICH8 Family) HD Audio Controller (rev 03)
00:1c.0 PCI bridge: Intel Corporation 82801H (ICH8 Family) PCI Express Port 1 (rev 03)
00:1c.1 PCI bridge: Intel Corporation 82801H (ICH8 Family) PCI Express Port 2 (rev 03)
00:1d.0 USB Controller: Intel Corporation 82801H (ICH8 Family) USB UHCI Controller #1 (rev 03)
00:1d.1 USB Controller: Intel Corporation 82801H (ICH8 Family) USB UHCI Controller #2 (rev 03)
00:1d.7 USB Controller: Intel Corporation 82801H (ICH8 Family) USB2 EHCI Controller #1 (rev 03)
00:1e.0 PCI bridge: Intel Corporation 82801 Mobile PCI Bridge (rev f3)
00:1f.0 ISA bridge: Intel Corporation 82801HBM (ICH8M-E) LPC Interface Controller (rev 03)
00:1f.1 IDE interface: Intel Corporation 82801HBM/HEM (ICH8M/ICH8M-E) IDE Controller (rev 03)
00:1f.2 SATA controller: Intel Corporation 82801HBM/HEM (ICH8M/ICH8M-E) SATA AHCI Controller (rev 03)
00:1f.3 SMBus: Intel Corporation 82801H (ICH8 Family) SMBus Controller (rev 03)
03:00.0 Network controller: Intel Corporation PRO/Wireless 4965 AG or AGN Network Connection (rev 61)
05:00.0 CardBus bridge: Ricoh Co Ltd RL5c476 II (rev ba)
05:00.1 FireWire (IEEE 1394): Ricoh Co Ltd R5C832 IEEE 1394 Controller (rev 04)
05:00.2 SD Host controller: Ricoh Co Ltd R5C822 SD/SDIO/MMC/MS/MSPro Host Adapter (rev 21)
[root@tlondon ~]#

Comment 2 Tom London 2008-11-15 21:35:51 UTC
Got the below on the rebooted (and seemingly working) system:

[root@tlondon ~]# cat /proc/interrupts
           CPU0       CPU1       
  0:     461227     428182   IO-APIC-edge      timer
  1:          4          4   IO-APIC-edge      i8042
  4:          0          2   IO-APIC-edge    
  7:          0          0   IO-APIC-edge      parport0
  8:          0          1   IO-APIC-edge      rtc0
  9:        349       2359   IO-APIC-fasteoi   acpi
 12:         76         73   IO-APIC-edge      i8042
 14:      10009       9438   IO-APIC-edge      ata_piix
 15:          0          0   IO-APIC-edge      ata_piix
 16:       2670     113959   IO-APIC-fasteoi   ahci, uhci_hcd:usb5, yenta, i915@pci:0000:00:02.0
 17:      32407        109   IO-APIC-fasteoi   uhci_hcd:usb6, firewire_ohci, HDA Intel, iwlagn
 18:          0          0   IO-APIC-fasteoi   mmc0
 19:          0          0   IO-APIC-fasteoi   ehci_hcd:usb2
 20:         82      47837   IO-APIC-fasteoi   uhci_hcd:usb3, eth0
 21:          0          0   IO-APIC-fasteoi   uhci_hcd:usb4
 22:      14409        470   IO-APIC-fasteoi   ehci_hcd:usb1
NMI:          0          0   Non-maskable interrupts
LOC:     349675     470509   Local timer interrupts
RES:     194683     188372   Rescheduling interrupts
CAL:       6215        233   function call interrupts
TLB:        417       1161   TLB shootdowns
TRM:          0          0   Thermal event interrupts
THR:          0          0   Threshold APIC interrupts
SPU:          0          0   Spurious interrupts
ERR:          0
[root@tlondon ~]#
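
Since IRQ 16 is shared by four drivers in the output above, it can help to list every shared line mechanically. The following is a minimal sketch, not part of the original report: it assumes the /proc/interrupts layout shown above (a CPU header row, then "label: per-CPU counts, controller, comma-separated devices"), and the names parse_interrupts, shared_irqs and SAMPLE are made up for illustration. SAMPLE is a subset of the lines pasted above.

```python
# Sketch: find IRQ lines shared by more than one driver.
# Assumes the /proc/interrupts layout shown in this comment; other
# kernels or architectures may format the file differently.

SAMPLE = """\
           CPU0       CPU1
 16:       2670     113959   IO-APIC-fasteoi   ahci, uhci_hcd:usb5, yenta, i915@pci:0000:00:02.0
 17:      32407        109   IO-APIC-fasteoi   uhci_hcd:usb6, firewire_ohci, HDA Intel, iwlagn
 18:          0          0   IO-APIC-fasteoi   mmc0
 20:         82      47837   IO-APIC-fasteoi   uhci_hcd:usb3, eth0
NMI:          0          0   Non-maskable interrupts
"""


def parse_interrupts(text):
    """Return {irq_number: (per_cpu_counts, controller, devices)}."""
    lines = text.splitlines()
    ncpu = len(lines[0].split())  # header row: CPU0 CPU1 ...
    table = {}
    for line in lines[1:]:
        label, _, rest = line.partition(":")
        label = label.strip()
        if not label.isdigit():
            continue  # skip summary rows such as NMI, LOC, ERR
        # Split off the per-CPU counts and the controller name; whatever
        # remains is the comma-separated device list (may be absent).
        fields = rest.split(None, ncpu + 1)
        counts = [int(x) for x in fields[:ncpu]]
        controller = fields[ncpu] if len(fields) > ncpu else ""
        devices = []
        if len(fields) > ncpu + 1:
            devices = [d.strip() for d in fields[ncpu + 1].split(",")]
        table[int(label)] = (counts, controller, devices)
    return table


def shared_irqs(table):
    """Keep only the IRQ lines that have more than one device attached."""
    return {irq: devs for irq, (_, _, devs) in table.items() if len(devs) > 1}


if __name__ == "__main__":
    for irq, devs in sorted(shared_irqs(parse_interrupts(SAMPLE)).items()):
        print(irq, ", ".join(devs))
```

On a live system the same functions could be fed the contents of /proc/interrupts instead of SAMPLE.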

Comment 3 Tom London 2008-11-16 00:24:01 UTC
I appear to be able to reproduce this by running "padsp audacity".  I've never seen it otherwise; without running audacity, the system is rock stable.

This time, I ran "inotail -f /var/log/messages" in one of the console windows while I was running "padsp audacity".

I captured the screen with my cell phone camera, since the system was barely responsive.  The image is attached below.

Here (roughly) are the messages:

kernel: irq 16: nobody cared (try booting with the "irqpoll" option)
kernel: Pid: 5301, comm: audacity Not tainted 2.6.27.5-105.fc10.x86_64 #1
kernel:
kernel: Call Trace:
kernel: <IRQ> [<ffffffff0100320f>] __report_bad_irq+0x38/0x7c
kernel: [<ffffffff8108345b>] note_interrupt+0x200/0x26d
kernel: [<ffffffff81083b88>] handle_fasteoi_irq+0xbb/0xeb
kernel: [<ffffffff8101309e>] do_IRQ+0xf7/0x169
kernel: [<ffffffff81010933>] ret_from_intr+0x0/0x2e
kernel: <EOI>
kernel: handlers:
kernel: [<ffffffff81229dd6>] (ahci_interrupt+0x0/0x4aa)
kernel: [<ffffffff8123b3f6>] (usb_hcd_irq+0x0/0xb3)
kernel: [<ffffffffa00e354f>] (yenta_interrupt+0x0/0xc0 [yenta_socket])
kernel: [<ffffffffa041c46f>] (i915_driver_irq_handler+0x0/0x199 [i915])
kernel: Disabling IRQ #16

I'll google the above and see what's what.
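
For comparing this report against others, the handler list in the "nobody cared" message can be pulled out of a log automatically. This is a rough sketch under the assumption that the log lines look like the ones quoted above (handler entries written as "(symbol+offset/size [module])", with the module tag missing for code built into the kernel image). The function name parse_nobody_cared and the SAMPLE_LOG text are illustrative only; SAMPLE_LOG is an abridged copy of the trace above.

```python
import re

# Sketch: extract the IRQ number and registered handlers from a
# "nobody cared" kernel report. Assumes the message layout quoted above.

SAMPLE_LOG = """\
kernel: irq 16: nobody cared (try booting with the "irqpoll" option)
kernel: Call Trace:
kernel: [<ffffffff8108345b>] note_interrupt+0x200/0x26d
kernel: handlers:
kernel: [<ffffffff81229dd6>] (ahci_interrupt+0x0/0x4aa)
kernel: [<ffffffffa00e354f>] (yenta_interrupt+0x0/0xc0 [yenta_socket])
kernel: [<ffffffffa041c46f>] (i915_driver_irq_handler+0x0/0x199 [i915])
kernel: Disabling IRQ #16
"""

IRQ_RE = re.compile(r"irq (\d+): nobody cared")
# "(symbol+offset/size)" with an optional "[module]" tag before the ")".
HANDLER_RE = re.compile(r"\((\w+)\+[^\s)]*(?:\s+\[(\w+)\])?\)")


def parse_nobody_cared(log_text):
    """Return (irq, [(handler_symbol, module_or_None), ...]) for the last report."""
    irq = None
    handlers = []
    in_handlers = False
    for line in log_text.splitlines():
        m = IRQ_RE.search(line)
        if m:
            # A new report starts: reset any previously collected handlers.
            irq = int(m.group(1))
            handlers = []
            in_handlers = False
            continue
        if line.rstrip().endswith("handlers:"):
            in_handlers = True
            continue
        if in_handlers:
            h = HANDLER_RE.search(line)
            if h:
                handlers.append((h.group(1), h.group(2)))
            else:
                in_handlers = False  # first non-handler line ends the list
    return irq, handlers


if __name__ == "__main__":
    irq, handlers = parse_nobody_cared(SAMPLE_LOG)
    print("IRQ", irq)
    for symbol, module in handlers:
        print(" ", symbol, "(module: %s)" % (module or "none listed"))
```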

.jpg of screen below.....

Comment 4 Tom London 2008-11-16 00:25:34 UTC
Created attachment 323704
"camera shot" of screen showing kernel trace

Camera shot of screen with trace just before system went unstable.

Comment 5 Tom London 2008-11-16 00:39:51 UTC
Will anything be learned by me running with "irqpoll" or "irqfixup"?
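
Both "irqpoll" and "irqfixup" are kernel boot parameters that make the kernel search all registered handlers when an interrupt goes unhandled (irqpoll additionally checks them on each timer interrupt). When testing with them, it is worth recording whether the running kernel was actually booted with either one. A small sketch, assuming the parameters appear as bare words in /proc/cmdline; the function name irq_workarounds and the example command line are hypothetical:

```python
# Sketch: report whether the running kernel was booted with the
# "irqpoll" or "irqfixup" parameters. The function name and the example
# command line below are made up for illustration.

def irq_workarounds(cmdline):
    """Return {option: True/False} for the two IRQ workaround parameters."""
    tokens = cmdline.split()
    return {opt: opt in tokens for opt in ("irqpoll", "irqfixup")}


if __name__ == "__main__":
    try:
        with open("/proc/cmdline") as f:
            cmdline = f.read()
    except OSError:
        # Not on Linux, or /proc is unavailable: fall back to a made-up example.
        cmdline = "ro root=/dev/sda1 rhgb quiet irqpoll"
    for opt, enabled in irq_workarounds(cmdline).items():
        print(opt, "enabled" if enabled else "not set")
```

Attaching this output next to any new trace makes it clear which configuration produced it.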

Comment 6 Thorsten Leemhuis 2008-11-16 07:22:43 UTC
I'm seeing something like this as well with kernel-2.6.27.5-109.fc10.x86_64, but it looks to me like this bug is a dupe of Bug 471162, so I'll report details there.

Comment 7 Tom London 2008-11-18 15:38:06 UTC

*** This bug has been marked as a duplicate of bug 471162 ***

