Bug 784518

Summary: irq 9: nobody cared (try booting with the "irqpoll" option)
Product: [Fedora] Fedora Reporter: Catalin BOIE <fedora-bugzilla>
Component: kernelAssignee: Kernel Maintainer List <kernel-maint>
Status: CLOSED ERRATA QA Contact: Fedora Extras Quality Assurance <extras-qa>
Severity: high Docs Contact:
Priority: unspecified    
Version: 16CC: djip.perois, gansalmon, itamar, jonathan, kernel-maint, madhu.chinakonda, mhomolov
Target Milestone: ---   
Target Release: ---   
Hardware: i686   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2012-07-12 15:44:20 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Attachments:
Description Flags
lspci -vvv none

Description Catalin BOIE 2012-01-25 09:05:00 UTC
Created attachment 557406 [details]
lspci -vvv

Description of problem:
Sometimes, seems related to network up/down, I get this error in the logs. After that seems the network is dead.
What is strange is that irq 9 is apci and network is on 41.
lspci says: "Interrupt: pin A routed to IRQ 41". Maybe this is the problem?

Version-Release number of selected component (if applicable):
Kernel: 3.1.9-1.fc16.i686.PAE #1 SMP Fri Jan 13 16:57:54 UTC 2012 i686 i686 i386 GNU/Linux

How reproducible:
service network restart

Steps to Reproduce:
1. Restart network
  
Actual results:
Network gets stuck

Expected results:
Network should work.

Additional info:
NET[4060]: /etc/sysconfig/network-scripts/ifup-post : updated /etc/resolv.conf
network[3708]: [  OK  ]
kernel: [2223699.521333] irq 9: nobody cared (try booting with the "irqpoll" option)
kernel: [2223699.521398] Pid: 4087, comm: ifup-eth Not tainted 3.1.6-1.fc16.i686.PAE #1
kernel: [2223699.521400] Call Trace:
kernel: [2223699.521410]  [<c0920c9f>] ? printk+0x2d/0x2f
kernel: [2223699.521416]  [<c04bec39>] __report_bad_irq+0x29/0xd0
kernel: [2223699.521419]  [<c04bee9e>] note_interrupt+0x11e/0x1d0
kernel: [2223699.521422]  [<c04bf620>] ? unmask_irq+0x30/0x30
kernel: [2223699.521428]  [<c06aec82>] ? acpi_ev_sci_xrupt_handler+0x1a/0x20
kernel: [2223699.521431]  [<c04bf620>] ? unmask_irq+0x30/0x30
kernel: [2223699.521433]  [<c04bd21e>] handle_irq_event_percpu+0x9e/0x200
kernel: [2223699.521436]  [<c04bf620>] ? unmask_irq+0x30/0x30
kernel: [2223699.521439]  [<c04bd3b2>] handle_irq_event+0x32/0x60
kernel: [2223699.521442]  [<c04bf620>] ? unmask_irq+0x30/0x30
kernel: [2223699.521444]  [<c04bf667>] handle_fasteoi_irq+0x47/0xb0
kernel: [2223699.521446]  <IRQ>  [<c0413932>] ? do_IRQ+0x42/0xc0
kernel: [2223699.521455]  [<c0931a70>] ? common_interrupt+0x30/0x38
kernel: [2223699.521459]  [<c0595c71>] ? sysfs_follow_link+0x1/0x190
kernel: [2223699.521463]  [<c05451f5>] ? link_path_walk+0x385/0x750
kernel: [2223699.521466]  [<c05456e0>] ? path_lookupat+0x50/0x660
radvd[1479]: attempting to reread config file
kernel: [2223699.521469]  [<c0545d1c>] ? do_path_lookup+0x2c/0xb0
kernel: [2223699.521472]  [<c0546e36>] ? user_path_at_empty+0x46/0x80
kernel: [2223699.521475]  [<c0546e8f>] ? user_path_at+0x1f/0x30
kernel: [2223699.521478]  [<c053da28>] ? vfs_fstatat+0x48/0x80
kernel: [2223699.521481]  [<c053dab0>] ? vfs_stat+0x20/0x30
kernel: [2223699.521484]  [<c053dd26>] ? sys_stat64+0x16/0x30
kernel: [2223699.521488]  [<c04c24fd>] ? rcu_irq_exit+0xd/0x10
kernel: [2223699.521492]  [<c04622a7>] ? irq_exit+0x47/0xa0
kernel: [2223699.521495]  [<c041393b>] ? do_IRQ+0x4b/0xc0
kernel: [2223699.521500]  [<c046edf6>] ? sys_rt_sigprocmask+0x86/0xa0
kernel: [2223699.521504]  [<c042d349>] ? smp_apic_timer_interrupt+0x59/0x90
kernel: [2223699.521509]  [<c092daf0>] ? vmalloc_fault+0x190/0x190
kernel: [2223699.521511]  [<c09314df>] ? sysenter_do_call+0x12/0x28
kernel: [2223699.521513] handlers:
kernel: [2223699.521533] [<c06a1497>] acpi_irq
kernel: [2223699.521560] Disabling IRQ #9
kernel: [2223699.542529] ADDRCONF(NETDEV_UP): br0: link is not ready
radvd[1479]: no linklocal address configured for br0

# cat /proc/interrupts 
            CPU0       CPU1       
   0:        123        736   IO-APIC-edge      timer
   1:          2          8   IO-APIC-edge      i8042
   7:          1          0   IO-APIC-edge    
   8:          0          1   IO-APIC-edge      rtc0
   9:          0          0   IO-APIC-fasteoi   acpi
  14:          0          0   IO-APIC-edge      pata_atiixp
  15:          0          0   IO-APIC-edge      pata_atiixp
  16:          2        202   IO-APIC-fasteoi   ohci_hcd:usb3, ohci_hcd:usb4
  17:          0          2   IO-APIC-fasteoi   ehci_hcd:usb1
  18:          3        525   IO-APIC-fasteoi   ohci_hcd:usb5, ohci_hcd:usb6, ohci_hcd:usb7, radeon
  19:          0          3   IO-APIC-fasteoi   ehci_hcd:usb2
  21:      29667   33667010   IO-APIC-fasteoi   eth1
  22:       7150    3458278   IO-APIC-fasteoi   ahci
  41:       3017     712929   PCI-MSI-edge      eth0
 NMI:          0          0   Non-maskable interrupts
 LOC:  165881491  165200020   Local timer interrupts
 SPU:          0          0   Spurious interrupts
 PMI:          0          0   Performance monitoring interrupts
 IWI:          0          0   IRQ work interrupts
 RES:  137567713  136764578   Rescheduling interrupts
 CAL:      12804      48199   Function call interrupts
 TLB:   18714962   19428540   TLB shootdowns
 TRM:          0          0   Thermal event interrupts
 THR:          0          0   Threshold APIC interrupts
 MCE:          0          0   Machine check exceptions
 MCP:        738        738   Machine check polls
 ERR:          1
 MIS:          0

Comment 1 djip007 2012-03-17 15:37:41 UTC
same things here: with IRQ 5 et 7
cat /proc/interrupts
            CPU0       CPU1       CPU2       CPU3       CPU4       CPU5       CPU6       CPU7       
   0:        135          0          0          0          0          0          0         13   IO-APIC-edge      timer
   1:          0          0          0          1          0          0          3       2122   IO-APIC-edge      i8042
   5:          1          0          1         36          8         36      13318      86601   IO-APIC-fasteoi   ioc0
   7:          1          0          0         80          9         32       7266      92614   IO-APIC-fasteoi   ioc1
   8:          0          0          0          0          0          0          0          1   IO-APIC-edge      rtc0
   9:          0          0          0          0          0          0          0          0   IO-APIC-fasteoi   acpi
  14:          0          0          0         10          0         32        166      22534   IO-APIC-edge      pata_amd
  15:          0          0          0          0          0          0          0          0   IO-APIC-edge      pata_amd
  19:          1          2          4         26          0          0          0          4   IO-APIC-fasteoi   firewire_ohci
  21:          0         45          6         26          0          0          0        170   IO-APIC-fasteoi   sata_nv, snd_hda_intel
  22:          0          0          0          0          0          0          0          3   IO-APIC-fasteoi   ehci_hcd:usb1, sata_nv
  23:       1159      17252      14592       5870          0          0          1         91   IO-APIC-fasteoi   ohci_hcd:usb2, sata_nv
  65:          0          0          0         67          0          0          0         32   PCI-MSI-edge      snd_hda_intel
  66:          0          0          0          1          0          0         41      29082   PCI-MSI-edge      em1
  67:          0          0          0         13          1          2        116     182733   PCI-MSI-edge      fglrx[0]@PCI:35:0:0
 NMI:          2          3          2          1          1          3          1          3   Non-maskable interrupts
 LOC:     117561     180615     174097     121455      60766     171416     148343     159991   Local timer interrupts
 SPU:          0          0          0          0          0          0          0          0   Spurious interrupts
 PMI:          2          3          2          1          1          3          1          3   Performance monitoring interrupts
 IWI:          0          0          0          0          0          0          0          0   IRQ work interrupts
 RES:     100236     110562     114431     113321      60345      36074      41292      33999   Rescheduling interrupts
 CAL:      10567        824        870        840       1167        755        872        726   Function call interrupts
 TLB:       2216       7235       3411       2308       3439       8355       3121       4972   TLB shootdowns
 TRM:          0          0          0          0          0          0          0          0   Thermal event interrupts
 THR:          0          0          0          0          0          0          0          0   Threshold APIC interrupts
 MCE:          0          0          0          0          0          0          0          0   Machine check exceptions
 MCP:          8          8          8          8          8          8          8          8   Machine check polls
 ERR:          1
 MIS:          0

and here parte of the dmesg:

[    3.197023] ioc0: LSI53C1030 C0: Capabilities={Initiator,Target}
[    3.333534] scsi8 : ioc0: LSI53C1030 C0, FwRev=01032700h, Ports=1, MaxQ=255, IRQ=5
[    3.333888] mptspi 0000:03:04.1: PCI IRQ 0 -> rerouted to legacy IRQ 16
[    3.333895] ACPI: Invalid index 16
[    3.333899] mptspi 0000:03:04.1: PCI INT B: no GSI - using ISA IRQ 7
[    3.334144] mptbase: ioc1: Initiating bringup
[    3.456023] ioc1: LSI53C1030 C0: Capabilities={Initiator,Target}
[    3.528508] irq 5: nobody cared (try booting with the "irqpoll" option)
[    3.528517] Pid: 0, comm: swapper/7 Not tainted 3.2.9-1.fc16.x86_64 #1
[    3.528521] Call Trace:
[    3.528525]  <IRQ>  [<ffffffff810e11ad>] __report_bad_irq+0x3d/0xe0
[    3.528547]  [<ffffffff810e146d>] note_interrupt+0x16d/0x220
[    3.528553]  [<ffffffff810dec39>] handle_irq_event_percpu+0xa9/0x220
[    3.528561]  [<ffffffff8101b953>] ? native_sched_clock+0x13/0x80
[    3.528566]  [<ffffffff810dedf4>] handle_irq_event+0x44/0x70
[    3.528571]  [<ffffffff810e1edf>] handle_fasteoi_irq+0x5f/0xf0
[    3.528578]  [<ffffffff81016226>] handle_irq+0x46/0xb0
[    3.528585]  [<ffffffff815ed5da>] do_IRQ+0x5a/0xe0
[    3.528594]  [<ffffffff815e2f2e>] common_interrupt+0x6e/0x6e
[    3.528597]  <EOI>  [<ffffffff8101c725>] ? default_idle+0x55/0x1d0
[    3.528606]  [<ffffffff8101c723>] ? default_idle+0x53/0x1d0
[    3.528611]  [<ffffffff8101c8fd>] amd_e400_idle+0x5d/0x120
[    3.528617]  [<ffffffff81013236>] cpu_idle+0xd6/0x120
[    3.528623]  [<ffffffff815d1360>] start_secondary+0x260/0x262
[    3.528627] handlers:
[    3.528637] [<ffffffffa017eb40>] mpt_interrupt
[    3.528640] Disabling IRQ #5
[    3.592625] scsi9 : ioc1: LSI53C1030 C0, FwRev=01032700h, Ports=1, MaxQ=255, IRQ=7
[    3.621497] firewire_core: created device fw0: GUID 0011d80001376d38, S400
[    3.728207] scsi 8:0:0:0: Direct-Access     IBM      IC35L036UWPR15-0 S80D PQ: 0 ANSI: 3
[    3.728267] scsi target8:0:0: Beginning Domain Validation
[    3.747527] dracut: Scanning devices sda3  for LVM logical volumes vg_fedorajppx64/lv_swap vg_fedorajppx64/lv_root
[    3.763741] dracut: inactive '/dev/vg_fedorajppx64/lv_swap' [7.75 GiB] inherit
[    3.763833] dracut: inactive '/dev/vg_fedorajppx64/lv_root' [7.75 GiB] inherit
[    3.785557] irq 7: nobody cared (try booting with the "irqpoll" option)
[    3.785565] Pid: 0, comm: swapper/7 Not tainted 3.2.9-1.fc16.x86_64 #1
[    3.785568] Call Trace:
[    3.785572]  <IRQ>  [<ffffffff810e11ad>] __report_bad_irq+0x3d/0xe0
[    3.785593]  [<ffffffff810e146d>] note_interrupt+0x16d/0x220
[    3.785599]  [<ffffffff810dec39>] handle_irq_event_percpu+0xa9/0x220
[    3.785606]  [<ffffffff8101b953>] ? native_sched_clock+0x13/0x80
[    3.785611]  [<ffffffff810dedf4>] handle_irq_event+0x44/0x70
[    3.785617]  [<ffffffff810e1edf>] handle_fasteoi_irq+0x5f/0xf0
[    3.785624]  [<ffffffff81016226>] handle_irq+0x46/0xb0
[    3.785631]  [<ffffffff815ed5da>] do_IRQ+0x5a/0xe0
[    3.785639]  [<ffffffff815e2f2e>] common_interrupt+0x6e/0x6e
[    3.785642]  <EOI>  [<ffffffff8103ce3b>] ? native_safe_halt+0xb/0x10
[    3.785653]  [<ffffffff8101c723>] default_idle+0x53/0x1d0
[    3.785658]  [<ffffffff8101c8fd>] amd_e400_idle+0x5d/0x120
[    3.785664]  [<ffffffff81013236>] cpu_idle+0xd6/0x120
[    3.785670]  [<ffffffff815d1360>] start_secondary+0x260/0x262
[    3.785674] handlers:
[    3.785683] [<ffffffffa017eb40>] mpt_interrupt
[    3.785687] Disabling IRQ #7

Comment 2 Dave Jones 2012-03-22 16:40:37 UTC
[mass update]
kernel-3.3.0-4.fc16 has been pushed to the Fedora 16 stable repository.
Please retest with this update.

Comment 3 Dave Jones 2012-03-22 16:45:38 UTC
[mass update]
kernel-3.3.0-4.fc16 has been pushed to the Fedora 16 stable repository.
Please retest with this update.

Comment 4 Dave Jones 2012-03-22 16:54:41 UTC
[mass update]
kernel-3.3.0-4.fc16 has been pushed to the Fedora 16 stable repository.
Please retest with this update.

Comment 5 tvizirov 2012-03-23 12:11:02 UTC
Hello, 


I have customer with the same problem on RHEL5.7 with kernel 2.6.18-194.17.1.el5, 64bit. 
Was something done to address the same issue on RHEL5.7 or this has been troubleshooted on Fedora only?

Comment 6 Dave Jones 2012-03-23 14:55:20 UTC
this a generic warning that could be triggered by any number of things. There's no guarantee this bug is the same as what is being seen in rhel5.

Comment 7 tvizirov 2012-03-24 11:14:02 UTC
The issue was resolved by adding the irqpoll to the end of the kernel stanza in the grub.conf file as suggested in:
>> https://access.redhat.com/knowledge/solutions/26013

I am talking about case 00584431.

Comment 8 Dave Jones 2012-07-12 15:44:20 UTC
If you can still reproduce this in 3.4, please reopen. We believe this should be fixed with the current updates.

Comment 9 tvizirov 2012-07-13 09:32:13 UTC
(In reply to comment #8)
> If you can still reproduce this in 3.4, please reopen. We believe this
> should be fixed with the current updates.

Mmm what do you mean to reproduce it on 3.4?