Fedora Account System
Red Hat Associate
Red Hat Customer
Description of problem: Sometimes, when coming out of suspend on my Lenovo Thinkpad T400, I experience a kernel bug, which prevents further use of the wireless card. Version-Release number of selected component (if applicable): Fedora Rawhide, kernel-2.6.36-0.27.rc5.git6.fc15.x86_64 How reproducible: Sometimes, not always afaict. Steps to Reproduce: Suspend to ram, then resume. See a black screen with the graphical mouse pointer visible, maybe some text console messages. Press ctrl-alt-F5, ctrl-alt-F2 to get back to the desktop. Network management is disabled, and I haven't found a way to re-activate my wifi. Workarounds appreciated. Additional info: Dump from dmesg CASE 1: ------------[ cut here ]------------ kernel BUG at drivers/pci/intel-iommu.c:1656! invalid opcode: 0000 [#1] SMP last sysfs file: /sys/devices/LNXSYSTM:00/LNXSYBUS:00/PNP0A08:00/device:00/PNP0C09:00/PNP0C0A:00/power_supply/BAT0/voltage_now CPU 1 Modules linked in: rfcomm sunrpc sco bnep l2cap cpufreq_ondemand acpi_cpufreq freq_table mperf ip6t_REJECT xt_physdev ipt_MASQUERADE nf_conntrack_ipv6 iptable_nat nf_nat ip6table_filter ip6_tables sha256_generic cryptd aes_x86_64 aes_generic cbc dm_crypt uinput arc4 ecb snd_hda_codec_conexant iwlagn snd_hda_intel iwlcore snd_hda_codec snd_hwdep mac80211 snd_seq snd_seq_device zaurus r852 uvcvideo cdc_ether microcode snd_pcm sm_common nand nand_ids btusb videodev cfg80211 nand_ecc usbnet v4l1_compat v4l2_compat_ioctl32 i2c_i801 cdc_wdm joydev iTCO_wdt mtd cdc_acm bluetooth mii iTCO_vendor_support snd_timer thinkpad_acpi e1000e snd_page_alloc rfkill wmi snd soundcore ipv6 firewire_ohci sdhci_pci sdhci mmc_core firewire_core yenta_socket crc_itu_t i915 drm_kms_helper drm i2c_algo_bit i2c_core video output [last unloaded: scsi_wait_scan] Pid: 8346, comm: NetworkManager Not tainted 2.6.36-0.27.rc5.git6.fc15.x86_64 #1 6474AR4/6474AR4 RIP: 0010:[<ffffffff8126e0b8>] [<ffffffff8126e0b8>] __domain_mapping+0x43/0x1ce RSP: 0018:ffff8801339e9648 EFLAGS: 00010206 RAX: 0000000001ffffff RBX: ffff8801341fd200 RCX: 000000000000001b RDX: 000000000010eabb RSI: 000ffffffffffdff RDI: ffff8801320d6f00 RBP: ffff8801339e9698 R08: 0000000000000001 R09: 0000000000000003 R10: ffff8801320d6f38 R11: 0000000000000000 R12: 0000000000000000 R13: ffff8801320d6f00 R14: ffff88013bc04ff8 R15: 0000000000000001 FS: 00007f165e262800(0000) GS:ffff880002e00000(0000) knlGS:0000000000000000 CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 CR2: 000000000046ab30 CR3: 000000013380a000 CR4: 00000000000006e0 DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400 Process NetworkManager (pid: 8346, threadinfo ffff8801339e8000, task ffff880133708000) Stack: ffff88013707e240 ffff8801320d6f00 ffff8801339e9698 000ffffffffffdff <0> 0000000000000000 ffff8801341fd200 000000010eabb000 ffff8801320d6f00 <0> ffff88013bc04ff8 0000000000000000 ffff8801339e96f8 ffffffff8126ed80 Call Trace: [<ffffffff8126ed80>] __intel_map_single.clone.25+0xdc/0x16b [<ffffffff8126eefc>] intel_alloc_coherent+0xae/0xd5 [<ffffffffa0184eb4>] e1000_alloc_ring_dma.clone.32+0x94/0xc0 [e1000e] [<ffffffffa01870dc>] e1000e_setup_tx_resources+0x65/0xaa [e1000e] [<ffffffffa0187611>] e1000_open+0x61/0x41a [e1000e] [<ffffffff813edad1>] __dev_open+0x96/0xcf [<ffffffff813edd15>] __dev_change_flags+0xad/0x130 [<ffffffff813ede19>] dev_change_flags+0x21/0x56 [<ffffffff813f806e>] do_setlink+0x2b5/0x61a [<ffffffff8107d7d3>] ? print_lock_contention_bug+0x1b/0xd5 [<ffffffff8124963f>] ? debug_check_no_obj_freed+0x65/0x18a [<ffffffff8107d7d3>] ? print_lock_contention_bug+0x1b/0xd5 [<ffffffff813f8633>] rtnl_setlink+0xd0/0xf2 [<ffffffff813f8921>] rtnetlink_rcv_msg+0x1eb/0x201 [<ffffffff813f8736>] ? rtnetlink_rcv_msg+0x0/0x201 [<ffffffff8140c2b1>] netlink_rcv_skb+0x45/0x90 [<ffffffff813f7ca3>] rtnetlink_rcv+0x26/0x2d [<ffffffff8140bd8c>] netlink_unicast+0xee/0x157 [<ffffffff8140c0ad>] netlink_sendmsg+0x2b8/0x2d6 [<ffffffff813d964e>] __sock_sendmsg+0x6b/0x77 [<ffffffff813d99a8>] sock_sendmsg+0xa8/0xc1 [<ffffffff8107fe13>] ? lock_acquire+0xee/0xfd [<ffffffff810fad84>] ? might_fault+0x5c/0xac [<ffffffff8107fd19>] ? lock_release+0x19a/0x1a6 [<ffffffff810fadcd>] ? might_fault+0xa5/0xac [<ffffffff813e3db7>] ? copy_from_user+0x2f/0x31 [<ffffffff813e41a7>] ? verify_iovec+0x54/0x91 [<ffffffff813db96f>] sys_sendmsg+0x235/0x2b3 [<ffffffff8112b67a>] ? rcu_read_lock+0x0/0x35 [<ffffffff8107fe13>] ? lock_acquire+0xee/0xfd [<ffffffff8112b67a>] ? rcu_read_lock+0x0/0x35 [<ffffffff813db403>] ? sys_sendto+0x125/0x152 [<ffffffff8112c0f6>] ? fput+0x22/0x1d6 [<ffffffff8112c002>] ? fget_light+0x79/0x83 [<ffffffff811339af>] ? path_put+0x22/0x27 [<ffffffff810a828b>] ? audit_syscall_entry+0x11c/0x148 [<ffffffff8149c5ba>] ? trace_hardirqs_on_thunk+0x3a/0x3f [<ffffffff81009cb2>] system_call_fastpath+0x16/0x1b Code: d4 48 89 ca 48 89 7d b8 6b 8f 84 00 00 00 09 48 89 75 c8 4d 89 c7 83 c1 12 83 f9 3f 7f 0f 4a 8d 44 06 ff 48 d3 e8 48 85 c0 74 02 <0f> 0b 41 f6 c1 03 b8 ea ff ff ff 0f 84 6b 01 00 00 41 81 e1 03 RIP [<ffffffff8126e0b8>] __domain_mapping+0x43/0x1ce RSP <ffff8801339e9648> ---[ end trace c2e21288fc51658f ]--- Dump from dmesg CASE 2: ------------[ cut here ]------------ kernel BUG at drivers/pci/intel-iommu.c:1656! invalid opcode: 0000 [#1] SMP last sysfs file: /sys/devices/system/cpu/sched_mc_power_savings CPU 0 Modules linked in: rfcomm sunrpc sco bnep l2cap cpufreq_ondemand acpi_cpufreq freq_table mperf ip6t_REJECT xt_physdev ipt_MASQUERADE nf_conntrack_ipv6 ip6table_filter iptable_nat nf_nat ip6_tables sha256_generic cryptd aes_x86_64 aes_generic cbc dm_crypt uinput arc4 ecb iwlagn snd_hda_codec_conexant snd_hda_intel iwlcore snd_hda_codec mac80211 snd_hwdep zaurus snd_seq microcode snd_seq_device uvcvideo r852 cdc_ether sm_common snd_pcm nand btusb usbnet nand_ids nand_ecc cdc_acm joydev cfg80211 i2c_i801 mtd videodev cdc_wdm mii bluetooth thinkpad_acpi iTCO_wdt v4l1_compat v4l2_compat_ioctl32 iTCO_vendor_support snd_timer rfkill e1000e snd_page_alloc snd wmi soundcore ipv6 sdhci_pci firewire_ohci sdhci firewire_core mmc_core yenta_socket crc_itu_t i915 drm_kms_helper drm i2c_algo_bit i2c_core video output [last unloaded: scsi_wait_scan] Pid: 1367, comm: NetworkManager Not tainted 2.6.36-0.27.rc5.git6.fc15.x86_64 #1 6474AR4/6474AR4 RIP: 0010:[<ffffffff8126e0b8>] [<ffffffff8126e0b8>] __domain_mapping+0x43/0x1ce RSP: 0018:ffff880130db1648 EFLAGS: 00010206 RAX: 0000000001ffffff RBX: ffff8800a29b2580 RCX: 000000000000001b RDX: 0000000000067b9a RSI: 000ffffffffffdff RDI: ffff8801320d6dc0 RBP: ffff880130db1698 R08: 0000000000000001 R09: 0000000000000003 R10: ffff8801320d6df8 R11: 0000000000000000 R12: 0000000000000000 R13: ffff8801320d6dc0 R14: ffff88013bc04ff8 R15: 0000000000000001 FS: 00007f284f8a2800(0000) GS:ffff880002c00000(0000) knlGS:0000000000000000 CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 CR2: 0000000003a53048 CR3: 0000000131a71000 CR4: 00000000000006f0 DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400 Process NetworkManager (pid: 1367, threadinfo ffff880130db0000, task ffff880130e4c880) Stack: ffff88013707e240 ffff8801320d6dc0 ffff880130db1698 000ffffffffffdff <0> 0000000000000000 ffff8800a29b2580 0000000067b9a000 ffff8801320d6dc0 <0> ffff88013bc04ff8 0000000000000000 ffff880130db16f8 ffffffff8126ed80 Call Trace: [<ffffffff8126ed80>] __intel_map_single.clone.25+0xdc/0x16b [<ffffffff8126eefc>] intel_alloc_coherent+0xae/0xd5 [<ffffffffa016beb4>] e1000_alloc_ring_dma.clone.32+0x94/0xc0 [e1000e] [<ffffffffa016e0dc>] e1000e_setup_tx_resources+0x65/0xaa [e1000e] [<ffffffffa016e611>] e1000_open+0x61/0x41a [e1000e] [<ffffffff813edad1>] __dev_open+0x96/0xcf [<ffffffff813edd15>] __dev_change_flags+0xad/0x130 [<ffffffff813ede19>] dev_change_flags+0x21/0x56 [<ffffffff813f806e>] do_setlink+0x2b5/0x61a [<ffffffff8107d7d3>] ? print_lock_contention_bug+0x1b/0xd5 [<ffffffff8124963f>] ? debug_check_no_obj_freed+0x65/0x18a [<ffffffff8107d7d3>] ? print_lock_contention_bug+0x1b/0xd5 [<ffffffff813f8633>] rtnl_setlink+0xd0/0xf2 [<ffffffff813f8921>] rtnetlink_rcv_msg+0x1eb/0x201 [<ffffffff813f8736>] ? rtnetlink_rcv_msg+0x0/0x201 [<ffffffff8140c2b1>] netlink_rcv_skb+0x45/0x90 [<ffffffff813f7ca3>] rtnetlink_rcv+0x26/0x2d [<ffffffff8140bd8c>] netlink_unicast+0xee/0x157 [<ffffffff8140c0ad>] netlink_sendmsg+0x2b8/0x2d6 [<ffffffff813d964e>] __sock_sendmsg+0x6b/0x77 [<ffffffff813d99a8>] sock_sendmsg+0xa8/0xc1 [<ffffffff8107fe13>] ? lock_acquire+0xee/0xfd [<ffffffff810fad84>] ? might_fault+0x5c/0xac [<ffffffff8107fd19>] ? lock_release+0x19a/0x1a6 [<ffffffff810fadcd>] ? might_fault+0xa5/0xac [<ffffffff813e3db7>] ? copy_from_user+0x2f/0x31 [<ffffffff813e41a7>] ? verify_iovec+0x54/0x91 [<ffffffff813db96f>] sys_sendmsg+0x235/0x2b3 [<ffffffff8112b67a>] ? rcu_read_lock+0x0/0x35 [<ffffffff8107fe13>] ? lock_acquire+0xee/0xfd [<ffffffff8112b67a>] ? rcu_read_lock+0x0/0x35 [<ffffffff813db403>] ? sys_sendto+0x125/0x152 [<ffffffff8112c0f6>] ? fput+0x22/0x1d6 [<ffffffff8112c002>] ? fget_light+0x79/0x83 [<ffffffff811339af>] ? path_put+0x22/0x27 [<ffffffff810a828b>] ? audit_syscall_entry+0x11c/0x148 [<ffffffff8149c5ba>] ? trace_hardirqs_on_thunk+0x3a/0x3f [<ffffffff81009cb2>] system_call_fastpath+0x16/0x1b Code: d4 48 89 ca 48 89 7d b8 6b 8f 84 00 00 00 09 48 89 75 c8 4d 89 c7 83 c1 12 83 f9 3f 7f 0f 4a 8d 44 06 ff 48 d3 e8 48 85 c0 74 02 <0f> 0b 41 f6 c1 03 b8 ea ff ff ff 0f 84 6b 01 00 00 41 81 e1 03 RIP [<ffffffff8126e0b8>] __domain_mapping+0x43/0x1ce RSP <ffff880130db1648> ---[ end trace 906240448e864842 ]--- intel-gpu-tools-2.12.0-6.fc14.x86_64 xorg-x11-drv-intel-2.12.0-6.fc14.x86_64 NetworkManager-0.8.1-6.git20100831.fc15.x86_64 00:02.1 Display controller: Intel Corporation Mobile 4 Series Chipset Integrated Graphics Controller (rev 07)
I just got a new one now, when resuming from suspend(ram): # uname -r 2.6.36-0.32.rc6.git2.fc15.x86_64 # cat /proc/cmdline ro root=/dev/VolGroup00/lv_root rhgb quiet selinux=0 vga=0x318 SYSFONT=latarcyrheb-sun16 LANG=en_US.UTF-8 KEYTABLE=no intel_iommu=igfx_off [15016.425443] PM: resume of devices complete after 994.178 msecs [15016.426097] PM: resume devices took 0.995 seconds [15016.447838] PM: Finishing wakeup. [15016.447983] Restarting tasks ... [15016.448102] usb 2-4: USB disconnect, address 2 [15016.478438] ehci_hcd 0000:00:1d.7: dma_pool_free buffer-2048, ffff8801300ec000/ffffb000 (bad dma) [15016.479854] ehci_hcd 0000:00:1d.7: dma_pool_free buffer-2048, ffff8801300ec080/ffffb080 (bad dma) [15016.480301] cdc_ether 2-4:1.7: wwan0: unregister 'cdc_ether' usb-0000:00:1d.7-4, Mobile Broadband Network Device [15016.488016] done. [15016.488209] video LNXVIDEO:00: Restoring backlight state [15017.346377] ------------[ cut here ]------------ [15017.347007] kernel BUG at drivers/pci/intel-iommu.c:1656! [15017.347007] invalid opcode: 0000 [#1] SMP [15017.347007] last sysfs file: /sys/devices/system/cpu/cpu1/cpufreq/scaling_governor [15017.347007] CPU 0 [15017.347007] Modules linked in: rfcomm sunrpc sco bnep l2cap cpufreq_ondemand acpi_cpufreq freq_table mperf ip6t_REJECT nf_conntrack_ipv6 xt_physdev ipt_MASQUERADE ip6table_filter ip6_tables iptable_nat nf_nat sha256_generic cryptd aes_x86_64 aes_generic cbc dm_crypt uinput arc4 ecb iwlagn snd_hda_codec_conexant snd_hda_intel iwlcore snd_hda_codec microcode mac80211 snd_hwdep snd_seq zaurus r852 sm_common snd_seq_device nand cdc_ether joydev uvcvideo nand_ids snd_pcm nand_ecc usbnet i2c_i801 mtd cdc_acm cfg80211 btusb cdc_wdm mii videodev iTCO_wdt thinkpad_acpi bluetooth iTCO_vendor_support v4l1_compat v4l2_compat_ioctl32 e1000e snd_timer rfkill snd_page_alloc snd wmi soundcore ipv6 sdhci_pci sdhci firewire_ohci mmc_core firewire_core yenta_socket crc_itu_t i915 drm_kms_helper drm i2c_algo_bit i2c_core video output [last unloaded: scsi_wait_scan] [15017.347007] [15017.347007] Pid: 1356, comm: NetworkManager Not tainted 2.6.36-0.32.rc6.git2.fc15.x86_64 #1 6474AR4/6474AR4 [15017.347007] RIP: 0010:[<ffffffff8126e9e8>] [<ffffffff8126e9e8>] __domain_mapping+0x43/0x1ce [15017.347007] RSP: 0018:ffff88012fb9f648 EFLAGS: 00010206 [15017.347007] RAX: 0000000001ffffff RBX: ffff88007e81fa00 RCX: 000000000000001b [15017.347007] RDX: 00000000000967f6 RSI: 000ffffffffffdff RDI: ffff880132107040 [15017.347007] RBP: ffff88012fb9f698 R08: 0000000000000001 R09: 0000000000000003 [15017.347007] R10: ffff880132107078 R11: 0000000000000000 R12: 0000000000000000 [15017.347007] R13: ffff880132107040 R14: ffff88013bc04ff8 R15: 0000000000000001 [15017.347007] FS: 00007fd79152b800(0000) GS:ffff880002c00000(0000) knlGS:0000000000000000 [15017.347007] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [15017.347007] CR2: 000000000042da00 CR3: 0000000132eea000 CR4: 00000000000006f0 [15017.347007] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 [15017.347007] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400 [15017.347007] Process NetworkManager (pid: 1356, threadinfo ffff88012fb9e000, task ffff8801330b0000) [15017.347007] Stack: [15017.347007] ffff88013708e240 ffff880132107040 ffff88012fb9f698 000ffffffffffdff [15017.347007] <0> 0000000000000000 ffff88007e81fa00 00000000967f6000 ffff880132107040 [15017.347007] <0> ffff88013bc04ff8 0000000000000000 ffff88012fb9f6f8 ffffffff8126f6b0 [15017.347007] Call Trace: [15017.347007] [<ffffffff8126f6b0>] __intel_map_single.clone.25+0xdc/0x16b [15017.347007] [<ffffffff8126f82c>] intel_alloc_coherent+0xae/0xd5 [15017.347007] [<ffffffffa016c128>] e1000_alloc_ring_dma.clone.28+0x94/0xc0 [e1000e] [15017.347007] [<ffffffffa016e359>] e1000e_setup_tx_resources+0x65/0xaa [e1000e] [15017.347007] [<ffffffffa016e891>] e1000_open+0x64/0x41e [e1000e] [15017.347007] [<ffffffff813eeab7>] __dev_open+0x9b/0xd2 [15017.347007] [<ffffffff813eecf9>] __dev_change_flags+0xad/0x130 [15017.347007] [<ffffffff813eedfd>] dev_change_flags+0x21/0x56 [15017.347007] [<ffffffff813f9069>] do_setlink+0x2ba/0x61f [15017.347007] [<ffffffff8107d8e7>] ? print_lock_contention_bug+0x1b/0xd5 [15017.347007] [<ffffffff81249f1f>] ? debug_check_no_obj_freed+0x65/0x18a [15017.347007] [<ffffffff8107d8e7>] ? print_lock_contention_bug+0x1b/0xd5 [15017.347007] [<ffffffff813f962e>] rtnl_setlink+0xd0/0xf2 [15017.347007] [<ffffffff813f991c>] rtnetlink_rcv_msg+0x1eb/0x201 [15017.347007] [<ffffffff813f9731>] ? rtnetlink_rcv_msg+0x0/0x201 [15017.347007] [<ffffffff8140d325>] netlink_rcv_skb+0x45/0x90 [15017.347007] [<ffffffff813f8c99>] rtnetlink_rcv+0x26/0x2d [15017.347007] [<ffffffff8140ce00>] netlink_unicast+0xee/0x157 [15017.347007] [<ffffffff8140d121>] netlink_sendmsg+0x2b8/0x2d6 [15017.347007] [<ffffffff813da5de>] __sock_sendmsg+0x6b/0x77 [15017.347007] [<ffffffff813da938>] sock_sendmsg+0xa8/0xc1 [15017.347007] [<ffffffff8107ff27>] ? lock_acquire+0xee/0xfd [15017.347007] [<ffffffff810fb018>] ? might_fault+0x5c/0xac [15017.347007] [<ffffffff8107fe2d>] ? lock_release+0x19a/0x1a6 [15017.347007] [<ffffffff810fb061>] ? might_fault+0xa5/0xac [15017.347007] [<ffffffff813e4d4b>] ? copy_from_user+0x2f/0x31 [15017.347007] [<ffffffff813e513e>] ? verify_iovec+0x57/0x99 [15017.347007] [<ffffffff813dc901>] sys_sendmsg+0x235/0x2b3 [15017.347007] [<ffffffff8112ba96>] ? rcu_read_lock+0x0/0x35 [15017.347007] [<ffffffff8107ff27>] ? lock_acquire+0xee/0xfd [15017.347007] [<ffffffff8112ba96>] ? rcu_read_lock+0x0/0x35 [15017.347007] [<ffffffff813dc395>] ? sys_sendto+0x125/0x152 [15017.347007] [<ffffffff8112c512>] ? fput+0x22/0x1d6 [15017.347007] [<ffffffff8112c41e>] ? fget_light+0x79/0x83 [15017.347007] [<ffffffff81133dcb>] ? path_put+0x22/0x27 [15017.347007] [<ffffffff810a83e3>] ? audit_syscall_entry+0x11c/0x148 [15017.347007] [<ffffffff8149d865>] ? trace_hardirqs_on_thunk+0x3a/0x3f [15017.347007] [<ffffffff81009cf2>] system_call_fastpath+0x16/0x1b [15017.347007] Code: d4 48 89 ca 48 89 7d b8 6b 8f 84 00 00 00 09 48 89 75 c8 4d 89 c7 83 c1 12 83 f9 3f 7f 0f 4a 8d 44 06 ff 48 d3 e8 48 85 c0 74 02 <0f> 0b 41 f6 c1 03 b8 ea ff ff ff 0f 84 6b 01 00 00 41 81 e1 03 [15017.347007] RIP [<ffffffff8126e9e8>] __domain_mapping+0x43/0x1ce [15017.347007] RSP <ffff88012fb9f648> [15017.451075] ---[ end trace 32d916dc6cf7b186 ]---
I have reported this bug also here: https://bugs.freedesktop.org/show_bug.cgi?id=30722 https://bugzilla.kernel.org/show_bug.cgi?id=19942 I am now running the vanilla kernel, built like this: localhost:~/linux-2.6.36-rc7:$ cp \ /boot/config-2.6.36-0.35.rc7.git0.fc15.x86_64 .config \ && make With this kernel, the problem does not occur. However, I can trigger it with the new kernel-2.6.36-0.35.rc7.git0.fc15.x86_64 if I do "service NetworkManager restart". Doing the same in the vanilla kernel works just fine. So, are there any bad Fedora patches that triggers this?
Created attachment 453795 [details] diff between .confg of vanilla linux-2.6.36-rc7 based on config-2.6.36-0.35.rc7.git0.fc15.x86_64
Created attachment 453796 [details] .config of linux-2.6.36-rc7 after make
As discussed on irc, it turns out to be a bug at the intersection of PCI CRS and Intel IOMMU. Will be fixed in the next rawhide build. (I think it might be an issue on F-14 too, so I'll fix it there as well. Thanks! Kyle