Bug 574146 - iwl3945 "wrong command queue" on wake-up
Summary: iwl3945 "wrong command queue" on wake-up
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Fedora
Classification: Fedora
Component: kernel
Version: 12
Hardware: All
OS: Linux
low
medium
Target Milestone: ---
Assignee: Stanislaw Gruszka
QA Contact: Fedora Extras Quality Assurance
URL:
Whiteboard:
Depends On:
Blocks: 601171
TreeView+ depends on / blocked
 
Reported: 2010-03-16 18:14 UTC by James M. Leddy
Modified: 2014-08-11 05:42 UTC (History)
9 users (show)

Fixed In Version: kernel-2.6.32.11-99.fc12
Doc Type: Bug Fix
Doc Text:
Clone Of:
: 601171 (view as bug list)
Environment:
Last Closed: 2010-04-10 10:29:34 UTC
Type: ---
Embargoed:


Attachments (Terms of Use)

Description James M. Leddy 2010-03-16 18:14:28 UTC
Description of problem:

Every time, after suspending my laptop. Network manager doesn't work. If I uncheck and check the box repeatedly, sometimes it comes back, but it takes awhile. Just this latest time I did a straight ifconfig up and everything came back instantly

There are two backtraces, one of alloc_pages_nodemask, and one of iwl_tx_cmd_complete. Looking through my logs, it seems to always be in this order, but I'm not sure.

Relevent sections of /var/log/messages:

Mar 16 13:51:37 localhost NetworkManager: <info>  (wlan0): bringing up device.
Mar 16 13:51:37 localhost kernel: NetworkManager: page allocation failure. order:4, mode:0x40d0
Mar 16 13:51:37 localhost kernel: Pid: 1530, comm: NetworkManager Tainted: G        W  2.6.32.7-37.fc12.x86_64 #1
Mar 16 13:51:37 localhost kernel: Call Trace:
Mar 16 13:51:37 localhost kernel: [<ffffffff810dc85d>] __alloc_pages_nodemask+0x5ad/0x630
Mar 16 13:51:37 localhost kernel: [<ffffffff81104b93>] alloc_pages_current+0x95/0x9e
Mar 16 13:51:37 localhost kernel: [<ffffffff810db0f7>] __get_free_pages+0xe/0x4b
Mar 16 13:51:37 localhost kernel: [<ffffffffa02504e8>] iwl_tx_queue_init+0x102/0x2aa [iwlcore]
Mar 16 13:51:37 localhost kernel: [<ffffffffa02524fe>] iwl_txq_ctx_reset+0x170/0x1fa [iwlcore]
Mar 16 13:51:37 localhost kernel: [<ffffffffa0246f3f>] iwl_hw_nic_init+0x133/0x148 [iwlcore]
Mar 16 13:51:37 localhost kernel: [<ffffffffa02d82db>] __iwl_up+0x16b/0x3ab [iwlagn]
Mar 16 13:51:37 localhost kernel: [<ffffffffa02d955a>] iwl_mac_start+0x103f/0x12bb [iwlagn]
Mar 16 13:51:37 localhost kernel: [<ffffffff813ce237>] ? nlmsg_notify+0x48/0x8e
Mar 16 13:51:37 localhost kernel: [<ffffffffa01e5fcb>] ieee80211_open+0x270/0x5c4 [mac80211]
Mar 16 13:51:37 localhost kernel: [<ffffffff81457a1a>] ? notifier_call_chain+0x37/0x63
Mar 16 13:51:37 localhost kernel: [<ffffffff813b474b>] dev_open+0x9d/0xd8
Mar 16 13:51:37 localhost kernel: [<ffffffff813b3ea5>] dev_change_flags+0xad/0x16d
Mar 16 13:51:37 localhost kernel: [<ffffffff813bd432>] do_setlink+0x26c/0x33d
Mar 16 13:51:37 localhost kernel: [<ffffffff811eaaca>] ? avc_has_perm+0x5c/0x6e
Mar 16 13:51:37 localhost kernel: [<ffffffff813bd616>] rtnl_setlink+0x113/0x126
Mar 16 13:51:37 localhost kernel: [<ffffffff813bce7d>] rtnetlink_rcv_msg+0x1c6/0x1e3
Mar 16 13:51:37 localhost kernel: [<ffffffff813ce85b>] ? netlink_sendmsg+0x174/0x26b
Mar 16 13:51:37 localhost kernel: [<ffffffff813bccb7>] ? rtnetlink_rcv_msg+0x0/0x1e3
Mar 16 13:51:37 localhost kernel: [<ffffffff813ce3b7>] netlink_rcv_skb+0x43/0x94
Mar 16 13:51:37 localhost kernel: [<ffffffff813bccb0>] rtnetlink_rcv+0x26/0x2d
Mar 16 13:51:37 localhost kernel: [<ffffffff813ce186>] netlink_unicast+0x125/0x18e
Mar 16 13:51:37 localhost kernel: [<ffffffff813ce943>] netlink_sendmsg+0x25c/0x26b
Mar 16 13:51:37 localhost kernel: [<ffffffff813a2f86>] __sock_sendmsg+0x61/0x6c
Mar 16 13:51:37 localhost kernel: [<ffffffff813a36e9>] sock_sendmsg+0xcc/0xe5
Mar 16 13:51:37 localhost kernel: [<ffffffff813a35be>] ? sock_recvmsg+0xcf/0xe8
Mar 16 13:51:37 localhost kernel: [<ffffffff81074987>] ? autoremove_wake_function+0x0/0x39
Mar 16 13:51:37 localhost kernel: [<ffffffff81074987>] ? autoremove_wake_function+0x0/0x39
Mar 16 13:51:37 localhost kernel: [<ffffffff813a4285>] ? move_addr_to_kernel+0x48/0x4d
Mar 16 13:51:37 localhost kernel: [<ffffffff813acbcf>] ? verify_iovec+0x51/0x8e
Mar 16 13:51:37 localhost kernel: [<ffffffff813a3923>] sys_sendmsg+0x221/0x2a5
Mar 16 13:51:37 localhost kernel: [<ffffffff8110b397>] ? virt_to_head_page+0xe/0x2f
Mar 16 13:51:37 localhost kernel: [<ffffffff813a3043>] ? sockfd_lookup_light+0x20/0x58
Mar 16 13:51:37 localhost kernel: [<ffffffff813a3021>] ? fput_light+0x12/0x14
Mar 16 13:51:37 localhost kernel: [<ffffffff813a43af>] ? sys_sendto+0x125/0x152
Mar 16 13:51:37 localhost kernel: [<ffffffff8111c485>] ? __fput+0x1cd/0x1dc
Mar 16 13:51:37 localhost kernel: [<ffffffff8111c4ae>] ? fput+0x1a/0x1c
Mar 16 13:51:37 localhost kernel: [<ffffffff8111894d>] ? filp_close+0x68/0x72
Mar 16 13:51:37 localhost kernel: [<ffffffff81011cf2>] system_call_fastpath+0x16/0x1b
Mar 16 13:51:37 localhost kernel: Mem-Info:
Mar 16 13:51:37 localhost kernel: Node 0 DMA per-cpu:
Mar 16 13:51:37 localhost kernel: CPU    0: hi:    0, btch:   1 usd:   0
Mar 16 13:51:37 localhost kernel: CPU    1: hi:    0, btch:   1 usd:   0
Mar 16 13:51:37 localhost dbus: Rejected send message, 1 matched rules; type="method_return", sender=":1.8" (uid=0 pid=1530 comm="NetworkManager) interface="(unset)" member="(unset)" error name="(unset)" requested_reply=0 destination=":1.54" (uid=1000 pid=1896 comm="nm-applet))
Mar 16 13:51:37 localhost kernel: Node 0 DMA32 per-cpu:
Mar 16 13:51:37 localhost kernel: CPU    0: hi:  186, btch:  31 usd:   0
Mar 16 13:51:37 localhost kernel: CPU    1: hi:  186, btch:  31 usd:  32
Mar 16 13:51:37 localhost kernel: active_anon:90830 inactive_anon:50968 isolated_anon:0
Mar 16 13:51:37 localhost kernel: active_file:114298 inactive_file:109357 isolated_file:32
Mar 16 13:51:37 localhost kernel: unevictable:4 dirty:1509 writeback:0 unstable:0

Mar 16 13:51:37 localhost kernel: free:49536 slab_reclaimable:19956 slab_unreclaimable:25350
Mar 16 13:51:37 localhost kernel: mapped:16724 shmem:22401 pagetables:6240 bounce:0
Mar 16 13:51:37 localhost kernel: Node 0 DMA free:7648kB min:40kB low:48kB high:60kB active_anon:0kB inactive_anon:48kB active_file:6332kB inactive_file:1424kB unevictable:0kB isolated(anon):0kB isolated(file):0kB present:15364kB mlocked:0kB dirty:0kB writeback:0kB mapped:0kB shmem:0kB slab_reclaimable:292kB slab_unreclaimable:120kB kernel_stack:8kB pagetables:0kB unstable:0kB bounce:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? no
Mar 16 13:51:37 localhost kernel: lowmem_reserve[]: 0 1902 1902 1902
Mar 16 13:51:37 localhost kernel: Node 0 DMA32 free:190496kB min:5560kB low:6948kB high:8340kB active_anon:363320kB inactive_anon:203824kB active_file:450860kB inactive_file:436004kB unevictable:16kB isolated(anon):0kB isolated(file):128kB present:1947648kB mlocked:16kB dirty:6036kB writeback:0kB mapped:66896kB shmem:89604kB slab_reclaimable:79532kB slab_unreclaimable:101280kB kernel_stack:2440kB pagetables:24960kB unstable:0kB bounce:0kB writeback_tmp:0kB pages_scanned:97 all_unreclaimable? no
Mar 16 13:51:37 localhost kernel: lowmem_reserve[]: 0 0 0 0
Mar 16 13:51:37 localhost kernel: Node 0 DMA: 4*4kB 4*8kB 31*16kB 30*32kB 20*64kB 8*128kB 5*256kB 1*512kB 0*1024kB 1*2048kB 0*4096kB = 7648kB
Mar 16 13:51:37 localhost kernel: Node 0 DMA32: 16554*4kB 14639*8kB 400*16kB 14*32kB 1*64kB 0*128kB 1*256kB 0*512kB 0*1024kB 0*2048kB 0*4096kB = 190496kB
Mar 16 13:51:37 localhost kernel: 254487 total pagecache pages
Mar 16 13:51:37 localhost kernel: 8426 pages in swap cache
Mar 16 13:51:37 localhost kernel: Swap cache stats: add 643927, delete 635501, find 1460195/1524979
Mar 16 13:51:37 localhost kernel: Free swap  = 1795008kB
Mar 16 13:51:37 localhost kernel: Total swap = 2097144kB
Mar 16 13:51:37 localhost kernel: 498688 pages RAM
Mar 16 13:51:37 localhost kernel: 10810 pages reserved
Mar 16 13:51:37 localhost kernel: 242479 pages shared
Mar 16 13:51:37 localhost kernel: 233134 pages non-shared
Mar 16 13:51:37 localhost kernel: iwlagn 0000:03:00.0: kmalloc for auxiliary BD structures failed
Mar 16 13:51:37 localhost kernel: iwlagn 0000:03:00.0: Tx 17 queue init failed
Mar 16 13:51:37 localhost kernel: iwlagn 0000:03:00.0: Unable to init nic
Mar 16 13:52:11 localhost NetworkManager: <info>  Sleeping...
Mar 16 13:52:11 localhost NetworkManager: <info>  (eth0): now unmanaged
Mar 16 13:52:11 localhost NetworkManager: <info>  (eth0): device state change: 2 -> 1 (reason 37)
Mar 16 13:52:11 localhost NetworkManager: <info>  (eth0): cleaning up...
Mar 16 13:52:11 localhost NetworkManager: <info>  (eth0): taking down device.
Mar 16 13:52:11 localhost NetworkManager: <info>  (wlan0): now unmanaged
Mar 16 13:52:11 localhost NetworkManager: <info>  (wlan0): device state change: 2 -> 1 (reason 37)
Mar 16 13:52:13 localhost NetworkManager: <info>  Waking up...
Mar 16 13:52:13 localhost NetworkManager: <info>  Wireless now enabled by radio killswitch
Mar 16 13:52:13 localhost NetworkManager: <info>  (eth0): now managed
Mar 16 13:52:13 localhost NetworkManager: <info>  (eth0): device state change: 1 -> 2 (reason 2)
Mar 16 13:52:13 localhost NetworkManager: <info>  (eth0): bringing up device.
Mar 16 13:52:14 localhost NetworkManager: <info>  (eth0): preparing device.
Mar 16 13:52:14 localhost kernel: ADDRCONF(NETDEV_UP): eth0: link is not ready
Mar 16 13:52:14 localhost NetworkManager: <info>  (eth0): deactivating device (reason: 2).
Mar 16 13:52:14 localhost NetworkManager: <info>  (wlan0): now managed
Mar 16 13:52:14 localhost NetworkManager: <info>  (wlan0): device state change: 1 -> 2 (reason 2)
Mar 16 13:52:14 localhost NetworkManager: <info>  (wlan0): bringing up device.
Mar 16 13:52:14 localhost kernel: ------------[ cut here ]------------
Mar 16 13:52:14 localhost kernel: WARNING: at drivers/net/wireless/iwlwifi/iwl-tx.c:1151 iwl_tx_cmd_complete+0x65/0x2b4 [iwlcore]()
Mar 16 13:52:14 localhost kernel: Hardware name: 4057W7N
Mar 16 13:52:14 localhost kernel: wrong command queue 0, sequence 0x0 readp=0 writep=0
Mar 16 13:52:14 localhost kernel: Modules linked in: ppp_deflate zlib_deflate ppp_async crc_ccitt ppp_generic slhc rndis_wlan rndis_host cdc_ether cdc_phonet usbnet phonet mii cdc_acm nfsd exportfs tun nfs lockd fscache nfs_acl auth_rpcgss cryptd aes_x86_64 aes_generic hidp fuse rfcomm sco bridge stp llc bnep l2cap sunrpc cpufreq_ondemand acpi_cpufreq freq_table ip6t_REJECT nf_conntrack_ipv6 ip6table_filter ip6_tables ipv6 uinput arc4 ecb snd_hda_codec_conexant snd_hda_intel iwlagn snd_hda_codec snd_hwdep uvcvideo snd_seq iwlcore snd_seq_device snd_pcm mac80211 videodev btusb v4l1_compat snd_timer v4l2_compat_ioctl32 i2c_i801 joydev bluetooth thinkpad_acpi e1000e cfg80211 iTCO_wdt snd wmi iTCO_vendor_support rfkill soundcore snd_page_alloc nilfs2 dm_multipath i915 drm_kms_helper drm i2c_algo_bit i2c_core video output [last unloaded: microcode]
Mar 16 13:52:14 localhost kernel: Pid: 0, comm: swapper Tainted: G        W  2.6.32.7-37.fc12.x86_64 #1
Mar 16 13:52:14 localhost kernel: Call Trace:
Mar 16 13:52:14 localhost kernel: <IRQ>  [<ffffffff81056338>] warn_slowpath_common+0x7c/0x94
Mar 16 13:52:14 localhost kernel: [<ffffffff810563a7>] warn_slowpath_fmt+0x41/0x43
Mar 16 13:52:14 localhost kernel: [<ffffffffa024f48d>] iwl_tx_cmd_complete+0x65/0x2b4 [iwlcore]
Mar 16 13:52:14 localhost kernel: [<ffffffff8107bbff>] ? timekeeping_get_ns+0x1b/0x3d
Mar 16 13:52:14 localhost kernel: [<ffffffffa02d449d>] iwl_rx_handle+0x2e2/0x3c3 [iwlagn]
Mar 16 13:52:14 localhost kernel: [<ffffffff810804dd>] ? tick_dev_program_event+0x3c/0xaa
Mar 16 13:52:14 localhost kernel: [<ffffffffa02d28f2>] ? __iwl_write32+0xba/0xc9 [iwlagn]
Mar 16 13:52:14 localhost kernel: [<ffffffffa0242f96>] ? __iwl_write32+0xba/0xc9 [iwlcore]
Mar 16 13:52:14 localhost kernel: [<ffffffffa02d6d90>] iwl_irq_tasklet+0x592/0x729 [iwlagn]
Mar 16 13:52:14 localhost kernel: [<ffffffffa0245ee9>] ? tasklet_schedule+0x1a/0x1c [iwlcore]
Mar 16 13:52:14 localhost kernel: [<ffffffffa0248243>] ? iwl_isr_ict+0x42c/0x483 [iwlcore]
Mar 16 13:52:14 localhost kernel: [<ffffffff8105bf6b>] tasklet_action+0x85/0xe4
Mar 16 13:52:14 localhost kernel: [<ffffffff8105d954>] __do_softirq+0xe5/0x1a9
Mar 16 13:52:14 localhost kernel: [<ffffffff810ace25>] ? handle_IRQ_event+0x60/0x121
Mar 16 13:52:14 localhost kernel: [<ffffffff81012e2c>] call_softirq+0x1c/0x30
Mar 16 13:52:14 localhost kernel: [<ffffffff810143aa>] do_softirq+0x46/0x86
Mar 16 13:52:14 localhost kernel: [<ffffffff8105d792>] irq_exit+0x3b/0x7d
Mar 16 13:52:14 localhost kernel: [<ffffffff814594dd>] do_IRQ+0xa5/0xbc
Mar 16 13:52:14 localhost kernel: [<ffffffff81012653>] ret_from_intr+0x0/0x11
Mar 16 13:52:14 localhost kernel: <EOI>  [<ffffffff812943f2>] ? acpi_idle_enter_bm+0x277/0x2ab
Mar 16 13:52:14 localhost kernel: [<ffffffff812943eb>] ? acpi_idle_enter_bm+0x270/0x2ab
Mar 16 13:52:14 localhost kernel: [<ffffffff813880ce>] ? cpuidle_idle_call+0x99/0xf3
Mar 16 13:52:14 localhost kernel: [<ffffffff81010ca5>] ? cpu_idle+0xaa/0xe4
Mar 16 13:52:14 localhost kernel: [<ffffffff8143e507>] ? rest_init+0x6b/0x6d
Mar 16 13:52:14 localhost kernel: [<ffffffff81815de2>] ? start_kernel+0x3f4/0x3ff
Mar 16 13:52:14 localhost kernel: [<ffffffff818152c1>] ? x86_64_start_reservations+0xac/0xb0
Mar 16 13:52:14 localhost kernel: [<ffffffff818153bd>] ? x86_64_start_kernel+0xf8/0x107
Mar 16 13:52:14 localhost kernel: ---[ end trace dd6e56058ddf0962 ]---
Mar 16 13:52:14 localhost kernel: iwl data: 00000000: 00 01 ff 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
Mar 16 13:52:14 localhost kernel: iwl data: 00000010: 6c 93 00 00 00 00 00 00 6d 93 00 00 00 00 00 00  l.......m.......
Mar 16 13:52:18 localhost kernel: iwlagn 0000:03:00.0: START_ALIVE timeout after 4000ms.
Mar 16 13:52:18 localhost NetworkManager: <info>  (wlan0): deactivating device (reason: 2).
Mar 16 13:52:18 localhost kernel: iwlagn 0000:03:00.0: Microcode SW error detected.  Restarting 0x2000000.
Mar 16 13:52:22 localhost kernel: iwlagn 0000:03:00.0: START_ALIVE timeout after 4000ms.
Mar 16 13:52:22 localhost NetworkManager: <info>  (wlan0): supplicant interface state:  starting -> ready
Mar 16 13:52:22 localhost NetworkManager: <info>  (wlan0): device state change: 2 -> 3 (reason 42)
Mar 16 13:52:36 localhost kernel: Registered led device: iwl-phy0::radio
Mar 16 13:52:36 localhost kernel: Registered led device: iwl-phy0::assoc
Mar 16 13:52:36 localhost kernel: Registered led device: iwl-phy0::RX
Mar 16 13:52:36 localhost kernel: Registered led device: iwl-phy0::TX
Mar 16 13:52:36 localhost kernel: ADDRCONF(NETDEV_UP): wlan0: link is not ready
Mar 16 13:52:42 localhost NetworkManager: <info>  Activation (wlan0) starting connection 'Auto Bean & Bean'
Mar 16 13:52:42 localhost NetworkManager: <info>  (wlan0): device state change: 3 -> 4 (reason 0)
Mar 16 13:52:42 localhost NetworkManager: <info>  Activation (wlan0) Stage 1 of 5 (Device Prepare) scheduled...
Mar 16 13:52:42 localhost NetworkManager: <info>  Activation (wlan0) Stage 1 of 5 (Device Prepare) started...



This problem is reproducible, let me know if you would like more information.

Comment 1 Stanislaw Gruszka 2010-03-17 10:58:47 UTC
Hi James

This problem is most likely fixed by upstream commit:f36d04abe684f9e2b07c6ebe9f77ae20eb5c1e84 ("iwlwifi: use dma_alloc_coherent"). It was posted to -stable tree but do not get into fedora. I'm going to backport it and give you test kernel.

During waiting for that, could you please this kernel and see if it fix the issue:
http://koji.fedoraproject.org/koji/taskinfo?taskID=2045788

It uses differed slab allocator. We have more problems with allocations (not reproducible) and would like to know if switch to different slab allocator helps with that. Thanks.

Comment 3 Fedora Update System 2010-04-06 07:12:25 UTC
kernel-2.6.32.11-99.fc12 has been submitted as an update for Fedora 12.
http://admin.fedoraproject.org/updates/kernel-2.6.32.11-99.fc12

Comment 4 Fedora Update System 2010-04-09 01:37:44 UTC
kernel-2.6.32.11-99.fc12 has been pushed to the Fedora 12 testing repository.  If problems still persist, please make note of it in this bug report.
 If you want to test the update, you can install it with 
 su -c 'yum --enablerepo=updates-testing update kernel'.  You can provide feedback for this update here: http://admin.fedoraproject.org/updates/kernel-2.6.32.11-99.fc12

Comment 5 Fedora Update System 2010-04-10 10:29:14 UTC
kernel-2.6.32.11-99.fc12 has been pushed to the Fedora 12 stable repository.  If problems still persist, please make note of it in this bug report.

Comment 6 James M. Leddy 2010-06-04 16:46:18 UTC
Hi Stanislaw,

Sorry for the delay here. This problem persists in 2.6.32.14-127.fc12.x86_64:

WARNING: at drivers/net/wireless/iwlwifi/iwl-tx.c:1174 iwl_tx_cmd_complete+0x65/
0x2b4 [iwlcore]()
Hardware name: 4057W7N
wrong command queue 0, sequence 0x0 readp=0 writep=0

When you said it was fixed in -stable, were you refering to .32 or .33?

Comment 7 James M. Leddy 2010-06-04 16:50:27 UTC
It looks like it was applied to both .32 and .33 -stable, so I don't think that commit fixes the issue.

Comment 8 Stanislaw Gruszka 2010-06-07 08:07:09 UTC
Hi

Fix is for memory allocation problems during resume that you have in comment 0. Wrong command queue is a different problem, I was not aware of it ... Is there something more in logs before WARNING.


Note You need to log in before you can comment on or make changes to this bug.