Bug 833117
| Summary: | kernel 3.4.0 iwlwifi flaky | ||||||
|---|---|---|---|---|---|---|---|
| Product: | [Fedora] Fedora | Reporter: | Andy Wang <dopey> | ||||
| Component: | kernel | Assignee: | Stanislaw Gruszka <sgruszka> | ||||
| Status: | CLOSED CURRENTRELEASE | QA Contact: | Fedora Extras Quality Assurance <extras-qa> | ||||
| Severity: | unspecified | Docs Contact: | |||||
| Priority: | unspecified | ||||||
| Version: | 17 | CC: | aronparsons, brom, bugtracker, c.david86, dopey, gansalmon, inkybutton, irosenhagen, itamar, jforbes, johannes, jonathan, j.romildo, jvpgomes, jwboyer, jwulf, kernel-maint, kevin, lmedinas, madhu.chinakonda, matt, mavit, meijer.o, omaciel, pwouters, redhat, rkennke, sgruszka, sontek, stefan.kirrmann, tjwhaynes, tom.jenkinson, uckelman, wmealing, xkaf | ||||
| Target Milestone: | --- | ||||||
| Target Release: | --- | ||||||
| Hardware: | Unspecified | ||||||
| OS: | Unspecified | ||||||
| Whiteboard: | first=3.4.0 tested=3.4.2-4 iwlwifi | ||||||
| Fixed In Version: | Doc Type: | Bug Fix | |||||
| Doc Text: | Story Points: | --- | |||||
| Clone Of: | 825491 | Environment: | |||||
| Last Closed: | 2012-11-16 15:02:43 UTC | Type: | Bug | ||||
| Regression: | --- | Mount Type: | --- | ||||
| Documentation: | --- | CRM: | |||||
| Verified Versions: | Category: | --- | |||||
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||
| Cloudforms Team: | --- | Target Upstream Version: | |||||
| Embargoed: | |||||||
Description
Andy Wang
2012-06-18 15:50:38 UTC
As per comments from bug 825491, cloning and opening a new bug. I still have the exact same symptoms as the original bug reporter. Unfortunately, I have no idea how to reproduce this other than to start transferring stuff over wifi and wait for it to occur.

--- Additional comment from dopey on 2012-06-16 17:50:28 EDT ---

$ uname -r
3.4.2-4.fc17.x86_64

As noted in Comment 25 this is still not fixed

[ 9168.354006] iwlwifi 0000:03:00.0: Queue 2 stuck for 2000 ms.
[ 9168.354017] iwlwifi 0000:03:00.0: Current SW read_ptr 208 write_ptr 209
[ 9168.354075] iwlwifi 0000:03:00.0: Current HW read_ptr 208 write_ptr 209
[ 9168.354081] iwlwifi 0000:03:00.0: On demand firmware reload
[ 9168.354610] ieee80211 phy0: Hardware restart was requested
[ 9168.354716] iwlwifi 0000:03:00.0: L1 Enabled; Disabling L0S
[ 9168.354915] iwlwifi 0000:03:00.0: Radio type=0x0-0x3-0x1

And also, performance gets progressively worse over time.

I believe this was fixed in 3.4.3; see the changelog:
http://www.kernel.org/pub/linux/kernel/v3.0/ChangeLog-3.4.3

(In reply to comment #2)
> I believe this was fixed in 3.4.3; see the changelog:
> http://www.kernel.org/pub/linux/kernel/v3.0/ChangeLog-3.4.3

Not that I can see, as Fedora already carried some (most? didn't check) of the patches that were merged there in its 3.4.2 kernel.

FWIW, I'm still seeing problems in 3.5.0-0.rc2.git0.3.fc18.x86_64

I'm still seeing the same issue with kernel-3.4.2-4.fc17.x86_64 and the following NIC:

02:00.0 Network controller: Intel Corporation Centrino Advanced-N 6200 (rev 35)

There are iwlwifi parameters that could work around this bug: wd_disable and 11n_disable. Actually, on 3.4 11n_disable is broken; that will be fixed soon.

Sorry, no, my mistake; 11n_disable=1 is broken on 3.5, not 3.4.

I think the problem is not visible in kernel 3.5. Or at least, it is much better. I tried kernels 3.2, 3.3, 3.4, and 3.5 in Fedora and Ubuntu (Mint), and kernels 3.3 and 3.4 in Arch Linux.
In all cases, there were many problems with the wifi connection when using kernels 3.3 and 3.4 (including 3.4.3). With kernel 3.2 there were no problems. I'm still experimenting with kernel 3.5 in both Fedora and Ubuntu, but it seems to be much better. I can now use the wifi connection with no major problems. However, kernel 3.5 is still a little unstable.

wifi controller:
01:00.0 Network controller: Intel Corporation Centrino Advanced-N 6230 (rev 34)

After testing the wifi connection more deeply with kernel 3.5, I noticed that the problem remains. Actually, the connection is now much more stable. I can now work without continuously losing connectivity. However, the connection seemed to be much slower. So I compared it with Windows and with Linux on kernel 3.2. I downloaded an ISO file from the same URL, using the same machine. When using Windows or Linux with kernel 3.2, the connection speed is very similar. With kernel 3.5, the connection is much slower, less than half the speed. And this happens even if I configure the wireless router to use only 802.11g. Apparently, the problem isn't exclusive to 802.11n.

I have this problem with kernel 3.4.3-1 on a Lenovo T510

Kernel: Linux dhcp-1-77.bne.redhat.com 3.4.3-1.fc17.x86_64 #1 SMP Mon Jun 18 19:53:17 UTC 2012 x86_64 x86_64 x86_64 GNU/Linux
wifi: 03:00.0 Network controller: Intel Corporation Centrino Ultimate-N 6300 (rev 35)

Frequent occurrences of this:

Jun 22 10:55:37 dhcp-1-77 kernel: [14904.742979] iwlwifi 0000:03:00.0: Queue 11 stuck for 2000 ms.
Jun 22 10:55:37 dhcp-1-77 kernel: [14904.742985] iwlwifi 0000:03:00.0: Current SW read_ptr 169 write_ptr 183
Jun 22 10:55:37 dhcp-1-77 kernel: [14904.743037] iwlwifi 0000:03:00.0: Current HW read_ptr 169 write_ptr 183
Jun 22 10:55:37 dhcp-1-77 kernel: [14904.743040] iwlwifi 0000:03:00.0: On demand firmware reload
Jun 22 10:55:37 dhcp-1-77 kernel: [14904.743490] ieee80211 phy0: Hardware restart was requested
Jun 22 10:55:37 dhcp-1-77 kernel: [14904.743563] iwlwifi 0000:03:00.0: L1 Enabled; Disabling L0S
Jun 22 10:55:37 dhcp-1-77 kernel: [14904.743754] iwlwifi 0000:03:00.0: Radio type=0x0-0x3-0x1
Jun 22 11:01:22 dhcp-1-77 kernel: [15248.737344] cfg80211: Calling CRDA to update world regulatory domain
Jun 22 11:01:22 dhcp-1-77 NetworkManager[781]: <info> (wlan0): supplicant interface state: completed -> disconnected

Getting the same thing with 3.3.4-3 in F16, slightly different wifi card:

Linux nitai 3.3.4-3.fc16.x86_64 #1 SMP Thu May 3 14:46:44 UTC 2012 x86_64 x86_64 x86_64 GNU/Linux
03:00.0 Network controller: Intel Corporation Centrino Ultimate-N 6300 (rev 3e)

Jun 22 12:21:23 nitai kernel: [41126.742767] iwlwifi 0000:03:00.0: Queue 11 stuck for 2000 ms.
Jun 22 12:21:23 nitai kernel: [41126.742771] iwlwifi 0000:03:00.0: Current SW read_ptr 235 write_ptr 59
Jun 22 12:21:23 nitai kernel: [41126.742822] iwlwifi 0000:03:00.0: Current HW read_ptr 235 write_ptr 59
Jun 22 12:21:23 nitai kernel: [41126.742825] iwlwifi 0000:03:00.0: On demand firmware reload
Jun 22 12:21:23 nitai kernel: [41126.743226] ieee80211 phy0: Hardware restart was requested
Jun 22 12:21:23 nitai kernel: [41126.743313] iwlwifi 0000:03:00.0: L1 Enabled; Disabling L0S
Jun 22 12:21:23 nitai kernel: [41126.743488] iwlwifi 0000:03:00.0: Radio type=0x0-0x3-0x1

I get this behaviour when connecting to an Airport Express at home.
However, when connecting to the wireless AP at the office I have a rock solid connection; so it seems to be a combination of:

[kernel code]: 3.3.4-3.fc16 & 3.4.3-1.fc17
[wireless chipset]: Centrino Ultimate-N 6300
[AP or environmental factors]: Apple Airport Express

I'll check the channel settings on the AP at home and compare them with the AP settings in the office, and see if I can narrow it down further.

This is another data point from a Lenovo W510.

03:00.0 Network controller: Intel Corporation Centrino Ultimate-N 6300 (rev 35)
Subsystem: Intel Corporation Centrino Ultimate-N 6300 3x3 AGN
Flags: bus master, fast devsel, latency 0, IRQ 55
Memory at f2000000 (64-bit, non-prefetchable) [size=8K]
Capabilities: [c8] Power Management version 3
Capabilities: [d0] MSI: Enable+ Count=1/1 Maskable- 64bit+
Capabilities: [e0] Express Endpoint, MSI 00
Capabilities: [100] Advanced Error Reporting
Capabilities: [140] Device Serial Number 00-24-d7-ff-ff-70-cc-28
Kernel driver in use: iwlwifi

Works fine at authenticating with LEAP to the company network. At home using WPA2 against a , it works fine if I sit within 3 metres of the access point but if I move a little further away, it eventually drops out (within a few minutes). If I use VPNC to work, it drops out a whole lot faster.

I should add

Linux nexus6.torolab.ibm.com 3.4.3-1.fc17.x86_64 #1 SMP Mon Jun 18 19:53:17 UTC 2012 x86_64 x86_64 x86_64 GNU/Linux

Have the same problems with Fedora 16 on my Thinkpad T510. When connected to a wireless access point with WPA2 (FRITZ!Box), the connection sometimes stops, however the (k)networkmanager still shows an active and alive connection. If I disable wireless and networking in knetworkmanager, and then re-enable it, wifi works again until the next stop sometime later.
Kernel: 3.4.2-1.fc16.x86_64
03:00.0 Network controller: Intel Corporation Centrino Ultimate-N 6300 (rev 35)

Also, whenever the connection to the internet is not working, dmesg gives me:

[ 1195.465827] wlan0: authenticate with bc:05:43:49:12:c9
[ 1195.470281] wlan0: send auth to bc:05:43:49:12:c9 (try 1/3)
[ 1195.479621] wlan0: authenticated
[ 1195.480603] wlan0: associate with bc:05:43:49:12:c9 (try 1/3)
[ 1195.485430] wlan0: RX AssocResp from bc:05:43:49:12:c9 (capab=0x431 status=0 aid=1)
[ 1195.485436] wlan0: associated
[ 1738.945625] iwlwifi 0000:03:00.0: fail to flush all tx fifo queues
[ 1855.835088] iwlwifi 0000:03:00.0: fail to flush all tx fifo queues
[ 1909.752134] iwlwifi 0000:03:00.0: fail to flush all tx fifo queues
[ 1911.755617] iwlwifi 0000:03:00.0: fail to flush all tx fifo queues
[ 2038.592005] iwlwifi 0000:03:00.0: fail to flush all tx fifo queues
[ 2047.635404] iwlwifi 0000:03:00.0: fail to flush all tx fifo queues
[ 2055.405972] iwlwifi 0000:03:00.0: fail to flush all tx fifo queues

Just posting a "me too" message with kernel 3.4.3-1.fc17.x86_64

[ 5939.491305] iwlwifi 0000:25:00.0: Queue 2 stuck for 2000 ms.
[ 5939.491315] iwlwifi 0000:25:00.0: Current SW read_ptr 110 write_ptr 113
[ 5939.491354] iwlwifi 0000:25:00.0: Current HW read_ptr 110 write_ptr 113
[ 5939.491360] iwlwifi 0000:25:00.0: On demand firmware reload
[ 5939.491776] ieee80211 phy0: Hardware restart was requested
[ 5939.491840] iwlwifi 0000:25:00.0: L1 Disabled; Enabling L0S
[ 5939.491983] iwlwifi 0000:25:00.0: Radio type=0x0-0x3-0x1
[ 5956.055551] wlan0: deauthenticating from 00:dd:9d:86:f1:6b by local choice (reason=3)
[ 5956.094408] cfg80211: Calling CRDA to update world regulatory domain
[ 5956.115033] bridge-wlan0: disabling the bridge on dev down
[ 5956.115073] bridge-wlan0: down
[ 5956.127881] bridge-wlan0: detached
[ 5956.128042] cfg80211: World regulatory domain updated:
[ 5956.128052] cfg80211: (start_freq - end_freq @ bandwidth), (max_antenna_gain, max_eirp)
[ 5956.128063] cfg80211: (2402000 KHz - 2472000 KHz @ 40000 KHz), (300 mBi, 2000 mBm)
[ 5956.128072] cfg80211: (2457000 KHz - 2482000 KHz @ 20000 KHz), (300 mBi, 2000 mBm)
[ 5956.128081] cfg80211: (2474000 KHz - 2494000 KHz @ 20000 KHz), (300 mBi, 2000 mBm)
[ 5956.128089] cfg80211: (5170000 KHz - 5250000 KHz @ 40000 KHz), (300 mBi, 2000 mBm)
[ 5956.128098] cfg80211: (5735000 KHz - 5835000 KHz @ 40000 KHz), (300 mBi, 2000 mBm)
[ 5956.128554] cfg80211: Calling CRDA for country: SE
[ 5956.134044] cfg80211: Regulatory domain changed to country: SE
[ 5956.134051] cfg80211: (start_freq - end_freq @ bandwidth), (max_antenna_gain, max_eirp)
[ 5956.134058] cfg80211: (2402000 KHz - 2482000 KHz @ 40000 KHz), (N/A, 2000 mBm)
[ 5956.134064] cfg80211: (5170000 KHz - 5250000 KHz @ 40000 KHz), (N/A, 2000 mBm)
[ 5956.134069] cfg80211: (5250000 KHz - 5330000 KHz @ 40000 KHz), (N/A, 2000 mBm)
[ 5956.134074] cfg80211: (5490000 KHz - 5710000 KHz @ 40000 KHz), (N/A, 2700 mBm)
[ 5956.248893] cfg80211: Calling CRDA to update world regulatory domain
[ 5956.252853] cfg80211: World regulatory domain updated:
[ 5956.252860] cfg80211: (start_freq - end_freq @ bandwidth), (max_antenna_gain, max_eirp)
[ 5956.252867] cfg80211: (2402000 KHz - 2472000 KHz @ 40000 KHz), (300 mBi, 2000 mBm)
[ 5956.252873] cfg80211: (2457000 KHz - 2482000 KHz @ 20000 KHz), (300 mBi, 2000 mBm)
[ 5956.252879] cfg80211: (2474000 KHz - 2494000 KHz @ 20000 KHz), (300 mBi, 2000 mBm)
[ 5956.252884] cfg80211: (5170000 KHz - 5250000 KHz @ 40000 KHz), (300 mBi, 2000 mBm)
[ 5956.252889] cfg80211: (5735000 KHz - 5835000 KHz @ 40000 KHz), (300 mBi, 2000 mBm)
[ 5956.259032] Intel(R) Wireless WiFi Link AGN driver for Linux, in-tree:d
[ 5956.259034] Copyright(c) 2003-2012 Intel Corporation
[ 5956.259129] iwlwifi 0000:25:00.0: pci_resource_len = 0x00002000
[ 5956.259131] iwlwifi 0000:25:00.0: pci_resource_base = ffffc900117ec000
[ 5956.259132] iwlwifi 0000:25:00.0: HW Revision ID = 0x35
[ 5956.259226] iwlwifi 0000:25:00.0: irq 58 for MSI/MSI-X
[ 5956.263597] iwlwifi 0000:25:00.0: loaded firmware version 9.221.4.1 build 25532
[ 5956.263828] iwlwifi 0000:25:00.0: CONFIG_IWLWIFI_DEBUG enabled
[ 5956.263830] iwlwifi 0000:25:00.0: CONFIG_IWLWIFI_DEBUGFS enabled
[ 5956.263831] iwlwifi 0000:25:00.0: CONFIG_IWLWIFI_DEVICE_TRACING disabled
[ 5956.263832] iwlwifi 0000:25:00.0: CONFIG_IWLWIFI_DEVICE_TESTMODE disabled
[ 5956.263833] iwlwifi 0000:25:00.0: CONFIG_IWLWIFI_P2P disabled
[ 5956.263835] iwlwifi 0000:25:00.0: Detected Intel(R) Centrino(R) Ultimate-N 6300 AGN, REV=0x74

Sometimes it fixes itself, and sometimes I have to reload the module with modprobe or cycle the hardware switch manually.

"Me too": FC17 with latest updates:

[tom@tomsfc17 ~]$ uname -r
3.4.3-1.fc17.x86_64

[ 898.444187] usb 1-1.1: USB disconnect, device number 8
[ 2104.110358] iwlwifi 0000:03:00.0: Queue 2 stuck for 2000 ms.
[ 2104.110364] iwlwifi 0000:03:00.0: Current SW read_ptr 177 write_ptr 183
[ 2104.110416] iwlwifi 0000:03:00.0: Current HW read_ptr 177 write_ptr 183
[ 2104.110419] iwlwifi 0000:03:00.0: On demand firmware reload
[ 2104.110823] ieee80211 phy0: Hardware restart was requested
[ 2104.110895] iwlwifi 0000:03:00.0: L1 Enabled; Disabling L0S
[ 2104.111083] iwlwifi 0000:03:00.0: Radio type=0x0-0x3-0x1

[tom@tomsfc17 ~]$ lspci | grep "Network controller"
03:00.0 Network controller: Intel Corporation Centrino Ultimate-N 6300 (rev 35)

Lenovo Thinkpad T510

Me too: FC16 with latest updates:

Jun 27 06:33:48 localhost kernel: [78505.341557] iwlwifi 0000:03:00.0: Queue 12 stuck for 2000 ms.
Jun 27 06:33:48 localhost kernel: [78505.341570] iwlwifi 0000:03:00.0: Current SW read_ptr 194 write_ptr 32
Jun 27 06:33:48 localhost kernel: [78505.341649] iwlwifi 0000:03:00.0: Current HW read_ptr 194 write_ptr 32
Jun 27 06:33:48 localhost kernel: [78505.341657] iwlwifi 0000:03:00.0: On demand firmware reload
Jun 27 06:33:48 localhost kernel: [78505.342480] ieee80211 phy0: Hardware restart was requested
Jun 27 06:33:48 localhost kernel: [78505.342638] iwlwifi 0000:03:00.0: L1 Enabled; Disabling L0S
Jun 27 06:33:48 localhost kernel: [78505.342889] iwlwifi 0000:03:00.0: Radio type=0x0-0x3-0x1

lspci | grep "Network controller"
03:00.0 Network controller: Intel Corporation Centrino Ultimate-N 6300 (rev 35)

uname -r
3.4.2-1.fc16.x86_64

Got it again on FC16 though:

[21528.114050] iwlwifi 0000:03:00.0: Queue 2 stuck for 2000 ms.
[21528.114056] iwlwifi 0000:03:00.0: Current SW read_ptr 126 write_ptr 133
[21528.114109] iwlwifi 0000:03:00.0: Current HW read_ptr 126 write_ptr 133
[21528.114112] iwlwifi 0000:03:00.0: On demand firmware reload
[21528.114586] ieee80211 phy0: Hardware restart was requested
[21528.114659] iwlwifi 0000:03:00.0: L1 Enabled; Disabling L0S
[21528.114850] iwlwifi 0000:03:00.0: Radio type=0x0-0x3-0x1

And twice more this morning with F16 (same config as above) :-(

[ 160.600837] iwlwifi 0000:03:00.0: Queue 2 stuck for 2000 ms.
[ 160.600852] iwlwifi 0000:03:00.0: Current SW read_ptr 182 write_ptr 188
[ 160.600982] iwlwifi 0000:03:00.0: Current HW read_ptr 182 write_ptr 188
[ 160.600984] iwlwifi 0000:03:00.0: On demand firmware reload
[ 160.601623] ieee80211 phy0: Hardware restart was requested
[ 160.601774] iwlwifi 0000:03:00.0: L1 Enabled; Disabling L0S
[ 160.602028] iwlwifi 0000:03:00.0: Radio type=0x0-0x3-0x1
[ 445.386086] TCP: lp registered

Still happening with 3.4.4-3.fc17.x86_64

It seems for me power_save=0 is the module parameter that helps me best.

Hi Paul,

Please can you elaborate? Is power_save=0 an option that we need to add to the Linux kernel parameters in /boot/grub2/grub.cfg?

Also, when you say "helps me best", does that mean that you have never had the issue when you have that set?

Tom

It's a module parameter, so I put it in /etc/modprobe.d/iwlwifi.conf

options iwlwifi power_save=0

I still have some issues, and when it gets busy in our hackerspace I still see a lot more issues than other people remaining connected. But with this setting, at home where I use a TPlink
http://www.tp-link.com/ca/products/details/?categoryid=241&model=TL-WR702N
which causes my wifi card to lock up in seconds and reboot itself, so I can't actually use it at all, even if I'm a meter away from it. One in every 10 pings will take 3000ms. With this option set, I can stay online at home.

Thanks for the clarification, much appreciated

This one is giving me the love. Thanks!
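The workaround described above is a single `options` line in /etc/modprobe.d/iwlwifi.conf. As a minimal sketch, the Python helper below builds that line from a set of parameters so that combinations reported in this bug (power_save, wd_disable) can be written out consistently. The function name `render_iwlwifi_options` is hypothetical and only for illustration; the sketch formats text and does not touch the system or check which parameters a given kernel's iwlwifi actually accepts.

```python
def render_iwlwifi_options(params):
    """Return one modprobe.d 'options' line for the iwlwifi module.

    `params` maps module parameter names to values, in the order they
    should appear. The helper name is hypothetical; it only formats text.
    """
    if not params:
        raise ValueError("at least one module parameter is required")
    pairs = " ".join(f"{name}={value}" for name, value in params.items())
    return f"options iwlwifi {pairs}"


if __name__ == "__main__":
    # The exact line quoted in this bug for /etc/modprobe.d/iwlwifi.conf.
    print(render_iwlwifi_options({"power_save": 0}))
    # Combining it with the watchdog workaround also mentioned in this bug.
    print(render_iwlwifi_options({"power_save": 0, "wd_disable": 1}))
```

The output would still have to be saved to /etc/modprobe.d/iwlwifi.conf by hand, and the module reloaded (or the machine rebooted) before the setting applies.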
(In reply to comment #25)
> It's a module parameter, so I put it in /etc/modprobe.d/iwlwifi.conf
>
> options iwlwifi power_save=0
>
> I still have some issues, and when it gets busy in our hackerspace I still
> see a lot more issues than other people remaining connected. But with this
> setting, at home where I use a TPlink
> http://www.tp-link.com/ca/products/details/?categoryid=241&model=TL-WR702N
> which causes my wifi card to lock up in seconds and reboot itself, so I
> can't actually use it at all, even if I'm a meter away from it. One in every 10
> pings will take 3000ms. With this option set, I can stay online at home.

I've also been running with wd_disable=1 for about a week with no issues.

Created attachment 596332 [details]
errors I'm getting
Linux ovo 3.4.4-3.fc17.i686 #1 SMP Tue Jun 26 21:32:03 UTC 2012 i686 i686 i386 GNU/Linux
At least for me the recent kernel update to 3.4.4-4.fc16.x86_64 fixed the problem. My PC configuration is detailed in Comment 15.

(In reply to comment #30)
> At least for me the recent kernel update towards 3.4.4-4.fc16.x86_64 fixed
> the problem. My PC configuration is detailed in Comment 15.

I have a similar configuration (a ThinkPad W520 and a Fritz!Box access point using WPA2) and I still get the same stuck queue problem with 3.4.4-5 in F17.

Same here, still issues with 3.4.4-5 in F17 on Thinkpad X201

Problem still there on Thinkpad X201 with 3.4.6:

ingmar@th1nkpad ~ % uname -a
Linux th1nkpad 3.4.6-2.fc17.x86_64 #1 SMP Thu Jul 19 22:54:16 UTC 2012 x86_64 x86_64 x86_64 GNU/Linux

02:00.0 Network controller: Intel Corporation Centrino Ultimate-N 6300 (rev 35)

[ 754.136836] iwlwifi 0000:02:00.0: Queue 2 stuck for 2000 ms.
[ 754.136843] iwlwifi 0000:02:00.0: Current SW read_ptr 15 write_ptr 55
[ 754.136898] iwlwifi 0000:02:00.0: Current HW read_ptr 15 write_ptr 55
[ 754.136902] iwlwifi 0000:02:00.0: On demand firmware reload
[ 754.141446] iwlwifi 0000:02:00.0: Failing on timeout while stopping DMA channel 1 [0x07fd0001]
[ 754.141995] ieee80211 phy0: Hardware restart was requested

As a further note, I did not experience any issues on a super busy IETF network, so I think this really relates to some (cheap?) AP implementation interacting with my Intel card.

Does the problem still happen on 3.6.5 or later?

Kernel 3.6 and later fixes the bug for me.

Network controller: Intel Corporation Centrino Advanced-N 6230 (rev 34)

Best regards,
Otto

The problem has not appeared on my Thinkpad T510 with Fedora 16 for quite a while. The last 3.4.x kernels as well as the 3.6.x kernel no longer show the flaky wifi behaviour.

3.6.6-1.fc16.x86_64
iwlwifi 0000:03:00.0: Detected Intel(R) Centrino(R) Ultimate-N 6300 AGN, REV=0x74

Closing per above comments ...
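For anyone comparing their own logs against the reports in this bug, the following is a minimal Python sketch that counts "Queue N stuck" events per PCI device and queue in dmesg or syslog text. The regular expression is modelled only on the log lines quoted in this bug; the function name `count_stuck_queues` and the sample text are illustrative assumptions, and other kernel versions may word the message differently.

```python
import re
from collections import Counter

# Pattern modelled on the iwlwifi lines quoted in this bug, e.g.
# "iwlwifi 0000:03:00.0: Queue 2 stuck for 2000 ms."
STUCK_RE = re.compile(
    r"iwlwifi (?P<device>[0-9a-f:.]+): Queue (?P<queue>\d+) stuck for (?P<ms>\d+) ms"
)


def count_stuck_queues(log_text):
    """Count stuck-queue events keyed by (PCI device, queue number)."""
    counts = Counter()
    for match in STUCK_RE.finditer(log_text):
        counts[(match.group("device"), int(match.group("queue")))] += 1
    return counts


# Sample lines copied from comments in this bug (dmesg and syslog styles).
SAMPLE = """\
[ 9168.354006] iwlwifi 0000:03:00.0: Queue 2 stuck for 2000 ms.
[ 9168.354081] iwlwifi 0000:03:00.0: On demand firmware reload
Jun 22 10:55:37 dhcp-1-77 kernel: [14904.742979] iwlwifi 0000:03:00.0: Queue 11 stuck for 2000 ms.
[ 2104.110358] iwlwifi 0000:03:00.0: Queue 2 stuck for 2000 ms.
"""

if __name__ == "__main__":
    for (device, queue), n in sorted(count_stuck_queues(SAMPLE).items()):
        print(f"{device} queue {queue}: {n} stall(s)")
```

To use it on a real machine, the output of `dmesg` could be saved to a file and its contents passed to `count_stuck_queues`; a rising count on one queue after a kernel update would be a data point worth adding to a report like this one.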