Bug 536988 - iwlagn intel 4965 intermittently fails when connected to 5ghz network
Summary: iwlagn intel 4965 intermittently fails when connected to 5ghz network
Keywords:
Status: CLOSED CURRENTRELEASE
Alias: None
Product: Fedora
Classification: Fedora
Component: kernel
Version: 12
Hardware: All
OS: Linux
low
medium
Target Milestone: ---
Assignee: Stanislaw Gruszka
QA Contact: Fedora Extras Quality Assurance
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2009-11-12 04:04 UTC by Andy Wang
Modified: 2013-01-10 05:35 UTC (History)
16 users (show)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed: 2010-03-08 16:46:07 UTC
Type: ---
Embargoed:


Attachments (Terms of Use)

Description Andy Wang 2009-11-12 04:04:49 UTC
Description of problem:
iwlagn intel 4965 intermittently fails when connected to 5ghz network

Version-Release number of selected component (if applicable):
kernel-2.6.31.5-127.fc12.x86_64

How reproducible:
sometimes

Steps to Reproduce:
1. attempt to connect to 5ghz 802.11n network
2. network manager associates with access point
3. ip address received from dhcp server
  
Actual results:
iwlagn 0000:03:00.0: ERROR: No TX rate available.
following error message is repeated in kernel over and over
network communication doesn't work

Expected results:
network communication works

Additional info:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/420678
seen with ubuntu as well

Comment 1 Alan 2009-12-12 03:35:23 UTC
I see the same behavior with my Thinkpad T61.
This is slightly different than the [also flawed] behavior I saw with Fedora 11 on the same hardware.  That behavior was consistent with Bug 493018, which intermittently cut out but would eventually restore.  

In Fedora 12 the iwlagn 0000:03:00.0: ERROR: No TX rate available errors are continuous and essentially immediate, rendering the interface useless. 
One thing to note is that this seems to affect the handling of networking beyond just the chosen interface.  If I shut down wireless in NetworkManager and plug in a cable that also fails to work.  I have to first rmmod the iwlagn module and then plug in a cable before I can do any sort of networking again.

Also it should be worth noting that this not seem to be a problem of 5ghz per se, just 802.11n running 5ghz.  I use an [Intel] 802.11a AP at work with no problems all day.  It is only when I come home and associate with my Linksys 610n in 5ghz mode that I run into trouble.

btw, I did not see this at all until I upgraded from the 2.6.31.6-162.fc12.x86_64 kernel to the 2.6.31.6-166.fc12.x86_64 kernel.  If I try to go back to 162 now it fails as well though (perhaps because I have the newer kernel-firmware pkg?).

Comment 2 Manny 2009-12-17 19:14:44 UTC
Same problem. I tried it at home and at work at 5ghz and just constant ERROR: No TX rate available. I can confirm that when F12 was initially released, it did not work. Then some upgrades later it did work. Then it stopped working again after another upgrade. Looking at the 2.6.32 changelog, there were a couple of commits which pointed to this problem... I tried to install the 2.6.32 kernel, but it didn't play nice with my graphics.

commit b58ef214b7db57cfcbca0e1edae08566cdfd56b7
Author: Daniel C Halperin <daniel.c.halperin>
Date:   Fri Aug 28 09:44:46 2009 -0700

    iwlwifi: remove incorrect uses of ieee80211_get_tx_rate to prevent TX stall
    
    Refactor and correct rate selection for outgoing transmitted
    packets.
    
    First, note that HT rates in the mac80211 rate table do not provide valid
    indices when ieee80211_get_tx_rate is called; the check to see if we could to
    abort a transmission early in iwl_tx_skb() would thus occasionally read invalid
    memory and occasionally stall transmission (if the erroneous byte was 0xff).
    We remove that code; the check wasn't valid anyway.
    
    Second, iwl_tx_cmd_build_rate() also called ieee80211_get_tx_rate to be used
    for sending management packets, which do not use the uCode station table.  This
    patch refactors that function and adds comments to enhance legibility, replaces
    the call to ieee80211_get_tx_rate() with a direct lookup, and adds error
    handling in case the table entry is invalid.
    
    Signed-off-by: Daniel C Halperin <daniel.c.halperin>
    Signed-off-by: Reinette Chatre <reinette.chatre>
    Signed-off-by: John W. Linville <linville>

Comment 3 drago01 2010-01-01 08:13:48 UTC
I can confirm this using a iwl5300 card; the drivers from 2.6.32 do indeed fix it (http://www.orbit-lab.org/kernel/compat-wireless-2.6-stable/v2.6.32/compat-wireless-2.6.32.2.tar.bz2 for people who want to test without rebuild/installing a different kernel).

(Problem only happens with 80211n + 5GHZ band)

Comment 4 drago01 2010-01-01 08:21:07 UTC
(In reply to comment #3)
> I can confirm this using a iwl5300 card; the drivers from 2.6.32 do indeed fix
> it
> (http://www.orbit-lab.org/kernel/compat-wireless-2.6-stable/v2.6.32/compat-wireless-2.6.32.2.tar.bz2
> for people who want to test without rebuild/installing a different kernel).
> 
> (Problem only happens with 80211n + 5GHZ band)  

Have to correct myself; the error message is not displayed but the symptoms are the same (connection stalls).

Comment 5 drago01 2010-01-01 11:35:24 UTC
(In reply to comment #4)
> (In reply to comment #3)
> > I can confirm this using a iwl5300 card; the drivers from 2.6.32 do indeed fix
> > it
> > (http://www.orbit-lab.org/kernel/compat-wireless-2.6-stable/v2.6.32/compat-wireless-2.6.32.2.tar.bz2
> > for people who want to test without rebuild/installing a different kernel).
> > 
> > (Problem only happens with 80211n + 5GHZ band)  
> 
> Have to correct myself; the error message is not displayed but the symptoms are
> the same (connection stalls).  

OK; rebooting after installing the new driver fixed everything (just reloading the module did not seem to be enough).

Just downloaded a big file while surfing the net an everything seems fine (stable connection + good performance)

Comment 6 drago01 2010-01-01 17:34:47 UTC
(In reply to comment #5)
> (In reply to comment #4)
> > (In reply to comment #3)
> > > I can confirm this using a iwl5300 card; the drivers from 2.6.32 do indeed fix
> > > it
> > > (http://www.orbit-lab.org/kernel/compat-wireless-2.6-stable/v2.6.32/compat-wireless-2.6.32.2.tar.bz2
> > > for people who want to test without rebuild/installing a different kernel).
> > > 
> > > (Problem only happens with 80211n + 5GHZ band)  
> > 
> > Have to correct myself; the error message is not displayed but the symptoms are
> > the same (connection stalls).  
> 
> OK; rebooting after installing the new driver fixed everything (just reloading
> the module did not seem to be enough).
> 
> Just downloaded a big file while surfing the net an everything seems fine
> (stable connection + good performance)  

This seems to depend on the phase of the moon or in other words, it is random it might work but the next time the connection drops after some minutes.

Comment 7 drago01 2010-01-01 22:43:53 UTC
Seems like everytime it happens it is "stuck" at 162MB/s iw reports:

iw wlan0 station dump
Station 00:xx:xx:xx:xx (on wlan0)
	inactive time:	957 ms
	rx bytes:	98786
	rx packets:	327
	tx bytes:	18885
	tx packets:	122
	signal:  	-80 dBm
	tx bitrate:	162.0 MBit/s MCS 12 40Mhz

Can someone confirm this?

Comment 8 Jesse Keating 2010-02-10 20:48:52 UTC
Yep, I can confirm, I'm seeing the same thing.

Comment 9 Jeremy Fitzhardinge 2010-02-18 17:44:34 UTC
I see this too.  I rarely get a functional 5GHz connection for more than a few seconds.  If it survives past that, it seems pretty solid.

Comment 10 drago01 2010-02-18 18:04:40 UTC
Just a note, this seems to be fixed in the 2.6.32 builds (i.e current kernels from koji).

Comment 11 Stanislaw Gruszka 2010-02-26 12:53:05 UTC
Andy, all


Do You confirm issue is fixed on 2.6.32 based kernel? You can update kernel using:

yum --enablerepo="updates-testing" update kernel

Comment 12 Jesse Keating 2010-02-26 17:24:21 UTC
I can confirm that it works.

Comment 13 Manny 2010-03-01 07:29:54 UTC
It seems to have solved the problem for me as well.

Comment 14 Jeremy Fitzhardinge 2010-03-02 18:36:59 UTC
This specific problem seems to be resolved by the new kernel, but wifi is very flaky after resuming (and I'm seeing bluetooth regressions).

Comment 15 Stanislaw Gruszka 2010-03-05 13:21:04 UTC
(In reply to comment #14)
> This specific problem seems to be resolved by the new kernel, but wifi is very
> flaky after resuming 

Please open bugzilla for that and assign it to me.

> (and I'm seeing bluetooth regressions).

We can have opened bz for that, if not open new one and cc me.

Comment 16 Andy Wang 2010-03-07 22:57:01 UTC
I'm not having any wifi flakiness problems and my laptop doesn't have bluetooth.

Otherwise, everything seems to be working great with the new kernel.  haven't had any connect issues since upgrading earlier today.  nice that it gives the connection speed now.

Comment 17 Jeremy Fitzhardinge 2010-03-08 06:52:37 UTC
(In reply to comment #15)
> > This specific problem seems to be resolved by the new kernel, but wifi is very
> > flaky after resuming 
> 
> Please open bugzilla for that and assign it to me.

I haven't managed to reproduce it enough to be able to clearly report anything.

> > (and I'm seeing bluetooth regressions).
> 
> We can have opened bz for that, if not open new one and cc me.    

I filed it as bug 570291.

Comment 18 Stanislaw Gruszka 2010-03-08 16:46:07 UTC
Since F12 update kernel to 2.6.32 I'm closing this bug.

Comment 19 Derek Atkins 2010-03-18 18:59:56 UTC
FYI, after updating to the 2.6.32 kernel on F12 this problem does go away HOWEVER I still lose connections to my 5GHz a/n network.  Instead of seeing tons of:

Mar 18 12:52:12 pgpdev kernel: iwlagn 0000:03:00.0: ERROR: No TX rate available.

I now see:

Mar 18 13:50:58 pgpdev kernel: iwlagn 0000:03:00.0: iwl_tx_agg_start on ra = 00:25:9c:ce:f5:71 tid = 0

And then the network is unusable.

Switching from my 5GHz a/n network to my 2.4GHz g network and I'm fine.  So this only happens on 5GHz a/n.

AP is a WTN610N running DD-WRT configured with WPA2/PSK/AES(CCMP)

I've submitted Bug #574885 on that issue...


Note You need to log in before you can comment on or make changes to this bug.