411191 – [patch] iwl3945: prevent scans during association

Bug 411191 - [patch] iwl3945: prevent scans during association

Summary: [patch] iwl3945: prevent scans during association

Keywords:
Status:	CLOSED CURRENTRELEASE
Alias:	None
Product:	Fedora
Classification:	Fedora
Component:	kernel
Sub Component:
Version:	8
Hardware:	All
OS:	Linux
Priority:	low
Severity:	high
Target Milestone:	---
Assignee:	John W. Linville
QA Contact:	Fedora Extras Quality Assurance
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+	depends on / blocked

Reported:	2007-12-04 22:03 UTC by Derek Atkins
Modified:	2007-12-10 16:50 UTC (History)
CC List:	0 users
Fixed In Version:	2.6.23.9-45.fc7
Clone Of:
Environment:
Last Closed:	2007-12-10 16:50:20 UTC
Type:	---
Embargoed:
Dependent Products:

Attachments	(Terms of Use)

Description Derek Atkins 2007-12-04 22:03:55 UTC

Description of problem:

In a very busy network space (i.e., I can see over 90 APs from here), the
iwl3945 driver can sometimes fail to associate.  According to a member on the
networkmanager list this is due to the fact that the iwl3945 driver can scan
while associating, and this can cause the association to fail.  The patch at
http://marc.info/?l=linux-wireless&m=119668234926912&w=2 is supposed to correct
this.

Version-Release number of selected component (if applicable):

Tested in both kernel-2.6.23.8-34.fc7 and kernel-2.6.23.1-21.fc7

How reproducible:

Somewhat reproducible, but not 100%.  It seems to be a race condition..  Are you
scanning while you're trying to associate (or re-associate).  It's a little hard
to actually test this, but if you can roll me a new kernel in the next two days
I'll still be in this environment and can certainly test it for you!  But only
through 12 noon US/PST on Friday, December 7.

Steps to Reproduce:
1. try to connect to a network
2. watch the driver flail
3. lather, rinse, repeat until you get connected
  
Actual results:

eth1: Initial auth_alg=0
eth1: authenticate with AP 00:19:a9:45:2f:a1
eth1: Initial auth_alg=0
eth1: authenticate with AP 00:19:a9:45:2f:a1
eth1: Initial auth_alg=0
eth1: authenticate with AP 00:19:a9:45:2f:a1
eth1: authenticate with AP 00:19:a9:45:2f:a1
eth1: authenticate with AP 00:19:a9:45:2f:a1
eth1: authentication with AP 00:19:a9:45:2f:a1 timed out

Expected results:

eth1: authenticate with AP 00:1c:b0:e6:d3:21
eth1: RX authentication from 00:1c:b0:e6:d3:21 (alg=0 transaction=2 status=0)
eth1: authenticated
eth1: associate with AP 00:1c:b0:e6:d3:21
eth1: RX ReassocResp from 00:1c:b0:e6:d3:21 (capab=0x1 status=0 aid=63)
eth1: associated

Additional info:

http://mail.gnome.org/archives/networkmanager-list/2007-December/msg00087.html
http://marc.info/?l=linux-wireless&m=119668234926912&w=2

Comment 1 Dan Williams 2007-12-04 22:30:01 UTC

Would be a good patch to get in anyway; there's no guarantee when some random
app can call SIOCSIWSCAN and the driver (or stack) allowing scans during
association or reassociation (or even EAP exchanges) is just plain 100% broken.

Comment 2 John W. Linville 2007-12-05 14:17:12 UTC

The patch in question is available in the rawhide kernels here:

   http://koji.fedoraproject.org/koji/buildinfo?buildID=26735

I'll probably have an F8 kernel w/ the new stuff soon as well.  If you get a 
chance in the mean time, please test the kernel above.

Comment 3 Derek Atkins 2007-12-05 14:53:31 UTC

Any chance you could build an FC7 kernel too?  I'm not running 8 or rawhide.

Thanks,

Comment 4 John W. Linville 2007-12-05 14:56:46 UTC

Hmmm...well, it may be a while...

Comment 5 Derek Atkins 2007-12-05 15:09:34 UTC

Define "a while".  If you mean "I can't get the patch in and get a kernel
rebuild until the end of the day", then that's COMPLETELY fine.  If, however, if
means "I wont get it it at all in the next few days", I'd ask you humbly to
change your mind because I can actually test it in a live, hostile environemnt
only through Friday this week.

Comment 6 John W. Linville 2007-12-05 16:32:30 UTC

You are the itinerant tester, aren't you? :-)

I'll see what I can do -- probably tomorrow at the earliest.

Comment 7 Derek Atkins 2007-12-05 16:48:35 UTC

Yes, I am.  I do a lot of travel into various environments.  :-D  It also means
I find lots of issues because I have access to (and experience with) a multitude
of harsh environments.  The IETF meeting is the best testbed you can find!

Thank you.  Tomorrow would be perfect.

Comment 8 John W. Linville 2007-12-06 14:20:31 UTC

http://koji.fedoraproject.org/koji/buildinfo?buildID=26993

Wanna try that?

Comment 9 Derek Atkins 2007-12-06 16:48:45 UTC

So far so good.  It booted (although the screen didn't come up on the first boot
-- but a cold restart later and it came right up).

First thing I noticed is that it took a bit longer than usual to get onto the
net.  I think the reason is that when the device is first started, fedora tries
to ifup the device and it goes and tries to associate on it's own..  So NM can't
perform a scan until the initscripts timeout, at which point the device was down
and NM couldn't get a read at all.

I'm in a meeting right now but I'll see what happens when I move locations in
about 15 minutes or so.  But the good news is that once NM got ahold of the
device, it connected on the first try, whereas many times before I would need
three or four attempts to connect.  So it's looking promising.

Comment 10 Derek Atkins 2007-12-06 17:25:32 UTC

Okay...  I just migrated across.  I did lose connectivity as I migrated, but it
came back all on its own by NM.  Unfortunately I got a different IP address so
my TCP connections died.  But this is still a better situation than it used to be!

My next test will be next week, an open, non-broadcast (SSID) network!  But so
far 2.6.23.9-45.fc7 is looking promising!

Comment 11 Derek Atkins 2007-12-06 19:20:44 UTC

Oh, I figured out why my IP Address changed..  when I migrated, NM jumped to a
different SSID and that different SSID hands out a different range of IPs. 
Oops.  But that's probably more of a NM issue than a driver issue, I would
guess.  I'll be migrating again in about 10 minutes so I'll see what happens in
the other direction.

Comment 12 John W. Linville 2007-12-10 16:50:20 UTC

It sounds like this problem is resolved in 2.6.23.9-45.fc7...

Note You need to log in before you can comment on or make changes to this bug.