Bug 495697

Summary: Intel Corporation PRO/Wireless 5100 AGN [Shiloh] possible suspend problem
Product: Red Hat Enterprise Linux 5 Reporter: Taunus <codezilla>
Component: kernelAssignee: Stanislaw Gruszka <sgruszka>
Status: CLOSED CURRENTRELEASE QA Contact: Red Hat Kernel QE team <kernel-qe>
Severity: medium Docs Contact:
Priority: low    
Version: 5.4CC: dzickus, linville, sgruszka
Target Milestone: rcKeywords: Reopened
Target Release: ---   
Hardware: All   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2010-04-06 09:13:01 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Attachments:
Description Flags
lspci -n output none

Description Taunus 2009-04-14 13:06:41 UTC
Description of problem:
Now I'm running kernel-2.6.18-138.el5.jwltest.84.x86_64 on Thinkpad T400. Sometimes when suspending from Power Manager the suspend does not work. It stops to "Disabling non boot CPUs..." and after waiting a while, a minute maybe, I see:

iwlagn: No space for Tx
iwlagn: Error sending REPLY_STATISTICS_CMD: enqueue_hcmd failed: -28

and the computer just hangs there. The message did appear before on occasions I
did not make note of but I'm wondering if it causes the suspend to not work
properly. The suspend seems to work better if WLAN is disabled from the switch. 


Version-Release number of selected component (if applicable):


How reproducible:
Suspend from Power Manager. WLAN on with some traffic?

Steps to Reproduce:
1.
2.
3.
  
Actual results:
Sometimes hangs to "Disabling non boot CPUs..."

Expected results:
Goes to suspend

Additional info:

Comment 1 John W. Linville 2009-04-30 17:36:45 UTC
Can you post the output of 'lspci -n' please?

Comment 2 Taunus 2009-05-04 08:47:50 UTC
Created attachment 342282 [details]
lspci -n output

Attached is the lspci -n command output.

Actually I haven't seen this anymore since I upgraded to kernel-2.6.18-140.el5.x86_64

It was not happening all the time before the upgrade. I'll keep an eye for it for a while.

Comment 3 John W. Linville 2009-07-01 14:07:53 UTC
I'm going to close this based on comment 2.  Please reopen if the problem persists with current kernels from here (or the actual RHEL 5.4 release kernel):

   http://people.redhat.com/dzickus/el5/

Comment 4 Taunus 2009-10-30 21:28:03 UTC
Suspending was working somehow till 2.6.18-164
Now with 2.6.18-164.2.1 first or second or third suspend crashes. If I disable wlan from the switch I can suspend as many times I want.

So the wlan and suspend problems are definitely related. The only clue is still this message which appears after a while:
iwlagn: No space for Tx
iwlagn: Error sending REPLY_STATISTICS_CMD: enqueue_hcmd failed: -28

Also, without the wlan on suspending is faster. When wlan is on it takes more time to suspend, looks like it is moving in slow motion.

Comment 5 Taunus 2009-11-03 13:47:52 UTC
I noticed that I get these in the syslog when computer is connected to wlan (wlan disconnects by itself sometimes):

BUG: warning at include/../net/mac80211/rate.h:153/rate_lowest_index() (Not tainted)

Call Trace:
 [<ffffffff882f7784>] :iwlagn:rs_get_rate+0x176/0x1b2
 [<ffffffff882263a2>] :mac80211:rate_control_get_rate+0x85/0xe5
 [<ffffffff8822b360>] :mac80211:ieee80211_tx_h_rate_ctrl+0x31/0xfa
 [<ffffffff8822bf5e>] :mac80211:ieee80211_master_start_xmit+0x226/0x3fc
 [<ffffffff80239147>] __qdisc_run+0x136/0x1f9
 [<ffffffff8002f9cb>] dev_queue_xmit+0x150/0x271
 [<ffffffff882256f1>] :mac80211:ieee80211_sta_work+0x50f/0x720
 [<ffffffff882251e2>] :mac80211:ieee80211_sta_work+0x0/0x720
 [<ffffffff8004d80f>] run_workqueue+0x94/0xe4
 [<ffffffff8004a057>] worker_thread+0x0/0x122
 [<ffffffff8009f9f5>] keventd_create_kthread+0x0/0xc4
 [<ffffffff8004a147>] worker_thread+0xf0/0x122
 [<ffffffff8008c3c2>] default_wake_function+0x0/0xe
 [<ffffffff8009f9f5>] keventd_create_kthread+0x0/0xc4
 [<ffffffff8009f9f5>] keventd_create_kthread+0x0/0xc4
 [<ffffffff8003298b>] kthread+0xfe/0x132
 [<ffffffff8005dfb1>] child_rip+0xa/0x11
 [<ffffffff8009f9f5>] keventd_create_kthread+0x0/0xc4
 [<ffffffff8003288d>] kthread+0x0/0x132
 [<ffffffff8005dfa7>] child_rip+0x0/0x11

Comment 6 John W. Linville 2009-12-02 15:56:24 UTC
Please also make sure you have the lastest available firmware package for your wireless device.

Also, please try the test kernels here:

   http://people.redhat.com/linville/kernels/rhel5/

Do these kernels work better for you?

Comment 7 Taunus 2009-12-09 07:25:49 UTC
Ok, The wlan part seems to work faster and better. Waking up from suspend sometimes freezes with kernel-2.6.18-175.el5.jwltest.96.3.x86_64 and latest v2 firmware. The trouble of going in to suspend seems to have gone away.

Comment 9 Taunus 2009-12-14 13:04:53 UTC
I have got also couple of kernel panics when waking up from suspend.

Comment 10 Stanislaw Gruszka 2009-12-14 13:45:20 UTC
Any chance to get a logs (serial console, netconsole, kdump) when kernel do resume and crash ?

Comment 11 Taunus 2009-12-22 08:44:47 UTC
Maybe if you send a link to instructions howto use netconsole or kdump

Comment 12 Stanislaw Gruszka 2009-12-22 11:37:42 UTC
Sure 

Kdump: http://kbase.redhat.com/faq/docs/DOC-6039

Don't know about RHEL netconsole howto, here is fedora doc: https://fedoraproject.org/wiki/Netconsole

Easiest way to setup netconsole on RHEL is edit /etc/sysconfig/netconsole file and run "/etc/init.d/netconsole start". 

I'm not sure if such debugging options will be useful, it depends if proper subsystems initialize before kernel resume crash. Please let as know.

Comment 13 Taunus 2010-01-28 12:27:43 UTC
I'm sorry, I can't find time to debug this...

Comment 14 Taunus 2010-04-01 09:41:51 UTC
As of kernel-2.6.18-194 the suspend bug when wireless network is on seems to have been fixed.

Comment 15 Stanislaw Gruszka 2010-04-06 09:13:01 UTC
I glad this is fixed (please reopen if not). Thanks.