RHEL Engineering is moving the tracking of its product development work on RHEL 6 through RHEL 9 to Red Hat Jira (issues.redhat.com). If you're a Red Hat customer, please continue to file support cases via the Red Hat customer portal. If you're not, please head to the "RHEL project" in Red Hat Jira and file new tickets here. Individual Bugzilla bugs in the statuses "NEW", "ASSIGNED", and "POST" are being migrated throughout September 2023. Bugs of Red Hat partners with an assigned Engineering Partner Manager (EPM) are migrated in late September as per pre-agreed dates. Bugs against components "kernel", "kernel-rt", and "kpatch" are only migrated if still in "NEW" or "ASSIGNED". If you cannot log in to RH Jira, please consult article #7032570. That failing, please send an e-mail to the RH Jira admins at rh-issues@redhat.com to troubleshoot your issue as a user management inquiry. The email creates a ServiceNow ticket with Red Hat. Individual Bugzilla bugs that are migrated will be moved to status "CLOSED", resolution "MIGRATED", and set with "MigratedToJIRA" in "Keywords". The link to the successor Jira issue will be found under "Links", have a little "two-footprint" icon next to it, and direct you to the "RHEL project" in Red Hat Jira (issue links are of type "https://issues.redhat.com/browse/RHEL-XXXX", where "X" is a digit). This same link will be available in a blue banner at the top of the page informing you that that bug has been migrated.
Bug 715452 - tg3: tg3_abort_hw timed out - No WOL from hibernate, no network on resume
Summary: tg3: tg3_abort_hw timed out - No WOL from hibernate, no network on resume
Keywords:
Status: CLOSED NOTABUG
Alias: None
Product: Red Hat Enterprise Linux 6
Classification: Red Hat
Component: kernel
Version: 6.1
Hardware: All
OS: Linux
medium
low
Target Milestone: rc
: ---
Assignee: Kernel Drivers
QA Contact: Ma Yuying
URL:
Whiteboard:
Depends On: 285721
Blocks: 846704 917159
TreeView+ depends on / blocked
 
Reported: 2011-06-22 22:24 UTC by Orion Poplawski
Modified: 2020-07-14 12:57 UTC (History)
6 users (show)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Clone Of: 285721
Environment:
Last Closed: 2015-09-19 02:13:48 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)

Description Orion Poplawski 2011-06-22 22:24:46 UTC
+++ This bug was initially created as a clone of Bug #285721 +++

Description of problem:
On hibernate, the following messages are displayed on the console:

tg3 0000:08:04.1: tg3_abort_hw timed out, TX_MODE_ENABLED will not clear MAC_TX_MODE=ffffffff

system will not wake from lan from hibernate

on resume:

tg3 0000:08:04.0: PME# disabled
tg3 0000:08:04.0: eth0: Link is down
tg3 0000:08:04.1: PME# disabled
tg3 0000:08:04.1: eth1: Link is down

and network in non-functional until module is removed and re-added.

Adding tg3 to SUSPEND_MODULES allows the network to work on resume, but also prevents wake on lan.

08:04.0 Ethernet controller: Broadcom Corporation NetXtreme BCM5715 Gigabit Ethernet (rev a3)
08:04.1 Ethernet controller: Broadcom Corporation NetXtreme BCM5715 Gigabit Ethernet (rev a3)

Version-Release number of selected component (if applicable):
2.6.32-131.2.1.el6.x86_64


From Bug 285721:

When suspending my computer with pm-hibernate --quirk-vbe-post I get this error
message on console (and in dmesg):

tg3: tg3_abort_hw timed out for eth0, TX_MODE_ENABLE will not clear
MAC_TX_MODE=ffffffff

After resume, I have to modprobe -v -r not only my wifi driver (iwl3945) but tg3
as well in order to make wifi to work.

How reproducible:
100%

Steps to Reproduce:
1.suspend/resume cycle
2.
3.
  
Actual results:
error message on console/dmesg and no wireless after resume, have to rmmod both
wifi and Ethernet driver

Expected results:
just works

Additional info:
having
SUSPEND_MODULES="kvm_intel kvm" in /etc/pm/config.d/unload_modules 
(with having iwl3945 there as well didn't help; will try now with both iwl3945
and tg3)

--- Additional comment from mcepl on 2007-09-11 06:06:26 EDT ---

Created attachment 192361 [details]
output of dmesg command

--- Additional comment from pknirsch on 2007-09-11 08:39:09 EDT ---

Hm thats sounds more like the tg3 driver has a problem when you hibernate using
this quirk.

Have you tried hibernating without the quirk? Or doesn't the machine hibernate
properly when you leave it out.

But in general, it's a kernel module spitting out an error message, so i rather
think this is a kernel problem, therefore i'm reassigning it to kernel.

Thanks,

Read ya, Phil

--- Additional comment from mcepl on 2007-09-25 09:47:39 EDT ---

yes, I tried to hibernate with any possible quirks or without them at all and it
makes this message all the time.

--- Additional comment from snecklifter on 2007-10-03 09:19:02 EDT ---

Hello,

I'm reviewing this bug as part of the kernel bug triage project, an attempt to
isolate current bugs in the fedora kernel.

http://fedoraproject.org/wiki/KernelBugTriage

From the 2.6.23-rc3 Changelog:

commit 3e0c95fd648c0d3175b9ff2232597d0b02eb7d46
Author: Michael Chan <mchan>
Date:   Fri Aug 3 20:56:54 2007 -0700

    [TG3]: Fix suspend/resume problem.
    
    Joachim Deguara <joachim.deguara> reported that tg3 devices
    would not resume properly if the device was shutdown before the system
    was suspended.  In such scenario where the netif_running state is 0,
    tg3_suspend() would not save the PCI state and so the memory enable bit
    and bus master enable bit would be lost.
    
    We fix this by always saving and restoring the PCI state in
    tg3_suspend() and tg3_resume() regardless of netif_running() state.
    
    Signed-off-by: Michael Chan <mchan>
    Signed-off-by: David S. Miller <davem>

Matej, can you test with a kernel based off this?

Also, could you clear up whether you are suspend/resuming or hibernate/waking?
You mention suspend/resume but then indicate you are running pm-hibernate. Do
you still see this issue with pm-suspend?

Cheers
Chris

--- Additional comment from mcepl on 2007-10-04 19:47:07 EDT ---

Tried kernel-2.6.23-0.217.rc9.git1.fc8 and the results are no good:

a) iwl3945 driver didn't work with NetworkManager-0.6.5-7.fc7 (it did with
kernel-2.6.22.9-91.fc7), so ifup had to be used.
b) tg3 warning message went away, but after suspend no wireless card whatsoever
(I have the computer at home, so I have no chance to test wired ethernet
function actually), and there were some backtraces in dmesg (see attached).

--- Additional comment from mcepl on 2007-10-04 19:47:32 EDT ---

Created attachment 216801 [details]
output of dmesg

--- Additional comment from mcepl on 2007-10-04 19:49:14 EDT ---

Created attachment 216811 [details]
/var/log/messages

--- Additional comment from snecklifter on 2007-10-05 06:58:04 EDT ---

Matej,

As the original issue with tg3 appears resolved in 2.6.23, I'm changing the
subject to reflect that. It might even be worth filing a new bug for the
wireless but to be honest there is so much work going into NetworkManager and
intel wifi at the moment it will likely get lost in the noise.

I'd suggest your best option would be to test with NetworkManager (again from
development if you can) and see if this helps. In the meantime I'll re-assign to
the wireless team. For brevity here is the backtrace which is related to your
touchpad rather than network driver issues:

=============================================
[ INFO: possible recursive locking detected ]
2.6.23-0.217.rc9.git1.fc8 #1
---------------------------------------------
kseriod/253 is trying to acquire lock:
 (&ps2dev->cmd_mutex){--..}, at: [<c0631cb3>] mutex_lock+0x21/0x24

but task is already holding lock:
 (&ps2dev->cmd_mutex){--..}, at: [<c0631cb3>] mutex_lock+0x21/0x24

other info that might help us debug this:
4 locks held by kseriod/253:
 #0:  (serio_mutex){--..}, at: [<c0631cb3>] mutex_lock+0x21/0x24
 #1:  (&serio->drv_mutex){--..}, at: [<c0631cb3>] mutex_lock+0x21/0x24
 #2:  (psmouse_mutex){--..}, at: [<c0631cb3>] mutex_lock+0x21/0x24
 #3:  (&ps2dev->cmd_mutex){--..}, at: [<c0631cb3>] mutex_lock+0x21/0x24

stack backtrace:
 [<c0406463>] show_trace_log_lvl+0x1a/0x2f
 [<c0406e4d>] show_trace+0x12/0x14
 [<c0406e65>] dump_stack+0x16/0x18
 [<c0449c56>] __lock_acquire+0x189/0xc67
 [<c044abae>] lock_acquire+0x7b/0x9e
 [<c0631ac0>] __mutex_lock_slowpath+0x10a/0x2dc
 [<c0631cb3>] mutex_lock+0x21/0x24
 [<c059bc3f>] ps2_command+0x92/0x30e
 [<c05a23c6>] psmouse_sliced_command+0x1c/0x5a
 [<c05a46eb>] synaptics_pt_write+0x21/0x46
 [<c059ba14>] ps2_sendbyte+0x39/0xcb
 [<c059bcbe>] ps2_command+0x111/0x30e
 [<c05a2001>] psmouse_probe+0x1d/0x6c
 [<c05a314d>] psmouse_connect+0xf8/0x20c
 [<c05993e0>] serio_connect_driver+0x1e/0x2e
 [<c0599406>] serio_driver_probe+0x16/0x18
 [<c05767bd>] driver_probe_device+0xf2/0x173
 [<c0576846>] __device_attach+0x8/0xa
 [<c0575b92>] bus_for_each_drv+0x3c/0x67
 [<c05768dc>] device_attach+0x75/0x8a
 [<c059941c>] serio_find_driver+0x14/0x3c
 [<c0599f59>] serio_thread+0x166/0x2b9
 [<c043e7f7>] kthread+0x3b/0x64
 [<c0405ee3>] kernel_thread_helper+0x7/0x10
 =======================

The logs then indicates the device coming back up and dhclient restarting a few
times before sleeping. NetworkManager fares no better later on it would seem.

--- Additional comment from linville on 2007-10-05 09:02:44 EDT ---

FWIW, it is really bad form to simply hijack a bug for another problem, rename 
it, and expect it to be treated as an extension of the same bug.  It isn't 
clear to me that this is the same issue at all, and having synaptics 
backtraces and tg3 stuff in what now claims to be an iwl3945 bug just creates 
confusion...

--- Additional comment from mcepl on 2007-10-05 09:24:27 EDT ---

Sure, being a bugmaster, I thought that kernel folks have different mores ;-).
No, seriously, should I file a new bug?

--- Additional comment from linville on 2007-10-05 09:35:18 EDT ---

Matej, you weren't the one I was hoping to educate. :-)  Yes, please open a 
new bug.  Restoring the name and assignee of this one (and closing it, if 
appropriate) seems like a good idea too.

--- Additional comment from mcepl on 2007-10-05 15:56:07 EDT ---

OK, so let's close this bug as CLOSED/RAWHIDE, and I will upgrade on Monday my
computer to F8test3 update that to the latest Rawhide, and I will all bugs which
will be eventually found out and you will fix them. Is it a deal? :-)

--- Additional comment from snecklifter on 2007-10-05 16:40:03 EDT ---

(In reply to comment #13)
> FWIW, it is really bad form to simply hijack a bug for another problem, rename 
> it, and expect it to be treated as an extension of the same bug.  It isn't 
> clear to me that this is the same issue at all, and having synaptics 
> backtraces and tg3 stuff in what now claims to be an iwl3945 bug just creates 
> confusion...

I'm no hi-jacker, you must have me confused with someone else. I don't *expect*
it to be treated as an extension, just that the underlying issue may be the same
however the initial tg3 errors were resolved so I felt a change of subject was
appropriate. Its your call to ask the reporter to file a new bug or continue on
this one. Please don't accuse me of hi-jacking - as indicated I am attempting to
triage kernel bugs.

(In reply to comment #16)
> OK, so let's close this bug as CLOSED/RAWHIDE, and I will upgrade on Monday my
> computer to F8test3 update that to the latest Rawhide, and I will all bugs which
> will be eventually found out and you will fix them. Is it a deal? :-)

Deal.

Cheers
Chris

--- Additional comment from mcepl on 2007-10-10 16:11:13 EDT ---

Sorry guys, if I can have my original summary back (I believe many people search
by the error they get in logs), and unfortunately I have to reopen this.

I have upgraded to full Rawhide, so I have now here kernel
2.6.23-0.224.rc9.git6.fc8 and 
NetworkManager-0.7.0-0.3.svn2914.fc8.

Unfortunately, the error message is back.

--- Additional comment from mcepl on 2007-10-10 16:25:57 EDT ---

Created attachment 223371 [details]
/var/log/messages

These are the /var/log/messages contain both suspend/resume cycle and reboot.

After suspend/resume cycle, there is no network (neither wireless nor wired). I
will file a different bug about this.

--- Additional comment from mcepl on 2008-01-14 01:27:05 EST ---

I cannot find in any log any error messages for now. So, lets CLOSE this for
now, and I will reopen it if every needed again.

Comment 1 Orion Poplawski 2011-06-22 22:30:45 UTC
Actually, on this machine, WOL doesn't work from shutdown either until the machine's power is cut and restored, so that may be a separate issue.

Comment 3 RHEL Program Management 2011-10-07 15:38:42 UTC
Since RHEL 6.2 External Beta has begun, and this bug remains
unresolved, it has been rejected as it is not proposed as
exception or blocker.

Red Hat invites you to ask your support representative to
propose this request, if appropriate and relevant, in the
next release of Red Hat Enterprise Linux.

Comment 5 John Shortt 2015-09-19 02:13:48 UTC
Closing since the last updates were 5 minor releases ago.  If you feel this is still relevant, please reopen or file a new BZ after confirming the problem still exists in RHEL6.

Thanks


Note You need to log in before you can comment on or make changes to this bug.