Bug 236301

Summary: rmmod netconsole module hangs with 100% CPU
Product: [Fedora] Fedora Reporter: David Rees <drees76>
Component: kernelAssignee: Kernel Maintainer List <kernel-maint>
Status: CLOSED CURRENTRELEASE QA Contact: Brian Brock <bbrock>
Severity: medium Docs Contact:
Priority: medium    
Version: 6CC: jmoyer, jonstanley, zoli
Target Milestone: ---   
Target Release: ---   
Hardware: i386   
OS: Linux   
Whiteboard:
Fixed In Version: kernel-2.6.23.9-85.fc8 Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2008-01-10 06:44:52 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 427887    

Description David Rees 2007-04-12 22:31:25 UTC
From Bugzilla Helper:
User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.8.1.3) Gecko/20070309 Firefox/2.0.0.3

Description of problem:
When trying to unload the netconsole module, for example when trying to reconfigure the module, the command hangs with the kernel in an apparent 100% system time CPU loop.

Rebooting the system is the only way I can figure out how to recover.

There have been a couple posts on linux-kernel regarding this, but no solution yet.

http://marc.info/?l=linux-kernel&m=117252217901235&w=2

Version-Release number of selected component (if applicable):
kernel-2.6.20-1.2933.fc6

How reproducible:
Always


Steps to Reproduce:
1. modprobe netconsole module
2. rmmod netconsole
3. *hang*

Actual Results:


Expected Results:


Additional info:
As an aside, the netdump package is fairly broken, it still tries to load the netdump module which doesn't even exist.

Comment 1 Chuck Ebbert 2007-04-17 23:46:26 UTC
Final post:

http://marc.info/?l=linux-netdev&m=117253232229182&w=2


Comment 2 Zoltan Arpadffy 2007-11-24 03:42:35 UTC
Hello,

I wonder is there any development here?
It passed almost a year and the bug still exists.

If the netconsole service is enabled it is not possible to halt/reboot the box.

root      5324  0.0  0.1   4648  1220 pts/1    S+   04:27   0:00  
|                   \_ /bin/sh /sbin/service netconsole stop
root      5331  0.0  0.1   4700  1284 pts/1    S+   04:27   0:00  
|                       \_ /bin/bash /etc/init.d/netconsole stop
root      5341 94.5  0.0   1716   448 pts/1    R+   04:27   2:39  
|                           \_ rmmod netconsole

# uname -r
2.6.21-1.3194.fc7

Thank you in advance.

Regards,
Z

Comment 3 Jon Stanley 2008-01-08 01:47:48 UTC
(This is a mass-update to all current FC6 kernel bugs in NEW state)

Hello,

I'm reviewing this bug list as part of the kernel bug triage project, an attempt
to isolate current bugs in the Fedora kernel.

http://fedoraproject.org/wiki/KernelBugTriage

I am CC'ing myself to this bug, however this version of Fedora is no longer
maintained.

Please attempt to reproduce this bug with a current version of Fedora (presently
Fedora 8). If the bug no longer exists, please close the bug or I'll do so in a
few days if there is no further information lodged.

Thanks for using Fedora!

Comment 4 Zoltan Arpadffy 2008-01-08 21:15:11 UTC
Hello,

I confirm that this ug is reproducible on:

# uname -a
Linux gate.polarhome.com 2.6.21-1.3194.fc7 #1 SMP Wed May 23 22:35:01 EDT 2007 
i686 i686 i386 GNU/Linux
# cat /etc/fedora-release
Fedora release 8 (Werewolf)

Thank you.

 

Comment 5 Jon Stanley 2008-01-08 21:41:12 UTC
Hmmm, you have a most interesting system.  An F7 kernel on a (purportedly) F8
system?  How did you upgrade? The kernel that you are running is the GA version
from F7 anyways, which is over 6 months old at this point.  Please try upgrading
to at least 2.6.23.9-85.fc8 and attempt to reproduce.  I could not reproduce
this on that kernel, but I also do not have a valid netconsole setup - I just
loaded it up with some dummy values and unloaded it (which worked fine).

[root@dhcp-126 ~]# modprobe netconsole netconsole=192.168.1.3
[root@dhcp-126 ~]# lsmod | fgrep netconsole
netconsole              6881  0
[root@dhcp-126 ~]# rmmod netconsole
[root@dhcp-126 ~]# lsmod | fgrep netconsole
[root@dhcp-126 ~]# uname -a
Linux dhcp-126 2.6.23.9-85.fc8 #1 SMP Fri Dec 7 15:49:59 EST 2007 i686 i686 i386
GNU/Linux



Comment 6 Jeff Moyer 2008-01-08 22:01:29 UTC
There were a bunch of changes to the workqueue cancellation code that may have
fixed this problem.  I was unable to reproduce it on 2.6.24-rc7;  sorry, I don't
have an f8 box handy for testing.

Reproducing the bug likely requires a working netconsole configuration
(something that is quite easily achieved).  I tried to reproduce (as I said, on
2.6.24) using a while loop that issued sysrq-t's while another loop
insmod/rmmod'd the netconsole module.

Could the reporter please upgrade his kernel and try again (as suggested by Jon)?

Comment 7 David Rees 2008-01-10 06:43:47 UTC
I can no longer reproduce this on Fedora 8 using kernel-2.6.23.9-85.fc8. This
bug should be safe to close.