Description of problem:
Whenever I try to force a netdump via sysrq "c" option, I get logs in /var/crash
on the netdump-server, but no core dumps. The client is a Thinkpad T41 laptop
running FC5 Test3 and the server is RHEL4-U2 AS2. This laptop ran RHEL3 WS, and
netdump worked just fine. Server has more then ample free disk space in the
Version-Release number of selected component (if applicable):
Steps to Reproduce:
1. Install FC5 Test3 on IBM T41 Thinkpad
2. Set up netdump and sysrq
3. Manually force crash on client via sysrq ctrl+alt+PrtSc+c
Client hangs while performing handshake with server- I have to manually power
cycle it. I allowed it to run overnight once.
A complete netdump with logs and core file on the server.
****Client Side Console******
I see the following on the laptop console:
Continuing in 120 seconds, and counts down.
CPU#0 is executing netdump
< netdump activated - performing handshake with the server. >
Mar 3 18:11:27 blacksamba netdump: Got too many timeouts in handshaking,
ignoring client 192.168.1.99
Mar 3 18:11:30 blacksamba netdump: Got too many timeouts waiting for
SHOW_STATUS for client 192.168.1.99, rebooting it
Mar 3 18:12:19 blacksamba netdump: Got too many timeouts in handshaking,
ignoring client 192.168 .1.99
Mar 3 18:12:22 blacksamba netdump: Got too many timeouts waiting for
SHOW_STATUS for client 192. 168.1.99, rebooting it
I did a google search and found the following thread pertaining to e1000
devices: (possibly related?)
Created attachment 125631 [details]
logs from /var/crash on netdump server for netdump client
Still seeing this issue in FC5 Gold. I was just going through the FC5 bugs for
netdump and found the following:
Sounds like the same thing I saw with FC5-test3 and Gold. Let me know if you
come up with anything I can help test out.
This sounds like bz193688. We're working on an upstream fix right now, and I'll
see to that it gets ported into the FC update once we settle on what the fix is
going to be.
Created attachment 130877 [details]
This is the upstream fix for the netpoll recursion problem
Could you please run a kernel with this patch included and verify that it fixes
your problem? Thanks
Fedora Core 5 and Fedora Core 6 are, as we're sure you've noticed, no longer
test releases. We're cleaning up the bug database and making sure important bug
reports filed against these test releases don't get lost. It would be helpful if
you could test this issue with a released version of Fedora or with the latest
development / test release. Thanks for your help and for your patience.
[This is a bulk message for all open FC5/FC6 test release bugs. I'm adding
myself to the CC list for each bug, so I'll see any comments you make after this
and do my best to make sure every issue gets proper attention.]