145476 – netdump client/server problems

Bug 145476 - netdump client/server problems

Summary: netdump client/server problems

Keywords:
Status:	CLOSED ERRATA
Alias:	None
Product:	Red Hat Enterprise Linux 3
Classification:	Red Hat
Component:	kernel
Sub Component:
Version:	3.0
Hardware:	All
OS:	Linux
Priority:	medium
Severity:	medium
Target Milestone:	---
Assignee:	Jeff Moyer
QA Contact:
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:	156320
TreeView+	depends on / blocked

Reported:	2005-01-18 20:53 UTC by Joshua Jensen
Modified:	2007-11-30 22:07 UTC (History)
CC List:	8 users (show)
Fixed In Version:	RHSA-2005-663
Doc Type:	Enhancement
Doc Text:
Clone Of:
Environment:
Last Closed:	2005-10-21 15:42:40 UTC
Target Upstream Version:
Embargoed:

Attachments	(Terms of Use)
log from the /var/crash/IP-date directory (2.24 KB, text/plain) 2005-01-18 20:54 UTC, Joshua Jensen	no flags	Details
tcpdump from the server-side, taken during the "Got too many timeouts" messages (13.99 KB, application/octet-stream) 2005-01-18 20:55 UTC, Joshua Jensen	no flags	Details
syslog from server side... notice all the different "timouts" messages (4.14 KB, text/plain) 2005-01-18 20:56 UTC, Joshua Jensen	no flags	Details
try to auto-negotiate 100mb fd so we are not inundated with packets (1.01 KB, patch) 2005-01-19 22:28 UTC, Jeff Moyer	no flags	Details \| Diff
same patch, but this one compiles (1.33 KB, patch) 2005-01-19 22:33 UTC, Jeff Moyer	no flags	Details \| Diff
Kernel built with proposed netdump patch (8.86 MB, audio/x-pn-realaudio-plugin) 2005-01-20 15:57 UTC, Jeff Moyer	no flags	Details
2nd netdump tarfile (2.65 MB, application/octet-stream) 2005-01-31 21:30 UTC, Joshua Jensen	no flags	Details
Reset dev->quota to dev->weight in tg3_poll_controller (431 bytes, patch) 2005-04-19 21:38 UTC, Jeff Moyer	no flags	Details \| Diff
Patch to fix up all NAPI enabled netdump drivers. (1.65 KB, patch) 2005-04-22 01:39 UTC, Jeff Moyer	no flags	Details \| Diff
Show Obsolete (1) View All

Links
System	ID	Private	Priority	Status	Summary	Last Updated
Red Hat Product Errata	RHSA-2005:663	0	qe-ready	SHIPPED_LIVE	Important: Updated kernel packages available for Red Hat Enterprise Linux 3 Update 6	2005-09-28 04:00:00 UTC

Description Joshua Jensen 2005-01-18 20:53:39 UTC

Description of problem:

netdump isn't reliable

The client is using the MAC address of the router, and the IP the
netdump server.  They both agree on the ssh keys involved... and
netdump on the client can talk to the server.  However, it never gives
us a full dump.

Attached is the log file that is created on the server side and the
relevant syslog lines from the serverside, and a tcpdump.bin file of
the UDP packets going back and forth.

Version-Release number of selected component (if applicable):

RHEL3 x86, newest kernel and 
netdump-server-0.6.11-3.i386 and netdump-0.6.11-3.i386

Comment 1 Joshua Jensen 2005-01-18 20:54:26 UTC

Created attachment 109945 [details]
log from the /var/crash/IP-date directory

Comment 2 Joshua Jensen 2005-01-18 20:55:36 UTC

Created attachment 109946 [details]
tcpdump from the server-side, taken during the "Got too many timeouts"  messages

Comment 3 Joshua Jensen 2005-01-18 20:56:30 UTC

Created attachment 109947 [details]
syslog from server side... notice all the different "timouts" messages

Comment 4 Dave Anderson 2005-01-18 21:03:26 UTC

Right -- the handshake is not working in your configuration.
Can you post a copy of your /etc/sysconfig file?

Comment 5 Joshua Jensen 2005-01-18 21:15:35 UTC

I will note that the log produced isn't worthless... it looks like a
decent kernel OOPS.  However, netdump is producing all that is
promised... we are supposed to get more, right?

Comment 6 Joshua Jensen 2005-01-18 21:18:31 UTC

Sure.... here is the /etc/sysconfig/netdump file:

# LOCALPORT=6666
DEV=eth0
NETDUMPADDR=64.102.29.132
# NETDUMPPORT=6666
NETDUMPMACADDR=00:00:0C:07:AC:01
# IDLETIMEOUT=
# NETDUMPKEYEXCHANGE=none
#
SYSLOGADDR=64.102.29.131
# SYSLOGPORT=514
SYSLOGMACADDR=00:00:0C:07:AC:01

Comment 7 Joshua Jensen 2005-01-18 21:20:52 UTC

I will note that I found after I posted this that the client and
server are on the SAME subnet... didn't realize that before.  However,
when changing the MAC as specified on the client to that of the
netdump server, the behavior of everything... syslog, tcpdump, log
file created on server... is the exact same.

Comment 8 Joshua Jensen 2005-01-18 21:22:17 UTC

Sorry, the NETDUMPADDR and SYSLOGADDR values above are in fact both
64.102.29.132

Comment 9 Dave Anderson 2005-01-18 21:36:14 UTC

So, if you remove the NETDUMPMACADDR setting in the netdump config
file, does netdump start working correctly?

Comment 10 Joshua Jensen 2005-01-18 22:27:40 UTC

Hmm... no, it doesn't change the behavior.  We still get the same
behavior:

netdump[13741]: Got too many timeouts in handshaking, ignoring client
0x40661d25
netdump[13741]: Got too many timeouts waiting for SHOW_STATUS for
client 0x40661d25, rebooting it

Is specifying the NETDUMPMACADDR not really needed... even on machines
that dumping to a non-local-subnet'ed server?

Comment 11 Dave Anderson 2005-01-18 22:36:26 UTC

NETDUMPMACADDR would only be needed if the server is not on
the local subnet.

I'm presuming that after you removed the NETDUMPMACADDR that you did
a "service netdump stop" followed by a "service netdump start", but
I just want to make sure. 

Anyway, presuming that was done, Jeff, do you see any reason why a
server on the same subnet could not communicate with the client on
the initial handshake?  (I added Jeff Moyer to the cc: list, because
he is the "netdump/network guy", who handles this type of situation
far better than I...)

Comment 12 Joshua Jensen 2005-01-18 22:47:27 UTC

Yes... I stopped and started the netdump client after making the
changes, and before performing the last test.

Comment 13 Joshua Jensen 2005-01-19 15:55:55 UTC

So what is next here?  Why is this failing on a local switched subnet?

Comment 14 Dave Anderson 2005-01-19 15:59:31 UTC

Don't know.

Jeff, do you have any ideas here?

Comment 15 Jeff Moyer 2005-01-19 16:12:01 UTC

The tcpdump output does not include packet data, which would be useful in this
case.  I'm guessing, based on the syslog output, that the client is continuously
trying to handshake with the server, but never receiving the responses.  You can
verify this by telling me whether the client actually reboots (notice the server
repeatedly trying to reboot it).  I'm guessing it doesn't.

Please include the exact kernel version used.  "latest" is hardly useful when I
have to look at a billion bug reports.  Help a brother out.

Also, what network card are you using on the dumping system?

Comment 16 Joshua Jensen 2005-01-19 16:23:02 UTC

The client never reboots... but then again I don't have the IDLETIMEOUT set
either.  The kernel version used is: 2.4.21-27.0.1.ELsmp on both the client and
server.

On the client we are using this card:

eth0: Tigon3 [partno(NA) rev 1002 PHY(5703)] (PCIX:100MHz:64-bit)
10/100/1000BaseT Ethernet 00:0e:7f:20:66:e3

lspci show it as this:

02:02.0 Ethernet controller: Broadcom Corporation NetXtreme BCM5703X Gigabit
Ethernet (rev 02)
        Subsystem: Compaq Computer Corporation NC7781 Gigabit Server Adapter
(PCI-X, 10,100,1000-T)
        Flags: bus master, 66Mhz, medium devsel, latency 64, IRQ 15
        Memory at f7de0000 (64-bit, non-prefetchable) [size=64K]
        Expansion ROM at <unassigned> [disabled] [size=64K]
        Capabilities: [40] PCI-X non-bridge device.
        Capabilities: [48] Power Management version 2
        Capabilities: [50] Vital Product Data
        Capabilities: [58] Message Signalled Interrupts: 64bit+ Queue=0/3 Enable

Comment 17 Jeff Moyer 2005-01-19 17:13:05 UTC

Ok, I don't see any regressions in that bit of code.

In the past, we have seen problems like this when the network is heavily loaded,
especially with UDP and/or broadcast traffic.  Can you test the netdump setup
with a cross-over cable to a server?  This would help narrow down the problem.

Thanks.

Comment 18 Joshua Jensen 2005-01-19 21:14:20 UTC

Bingo... with a crossover cable it works perfectly and completely (first time
I've ever seen it do so).  This can't be "the solution" though.  How can we make
netdump work on a real network?

Comment 19 Jeff Moyer 2005-01-19 22:28:53 UTC

Created attachment 109994 [details]
try to auto-negotiate 100mb fd so we are not inundated with packets

Ok, here is a really disgusting patch that may just work.  I think what is
happening is as follows:

When netdump starts, we disable interrupts and switch to polling mode for the
network card.  On faster networks, we may not be able to process packets as
quickly as they come in.  I don't believe this to be a limitation of the CPU
speed, but rather a limitation in the way the code is structured.  (this is
just a theory).

What this patch does is tries to re-negotiate the speed of the network card. 
It tries to set it at 100MB Full Duplex.

As such, you need to MAKE SURE that the switch port that the netdump client is
plugged into has auto negotiation enabled.

Please give this a try.

Comment 20 Jeff Moyer 2005-01-19 22:33:42 UTC

Created attachment 109995 [details]
same patch, but this one compiles

Sorry, fired too soon.	This patch should do the trick.

Comment 21 Joshua Jensen 2005-01-19 23:08:12 UTC

Can we get this in the form of a binary test kernel?  Or can we just
get a netdump.o to replace?

Comment 22 Jeff Moyer 2005-01-19 23:09:57 UTC

Sure.  I've got a build running now.  I'll post it in the morning.

Comment 23 Neil Horman 2005-01-20 14:35:14 UTC

Just to add this in if its helpful at all, I've seen cases where netdump will
time out because someone pulled the cable btw. a dumping host and the swtich. 
The timeout was triggered because STP was enabled on the switch and the switch
port when into blocking mode while the spanning tree re-negotiated.  While in
blocking mode, the frames to the dumping client were dropped.  If the
renegotiate patch is tested, you should probably disable stp on the port
connected to the host (if its not done already).  In fact, it may not hurt to
disable STP on that port anyway, just to see if it clears up the problem.

Comment 24 Brian Long 2005-01-20 15:13:15 UTC

We have portfast enabled on all switch ports, so spanning tree is not an issue.

Comment 25 Jeff Moyer 2005-01-20 15:57:22 UTC

Created attachment 110016 [details]
Kernel built with proposed netdump patch

Here is a debug kernel for you to test.

Comment 26 Joshua Jensen 2005-01-20 19:33:08 UTC

The new kernel doesn't even try to do the handshake... at least it doesn't say
it does.  I get the kernel OOPS in the "log" file, but no vmcore.

The last thing on the console of the client is:

CPU#0 is executing netdump.
CPU#1 is frozen.
 P CISS Driver (v 2.4.52.RH2)

Comment 27 Jeff Moyer 2005-01-20 20:43:38 UTC

Yeah, that was a longshot.  I'll see if I can reproduce the problem in-house.

Comment 28 Joshua Jensen 2005-01-28 18:01:49 UTC

Are you able too reproduce?  This is critical for Linux in our
environment.

Comment 29 Jeff Moyer 2005-01-31 13:00:06 UTC

I haven't had time to look into this further.  I'll update this bug
when I have any further status.

Comment 30 Joshua Jensen 2005-01-31 21:28:35 UTC

Here is a new datapoint.  From the Cisco switch that the client and server are
plugged into, we captured all traffic flowing between the two ports.

The tarball I'll attach contains the log file, the vmcore-incomplete file (which
sometimes we get, and sometimes we don't), and the  switch sniffer file
nam_rtp-lnx-16.enc (ethereal can decode).

Comment 31 Joshua Jensen 2005-01-31 21:30:10 UTC

Created attachment 110462 [details]
2nd netdump tarfile

Comment 32 Neil Horman 2005-02-01 12:38:31 UTC

Any thoughts as to why there are two of every frame in the trace?  That seems
rather strange to me.

Comment 33 Joshua Jensen 2005-02-01 15:07:26 UTC

Hmmm... it could be because it was a port-to-port trace... so if one side sent
it, the other side would see it too... hence the duplication.  In fact, this
would give us a way to see if a packet got dropped when going through the
switch, no?

Comment 34 Neil Horman 2005-02-01 16:16:07 UTC

Ok, that makes sense.  I'll see if I can find a lost duplicate packet.

Comment 35 ginnie nuckles 2005-02-08 18:04:15 UTC

please note that I am in the same situation. For a while I was 
getting the "Got to many time outs" messages. Now on the server Im 
testing on all I see the handshake attempt.  I tried a cross over 
cable but it did not work. Im not sure if I have the netdump conf 
file coded correctly. But I can ping on the cross over between the 
server and client.  This all worked before our upgrade from RHEL 3 U 
3 to U 4. Now its broken. 
Please verify my configuration,, I would like to see if the cross 
over cable fixs this. 

#
# LOCALPORT=6666
DEV=eth1    ( on client) 
NETDUMPADDR=192.168.1.41  ( on server  ) 
# NETDUMPPORT=6666
NETDUMPMACADDR=00:02:A5:48:39:91  ( on server ) 
# IDLETIMEOUT=
#
# If you want the console log (not crash dumps) sent via the
# syslog service, set SYSLOGADDR to the IP of the syslog server.
# The other two values normally remain unchanged.
SYSLOGADDR=192.168.1.41
SYSLOGPORT=514
SYSLOGMACADDR=00:02:A5:48:39:91

Comment 36 ginnie nuckles 2005-02-10 18:09:17 UTC

We recieved a crash on one of our systems today but no netdump. With 
the frequency and instability of the crashes we recieve on redhat 
kernels it is critical to have this working properly. Please change 
the severity and priority to HIGH for this call ! 

Feb 10 08:50:00 maxdev4
Feb 10 08:50:03 maxdev4  CPU#0 is frozen.
Feb 10 08:50:03 maxdev4  CPU#1 is frozen.
Feb 10 08:50:03 maxdev4  CPU#2 is frozen.
Feb 10 08:50:03 maxdev4  CPU#3 is executing netdump.
Feb 10 08:50:03 maxdev4  CPU#4 is frozen.
Feb 10 08:50:03 maxdev4  CPU#5 is frozen.
Feb 10 08:50:03 maxdev4  CPU#6 is frozen.
Feb 10 08:50:03 maxdev4  CPU#7 is frozen.
Feb 10 08:50:03 maxdev4  < netdump activated - performing handshake 
with the client. >
Feb 10 08:50:13 pacexp1 automount[4124]: attempting to mount 
entry /home/sanjit
Feb 10 08:50:13 pacexp1 sshd(pam_unix)[29936]: session opened for 
user sanjit by (uid=1003)
Feb 10 08:53:59 pacexp1 sshd(pam_unix)[30086]: session opened for 
user root by (uid=0)
Feb 10 08:57:19 pacexp1 sshd(pam_unix)[29936]: session closed for 
user sanjit
Feb 10 09:00:35 adminp2 dhcpd: DHCPREQUEST for 172.24.111.58 
(172.24.111.251) from 00:0e:7f:ff:ea:56 via eth1
Feb 10 09:00:35 adminp2 dhcpd: DHCPACK on 172.24.111.58 to 
00:0e:7f:ff:ea:56 via eth1
Feb 10 09:01:55 pacexp1 sshd(pam_unix)[30086]: session closed for 
user root
Feb 10 09:03:02 pacexp1 automount[30312]: expired /home/sanjit
Feb 10 09:07:41 adminp2 dhcpd: DHCPREQUEST for 172.24.111.50 
(172.24.111.251) from 00:0b:cd:cc:ba:32 via eth1
Feb 10 09:07:41 adminp2 dhcpd: DHCPACK on 172.24.111.50 to 
00:0b:cd:cc:ba:32 via eth1
Feb 10 09:11:35 msp1 sshd(pam_unix)[17753]: session opened for user 
tmiller by (uid=1058)
Feb 10 09:12:21 adminp2 sshd(pam_unix)[15195]: session opened for 
user root by (uid=0)
Feb 10 09:12:48 adminp2 sshd(pam_unix)[15195]: session closed for 
user root
Feb 10 09:14:08 adminp2 sshd(pam_unix)[15263]: session opened for 
user netdump by (uid=34)
Feb 10 09:14:08 adminp2 sshd(pam_unix)[15263]: session closed for 
user netdump
Feb 10 09:14:09 maxdev4  [...network console startup...]
Feb 10 09:14:21 adminp2 sshd(pam_unix)[15265]: session opened for 
user root by (uid=0)
Feb 10 09:17:43 fhhp1 sshd(pam_unix)[24821]: session closed for user 
pcxoper
Feb 10 09:18:02 msp3 sshd(pam_unix)[30278]: session opened for user 
tmiller by (uid=1058)

Comment 38 Howard Owen 2005-02-25 00:20:43 UTC

Those duplicates are probably an artifact of the spanned port on the catalyst
used to tap the traffic for the sniffer.

Comment 40 Jeff Moyer 2005-03-02 21:00:40 UTC

Here is one more thing to try.  Manually set the card to 100 Mbit on the dumping
system.  I believe the supported way of doing this is via ethtool.  Then, cause
a crashdump and let me know if it finishes.

Thanks.

Comment 41 Joshua Jensen 2005-03-07 14:58:32 UTC

I have found the problem.  The netdump-server is running on a machine with an
eth card that has a few IP address, all but one of which are assigned via
interface aliases.  All these IP addresses are on the same logical subnet
(vlan). The netdump clients all point to an IP that is on one of the IPs that is
associated with and interface alias.  I found through network sniffing on the
server side that the netdump client sends its request to IP A, and while the
server *does* receive the request and answer back, it answers from the main
non-aliased IP B.

Talking to A and hearing back from B apparently doesn't fly with the netdumping
client.  When I change the client to point at IP B... it works just fine.

From my understanding of it, the netdump-server would in fact answer back to th
e client from an aliased IP A if it was bound specifically to that IP.  Is there
a way to get netdump-server, to bind to a specific IP address?  This is an
option for most other network services... but I don't see any mention of that in
the netdump-server man page or init script.

Comment 42 Jeff Moyer 2005-03-08 13:57:26 UTC

Wow, that's the problem?  Good work!  Consider this an RFE (request for
enhancement).  I'll work to get the configuration option you describe into the
next release.

Thanks!

Comment 43 Brent Fox 2005-04-05 18:56:19 UTC

Jeff, any idea if this problem would also apply to machines with bonded
interfaces?

Comment 45 Jeff Moyer 2005-04-05 19:01:45 UTC

Netdump doesn't support the bonding driver.  It is a separate issue.

Comment 46 Jeff Moyer 2005-04-11 21:23:24 UTC

I think the next step here is to change the netdump server to send batches of
requests when it misses some pre-defined number of responses.  If we flood the
network with requests, we are more likely to get them through a saturated network.

Comment 47 Jeff Moyer 2005-04-14 21:07:16 UTC

I've started in on this, but to no avail.  It's clear that the messages coming
from the panicked system make it to the netdump-server.  The responses from the
server never get back to the panicked system, though.  Whether this is due to
the packets not reaching the server, or the netdump server not being able to
process all incoming packets remains up in the air (though I suspect the
latter).  I'll plug in a hub, next, and get some tcpdump output.

Note that I removed the udelay(100) in the netdump handshake polling code, to
try to process more packets, and this did not help the situation.

Comment 48 Jeff Moyer 2005-04-19 21:38:05 UTC

Created attachment 113379 [details]
Reset dev->quota to dev->weight in tg3_poll_controller

Please try this patch.	It fixes a bug whereby if the controller exhausts its
current NAPI budget, it will never be reset.  This results in the controller
never delivering any more packets to the netdump subsystem.

Comment 50 Joshua Jensen 2005-04-19 22:25:54 UTC

Jeff, who are you waiting on feedback from?  As far as I am concerned my problem
has been %95 percent solved by finding that the netdump server can't be on an
aliased network interface.  Is someone else on this bug going to test your patch?

Also on a side note, one of our problems was also that on HP servers their hpasm
software + ASR hardware was causing problems.  The software would check in with
the hardware... when it stops doing so the ASR reboots the machine.  This isn't
a bad idea in general (if the software is locked up then perhaps you want the
machine to reboot), but it doesn't mix well with netdump.  Disabling the ASR
timeout in bios does the trick... increasing the timeout value to the max of 30
minutes might also work for fast netdumps with small ammounts of memory.

Comment 51 Jeff Moyer 2005-04-19 23:45:44 UTC

Joshua,

It appears that this one bugzilla is tracking multiple bugs.  In your case, you
say that you are 95% solved.  What is the remaining 5%?  I think the main point
of confusion here is the Summary line of this bug.  =)

So, do me a favor and update the summary to more accurately describe your
situation.  I'll then clone the bug for the problem I described in comment #48.  

ginnie nuckles,

I think the regression you are experiencing is that netdump will no longer work
when the interface is not the first (i.e. eth0 works, everything else doesn't).
 Please take a look at bug #150374 and see if that describes your situation.  If
so, we'll move you onto the CC list there.

Thanks for bearing with me, here.

Comment 52 Joshua Jensen 2005-04-20 17:50:21 UTC

The remaining 5% of the problem is that sometimes netdump just freezes up. 
Changing summary now so that you can duplicate this bug to address the client
lockups

Comment 55 Jeff Moyer 2005-04-22 01:39:25 UTC

Created attachment 113503 [details]
Patch to fix up all NAPI enabled netdump drivers.

This patch is more comprehensive, fixing the bug in all of the netdump NAPI
enabled drivers.

Comment 56 Jeff Moyer 2005-04-22 15:31:58 UTC

The above patch (id=113503) was submitted for internal review.

Comment 57 Ernie Petrides 2005-04-23 00:44:34 UTC

A fix for this problem has just been committed to the RHEL3 U6
patch pool this evening (in kernel version 2.4.21-32.2.EL).

Comment 58 Joshua Jensen 2005-04-24 00:02:16 UTC

Very nice!  Thanks Red Hat!

Comment 64 Red Hat Bugzilla 2005-09-28 14:42:31 UTC

An advisory has been issued which should help the problem
described in this bug report. This report is therefore being
closed with a resolution of ERRATA. For more information
on the solution and/or where to find the updated files,
please follow the link below. You may reopen this bug report
if the solution does not work for you.

http://rhn.redhat.com/errata/RHSA-2005-663.html

Comment 65 Joshua Jensen 2005-10-21 15:33:24 UTC

Reading the man page to netdump-server, I see no indication that netdump-server
can bind to a specific IP or interface with netdump-server-0.7.7-2, which came
out in the RHEL3 U6 update.

Comment 66 Jeff Moyer 2005-10-21 15:42:40 UTC

Joshua,

Sorry, the bug fixed here was not the one you were seeing.  I got confused.  So,
I'm creating a new bug, and I'll add you to the CC list.  So, I'm closing this
bug again.

Comment 67 Jeff Moyer 2005-10-21 15:51:18 UTC

The new bug ID is 171405.  Any interested parties should add themselves to the
CC list for that bug.

Note You need to log in before you can comment on or make changes to this bug.