Bug 98462 - bonding TLB load sharing fails under heavy UDP Tx stress
Summary: bonding TLB load sharing fails under heavy UDP Tx stress
Keywords:
Status: CLOSED RAWHIDE
Alias: None
Product: Red Hat Enterprise Linux 3
Classification: Red Hat
Component: kernel
Version: 3.0
Hardware: i686
OS: Linux
medium
medium
Target Milestone: ---
Assignee: Jeff Garzik
QA Contact: Brian Brock
URL: http://sourceforge.net/projects/bonding/
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2003-07-02 17:46 UTC by Need Real Name
Modified: 2013-07-03 02:12 UTC (History)
3 users (show)

Fixed In Version: 2.4.21-1.1931.2.349.2.2.ent
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed: 2003-08-03 13:54:50 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)

Description Need Real Name 2003-07-02 17:46:34 UTC
Description of problem:
When running very heavy UDP Tx stress traffic with 10/100 adapters, load-
sharing collapses to only one slave after a few seconds. Caused by a 
unsigned/signed cast error in the TLB code.

Version-Release number of selected component (if applicable):
kernel-2.4.20-1.1931.2.231.2.11.ent

How reproducible:
Configure a bond team with only 10/100 adapters and run very heavy UDP Tx 
stress traffic to many clients. Monitor Tx/Rx activity of the slaves.

Steps to Reproduce:
1. insmod bonding mode=5
2. ifconfig bond0 <ip-addr>
3. ifenslave bond0 eth0 eth1 eth2
4. start stress application (e.g iperf, netperf, etc.)
    
Actual results:
After a few seconds only one slave takes part of load sharing while others stay 
idle. Traffic may pass from slave to slave at 10 sec. intervals (re-balance 
timeout).

Expected results:
All slaves continuously take part of the load sharing.

Additional info:
A bug fix patch was sent by me on June 26th to bond-devel, linux-net and linux-
netdev lists. It was already accepted by Jeff Garzik into his net-drivers-2.4 
BK tree.

Comment 1 Larry Troan 2003-07-16 13:51:59 UTC
ISSUE TRACKER 25886 opened as sev 1

Comment 2 Rik van Riel 2003-07-16 13:57:07 UTC
Jeff, does Taroon already have the patch for this or is it still in your queue ?

Comment 3 Need Real Name 2003-07-23 17:11:08 UTC
Appears to be fix implemented in RHEL 3 B1 candidate kernel (version 2.4.21-
1.1931.2.349.2.2.ent).



Note You need to log in before you can comment on or make changes to this bug.