Bug 494381 - RHEL 5.3 hangs during huge 'rcp' file copy at 95% GbE rate, 82575/igb NICs
Summary: RHEL 5.3 hangs during huge 'rcp' file copy at 95% GbE rate, 82575/igb NICs
Keywords:
Status: CLOSED CURRENTRELEASE
Alias: None
Product: Red Hat Enterprise Linux 5
Classification: Red Hat
Component: kernel
Version: 5.3
Hardware: All
OS: Linux
low
medium
Target Milestone: rc
: ---
Assignee: Red Hat Kernel Manager
QA Contact: Red Hat Kernel QE team
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2009-04-06 16:47 UTC by starlight
Modified: 2011-03-04 10:06 UTC (History)
4 users (show)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed: 2011-03-04 10:06:24 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)

Description starlight 2009-04-06 16:47:24 UTC
Description of problem:

Using 'rcp' to copy about 3TB of data in 20 large files from one 
server to another.  Files reside on dedicated 'ext4' 4-way 
LVM striped logical volumes.  Takes 6.5 hours to complete at 95% 
gigabit Ethernet utilization, but totally locked/hung RHEL 5.3
twice within the first hour.  No kernel messages in 'syslogd' to 
help understand the problem.  No response from the console, 
though network interfaces were all pingable.  Had to reset
the server to recover.

Worked around the problem by booting up the F9+kernel.org
partition on the same DL160 server and copying the files.

The receiving server is a Tyan S2912 also running RHEL
5.3, but with the more established 82571/e1000e network
interfaces instead of 82575/igb NIC.

Version-Release number of selected component (if applicable):

kernel 2.6.18-128.1.1.el5

igb 1.3.8.6 compiled from source

(Would use native 'igb.ko', but on the DL160 the RH version does 
not work at all with 82575.  Same version of 'igb' works under 
F9 with kernel.org 2.6.27.7 kernel.)

How reproducible:

'rcp' huge files from one server to another

Steps to Reproduce:
1. rcp /???/file remote:/???/file
2.
3.
  
Actual results:

hangs Linux

Expected results:

file copy completes

Additional info:

I'm not sure I expect anyone to care about fixing this due 
to our use of the upstream 'igb' driver.  However it's a serious 
failure of the "production" version of RH so it seems worth 
reporting.  Interesting that a bleeding-edge kernel has no 
problem with the network file copy.

Comment 1 Stefan Assmann 2011-03-04 10:06:24 UTC
This should be fixed by now. Please reopen if you still see this problem with the latest RHEL5 kernel.


Note You need to log in before you can comment on or make changes to this bug.