Bug 494381 - RHEL 5.3 hangs during huge 'rcp' file copy at 95% GbE rate, 82575/igb NICs
RHEL 5.3 hangs during huge 'rcp' file copy at 95% GbE rate, 82575/igb NICs
Status: CLOSED CURRENTRELEASE
Product: Red Hat Enterprise Linux 5
Classification: Red Hat
Component: kernel (Show other bugs)
5.3
All Linux
low Severity medium
: rc
: ---
Assigned To: Red Hat Kernel Manager
Red Hat Kernel QE team
:
Depends On:
Blocks:
  Show dependency treegraph
 
Reported: 2009-04-06 12:47 EDT by starlight
Modified: 2011-03-04 05:06 EST (History)
4 users (show)

See Also:
Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of:
Environment:
Last Closed: 2011-03-04 05:06:24 EST
Type: ---
Regression: ---
Mount Type: ---
Documentation: ---
CRM:
Verified Versions:
Category: ---
oVirt Team: ---
RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: ---


Attachments (Terms of Use)

  None (edit)
Description starlight 2009-04-06 12:47:24 EDT
Description of problem:

Using 'rcp' to copy about 3TB of data in 20 large files from one 
server to another.  Files reside on dedicated 'ext4' 4-way 
LVM striped logical volumes.  Takes 6.5 hours to complete at 95% 
gigabit Ethernet utilization, but totally locked/hung RHEL 5.3
twice within the first hour.  No kernel messages in 'syslogd' to 
help understand the problem.  No response from the console, 
though network interfaces were all pingable.  Had to reset
the server to recover.

Worked around the problem by booting up the F9+kernel.org
partition on the same DL160 server and copying the files.

The receiving server is a Tyan S2912 also running RHEL
5.3, but with the more established 82571/e1000e network
interfaces instead of 82575/igb NIC.

Version-Release number of selected component (if applicable):

kernel 2.6.18-128.1.1.el5

igb 1.3.8.6 compiled from source

(Would use native 'igb.ko', but on the DL160 the RH version does 
not work at all with 82575.  Same version of 'igb' works under 
F9 with kernel.org 2.6.27.7 kernel.)

How reproducible:

'rcp' huge files from one server to another

Steps to Reproduce:
1. rcp /???/file remote:/???/file
2.
3.
  
Actual results:

hangs Linux

Expected results:

file copy completes

Additional info:

I'm not sure I expect anyone to care about fixing this due 
to our use of the upstream 'igb' driver.  However it's a serious 
failure of the "production" version of RH so it seems worth 
reporting.  Interesting that a bleeding-edge kernel has no 
problem with the network file copy.
Comment 1 Stefan Assmann 2011-03-04 05:06:24 EST
This should be fixed by now. Please reopen if you still see this problem with the latest RHEL5 kernel.

Note You need to log in before you can comment on or make changes to this bug.