Description of problem: Using 'rcp' to copy about 3TB of data in 20 large files from one server to another. Files reside on dedicated 'ext4' 4-way LVM striped logical volumes. Takes 6.5 hours to complete at 95% gigabit Ethernet utilization, but totally locked/hung RHEL 5.3 twice within the first hour. No kernel messages in 'syslogd' to help understand the problem. No response from the console, though network interfaces were all pingable. Had to reset the server to recover. Worked around the problem by booting up the F9+kernel.org partition on the same DL160 server and copying the files. The receiving server is a Tyan S2912 also running RHEL 5.3, but with the more established 82571/e1000e network interfaces instead of 82575/igb NIC. Version-Release number of selected component (if applicable): kernel 2.6.18-128.1.1.el5 igb 1.3.8.6 compiled from source (Would use native 'igb.ko', but on the DL160 the RH version does not work at all with 82575. Same version of 'igb' works under F9 with kernel.org 2.6.27.7 kernel.) How reproducible: 'rcp' huge files from one server to another Steps to Reproduce: 1. rcp /???/file remote:/???/file 2. 3. Actual results: hangs Linux Expected results: file copy completes Additional info: I'm not sure I expect anyone to care about fixing this due to our use of the upstream 'igb' driver. However it's a serious failure of the "production" version of RH so it seems worth reporting. Interesting that a bleeding-edge kernel has no problem with the network file copy.
This should be fixed by now. Please reopen if you still see this problem with the latest RHEL5 kernel.