From Bugzilla Helper:
User-Agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:126.96.36.199) Gecko/20070206 (CK-IBM) Firefox/188.8.131.52
Description of problem:
E1000 RHEL 5 Network Install Issue. Network build on a machine with an e1000 network card may take 12 hours to complete. Other machines complete in aprox 1 hour.
Problem: Network install times for machines with the Intel e1000 network driver extremely slow. Issue seemed to be specifically the time it takes to reconnect to the server to download each rpm required for the install. As the build progresses the time it takes to make a connection increases. We are certain the network card is the problem, we are able to install an external (PCMCIA) network card in the same machine and the install completes without issue.
We performed a packet capture on this machine We compared the results of a normal running machine to a e1000 nic machine. The following is the major difference.
FTP - Response: 22
Time delta from previous packet: 0.003043000 seconds
UDP Source port: 2967 Destination port: 2967
Time delta from previous packet: 0.226744000 seconds
[TCP Retransmission] Responce: 22
Time delta from previous packet: 0.696182000 seconds
The above happens with almost every cycle (FTP Connect, Download RPM, Disconnect) on the slow build. On a normal build it almost never happens. " Time delta from previous packet"
from the above certainly shows exactly why the build is much slower. Here are some other time deltas pulled at random out of the capture file of "normal" traffic.
Time delta from previous packet: 0.001271000 seconds
Time delta from previous packet: 0.000113000 seconds
Time delta from previous packet: 0.000081000 seconds
Time delta from previous packet: 0.000513000 seconds
Workarounds Tried (None Fixed the problem)
1. Different amounts of physical memory... 512mb 1GB 2GB
2. Different ftp clients and versions vsftpd and proftpd both slower than other machines (vsftpd seems faster than proftpd but still much slower than other machines)
3. Different protocols http vs ftp
4. Pulled the install server out behind firewall
5. Different cables and dedicates switch (only server and client on switch)
6. Updated machine bios
7. RHEL 4 vs RHEL 5 kernal
8. Verified connection speed was 100 Full Duplex
IBM Machines the Problem have be reproduced on all with the Intel e1000 network card
1. T60 Thinkpad
2. T60p Thinkpad
3. x60s Thinkpad
Version-Release number of selected component (if applicable):
Steps to Reproduce:
1.Boot RHEL 5 install boot CD
2.set up a network install - either FTP or HTTP
3.Install will take many hours on T60/T60p/X60s Thinkpads that have e1000
hmmm this looks like a rhel5 kernel issue. HOwever the version field has 4.4
selected. Does this also apply to rhel4? just trying to get this properly
Sorry - There was no way for me to tag this against the RHEL5 kernel. It
definitely affects RHEL5 boot media installs.
The various teams have traced this back to the Intel 82573L NIC (device id
8086:109a) and that chipsets interaction with Anaconda.
There is a suggested partial workaround that modifies the RxIntDelay value by
following additional boot options (and modprobe options once the
system is installed):
modprobe e1000 RxIntDelay=8
See the following for additional info:
This problem has been resolved by a EEPROM fix for the Intel 82573L NIC (device id
8086:109a) that modifies the Active State Power Management (ASPM) behavior.
Lenovo / Intel provided the EEPROM fix.
We have tested this fix across a wide vary of ThinkPad T60/T60p/X60 that include
the Intel 82573L NIC (device id 8086:109a). The problem has been solved.
The bugzilla ticket can be closed.