Bug 166786
Summary: | Dell onboard e1000 stops receiving packets from some hosts after 30 minutes | ||||||
---|---|---|---|---|---|---|---|
Product: | [Fedora] Fedora | Reporter: | Eric Z. Ayers <ericzundel> | ||||
Component: | kernel | Assignee: | John W. Linville <linville> | ||||
Status: | CLOSED CANTFIX | QA Contact: | Brian Brock <bbrock> | ||||
Severity: | medium | Docs Contact: | |||||
Priority: | medium | ||||||
Version: | 4 | CC: | davej, wtogami | ||||
Target Milestone: | --- | ||||||
Target Release: | --- | ||||||
Hardware: | i386 | ||||||
OS: | Linux | ||||||
Whiteboard: | |||||||
Fixed In Version: | Doc Type: | Bug Fix | |||||
Doc Text: | Story Points: | --- | |||||
Clone Of: | Environment: | ||||||
Last Closed: | 2005-09-15 15:01:54 UTC | Type: | --- | ||||
Regression: | --- | Mount Type: | --- | ||||
Documentation: | --- | CRM: | |||||
Verified Versions: | Category: | --- | |||||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||
Cloudforms Team: | --- | Target Upstream Version: | |||||
Embargoed: | |||||||
Attachments: |
|
Description
Eric Z. Ayers
2005-08-25 18:20:35 UTC
The IT guy commented that there is a firewall in between 158.155.2.3 and 158.155.4.1 (the default route) which is common to the systems we are having troubles with. Still, it doesn't explain why the problem is not reproducable when we switch to use the 3c905 NIC. It is difficult to know where to start...please attach the output of running "sysreport"...thanks! Created attachment 118257 [details]
Output of running 'sysreport'
FYI, I did just update the kernel to 2.6.12-1.1447_FC4smp - same problem. The network went out while I was running 'sysreport' above. Perhaps there is an auto-negotiation problem? I have occasionally seen or heard of problems like this that go away when a fixed port configuration is used. Could you force the link speed to 1000/Full (or whatever is appropriate) at the switch? For good measure, you should also set ETHTOOL_OPTS in /etc/sysconfig/network-scripts/ifcfg-ethX: ETHTOOL_OPTS="speed 1000 duplex full autoneg off" Modify that as appropriate if not using 1000/full, of course. Could you give that a try and report the results...thanks! The machine goes live in about 1 week. Folks are getting their feet wet now, the new hardware replaces one of our mainstay machines running RH Linux 7.3. I'm waiting for a chance to reboot the machine and re-enable the onboard controller. I won't have much of an opportunity to do these kinds of tests after the server goes live. We've set the port to full duplex, 100Mbit, replaced a cable was questionable (we jiggled it and the switch port re-negotiated), and added the line ETHTOOL_OPTS to the network interface script. No joy after that change. I rebooted this morning after nailing the port to 100MBit full duplex and adding the ETHTOOL_OPTS line: $ uptime 08:57:43 up 31 min, 2 users, load average: 0.00, 0.02, 0.06 The problem is exhibiting itself again already. Thanks for trying to help me resolve this problem. We have a workaround (installing a second NIC) and tonight we are taking the server 'live'. After 7pm EDT or so, I won't be able to screw around with the onboard NIC without disrupting business. If there is something else you can think of to try today, let me know. Moving this to CANTFIX due to need for continued testing that the reporter will be unable to conduct. Please reopen if this situation changes. |