Bug 496248
Summary: | When the network is initialized by e1000e driver, I lose connection to the IPMI card | ||
---|---|---|---|
Product: | Red Hat Enterprise Linux 5 | Reporter: | Luis <luis.figueiredo> |
Component: | kernel | Assignee: | Peter Martuccelli <peterm> |
Status: | CLOSED CURRENTRELEASE | QA Contact: | Red Hat Kernel QE team <kernel-qe> |
Severity: | medium | Docs Contact: | |
Priority: | low | ||
Version: | 5.4 | CC: | abuse, dzickus, jarod, jburke, jsafrane, pasteur, ralph, ryan.dooley |
Target Milestone: | rc | Keywords: | Regression |
Target Release: | --- | ||
Hardware: | All | ||
OS: | Linux | ||
URL: | http://bugs.centos.org/view.php?id=3477 | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | Bug Fix | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2009-12-23 16:04:55 UTC | Type: | --- |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Description
Luis
2009-04-17 14:27:36 UTC
If I understand it correctly, this is kernel problem. The report is corrrect. I see the same problem with Supermicro server PDSMI+ motherboard and AOC-IPMI20-E IPMI card. I already had the IPMI card returned and replaced in vain... When rebooting an older kernel (2.6.18-92.1.18.el5xen), the IPMI starts working again. See commit eb7c3adb1ca92450870dbb0d347fc986cd5e2af4 that is included in kernel patch-2.6.28-rc5-git2. This should urgently be back ported to 2.6.18 and noted in the errata. What is the current status? Will the fix for this issue be included in the next kernel for RHEL? I continue with the same problem and with the last kernel. uname -a 2.6.18-128.1.10.el5PAE Kernel 2.6.18-128.1.14.el5xen still seems to contain the broken driver version 0.3.3.3-k4. So I guess nothing has changed to it. Work-around in http://bugs.centos.org/view.php?id=3477 I'm actually seeing this behavior with with 2.6.18-164.el5.x86_64, the bnx2 driver that comes with it (1.9.3 I believe), and the Dell R710 platform. The system comes up just fine with SOL over IPMI and as soon as the bnx2 driver takes over I lose IPMI. If I ssh to the machine and /sbin/reboot, as soon as the driver is unloaded, IPMI comes back. I've downloaded and installed Broadcom's lastest netxtreme2 driver (1.9.20b5). I've flashed the BIOS to the latest version (1.2.6) as well as the iDRAC firmware (to 1.20.1). None of it has helped much. (In reply to comment #7) > I'm actually seeing this behavior with with 2.6.18-164.el5.x86_64, the bnx2 > driver that comes with it (1.9.3 I believe), and the Dell R710 platform. This bug is specific to hardware driven by the e1000e driver, you've got a similar-but-different problem, which should be filed under another bug. Actually this turned out to be Ganglia+Multicast for me. The Dell iDRAC is running some OS (Linux?) that was attempting to process the multicast traffic. Turn off Ganglia and the shared bnx2 connection does the right thing. Well, it looks like things have improved. Kernel 2.6.18-164.9.1.el5xen seems to contain a version of the e1000e kernel driver that supports the CrcStripping option. So on SuperMicro boards, one needs to add a file in the /etc/modprobe.d directory containing the line options e1000e CrcStripping=0 Can this added to the errata? The CrcStripping option was actually added in the 5.4 kernels, so this bug is already fixed, as far as I can see. We don't typically update an errata after it has already been released. Could be a candidate for adding a knowledgebase article for, but I don't know offhand how to make that happen... |