Bug 746486 - e1000e - AER: Multiple Corrected error received
Summary: e1000e - AER: Multiple Corrected error received
Keywords:
Status: CLOSED NOTABUG
Alias: None
Product: Red Hat Enterprise MRG
Classification: Red Hat
Component: realtime-kernel
Version: 2.1
Hardware: x86_64
OS: Linux
unspecified
unspecified
Target Milestone: ---
: ---
Assignee: Red Hat Real Time Maintenance
QA Contact: David Sommerseth
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2011-10-16 14:39 UTC by evcz
Modified: 2016-05-22 23:33 UTC (History)
4 users (show)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed: 2014-09-25 19:57:23 UTC
Target Upstream Version:


Attachments (Terms of Use)
lspci -vv (61.20 KB, text/plain)
2011-10-16 14:39 UTC, evcz
no flags Details

Description evcz 2011-10-16 14:39:50 UTC
Created attachment 528394 [details]
lspci -vv

Description of problem:


Version-Release number of selected component (if applicable):
kernel-rt-2.6.33.9-rt31.75

How reproducible:
constantly happening when using MRG kernel.
Working fine with non-MRG kernels


Steps to Reproduce:
1. install mrg kernel
2. reboot
  
Actual results:
Everything appears to be working ok but /var/log/messages is filled with these errors:

Oct 15 13:54:14 pink kernel: pcieport 0000:00:01.0: AER: Multiple Corrected error received: id=0000
Oct 15 13:54:14 pink kernel: e1000e 0000:01:00.0: PCIe Bus Error: severity=Corrected, type=Physical Layer, id=0100(Receiver ID)
Oct 15 13:54:14 pink kernel: e1000e 0000:01:00.0:   device [8086:10d3] error status/mask=00000041/00002000
Oct 15 13:54:14 pink kernel: e1000e 0000:01:00.0:    [ 0] Receiver Error         (First)
Oct 15 13:54:14 pink kernel: e1000e 0000:01:00.0:    [ 6] Bad TLP

This kind of errors is being written constantly into the logs while making traffic on eth0 interface

Expected results:
No errors in log

Additional info:
Motherboard: Supermicro X8STi
lspci attached

Comment 1 evcz 2011-10-18 10:51:38 UTC
upgrading to e1000e-1.6.2 didn't fixed the problem as the error log is still getting spammed


Oct 18 12:45:55 pink kernel: udev: starting version 147
Oct 18 12:45:55 pink kernel: e1000e: Intel(R) PRO/1000 Network Driver - 1.6.2-NAPI
Oct 18 12:45:55 pink kernel: e1000e: Copyright(c) 1999 - 2011 Intel Corporation.
Oct 18 12:45:55 pink kernel: e1000e 0000:01:00.0: Disabling ASPM L0s
Oct 18 12:45:55 pink kernel: e1000e 0000:01:00.0: PCI INT A -> GSI 16 (level, low) -> IRQ 16
Oct 18 12:45:55 pink kernel: e1000e 0000:01:00.0: eth0: (PCI Express:2.5GT/s:Width x1) 00:30:48:fb:b8:a6
Oct 18 12:45:55 pink kernel: e1000e 0000:01:00.0: eth0: Intel(R) PRO/1000 Network Connection
Oct 18 12:45:55 pink kernel: e1000e 0000:01:00.0: eth0: MAC: 4, PHY: 8, PBA No: 0101FF-0FF
Oct 18 12:45:55 pink kernel: e1000e 0000:02:00.0: Disabling ASPM L0s
Oct 18 12:45:55 pink kernel: e1000e 0000:02:00.0: PCI INT A -> GSI 16 (level, low) -> IRQ 16
Oct 18 12:45:55 pink kernel: e1000e 0000:02:00.0: eth1: (PCI Express:2.5GT/s:Width x1) 00:30:48:fb:b8:a7
Oct 18 12:45:55 pink kernel: e1000e 0000:02:00.0: eth1: Intel(R) PRO/1000 Network Connection
Oct 18 12:45:55 pink kernel: e1000e 0000:02:00.0: eth1: MAC: 4, PHY: 8, PBA No: 0101FF-0FF
Oct 18 12:45:55 pink kernel: pcieport 0000:00:01.0: AER: Multiple Corrected error received: id=0000
Oct 18 12:45:55 pink kernel: e1000e 0000:01:00.0: PCIe Bus Error: severity=Corrected, type=Physical Layer, id=0100(Receiver ID)
Oct 18 12:45:55 pink kernel: e1000e 0000:01:00.0:   device [8086:10d3] error status/mask=00002041/00002000
Oct 18 12:45:55 pink kernel: e1000e 0000:01:00.0:    [ 0] Receiver Error         (First)
Oct 18 12:45:55 pink kernel: e1000e 0000:01:00.0:    [ 6] Bad TLP
Oct 18 12:45:55 pink kernel: EDAC MC: Ver: 2.1.0 Sep 13 2011
Oct 18 12:45:55 pink kernel: PCI: Discovered peer bus ff
Oct 18 12:45:55 pink kernel: EDAC MC0: Giving out device to 'i7core_edac.c' 'i7 core #0': DEV 0000:ff:03.0
Oct 18 12:45:55 pink kernel: EDAC PCI0: Giving out device to module 'i7core_edac' controller 'EDAC PCI controller': DEV '0000:ff:03.0' (POLLED)
Oct 18 12:45:55 pink kernel: EDAC i7core: Driver loaded.
Oct 18 12:45:55 pink kernel: dca service started, version 1.12.1
Oct 18 12:45:55 pink kernel: ioatdma: Intel(R) QuickData Technology Driver 4.00
Oct 18 12:45:55 pink kernel: ioatdma 0000:00:16.0: PCI INT A -> GSI 16 (level, low) -> IRQ 16
Oct 18 12:45:55 pink kernel: ioatdma 0000:00:16.1: PCI INT B -> GSI 17 (level, low) -> IRQ 17
Oct 18 12:45:55 pink kernel: ioatdma 0000:00:16.2: PCI INT C -> GSI 18 (level, low) -> IRQ 18
Oct 18 12:45:55 pink kernel: ioatdma 0000:00:16.3: PCI INT D -> GSI 19 (level, low) -> IRQ 19
Oct 18 12:45:55 pink kernel: ioatdma 0000:00:16.4: PCI INT A -> GSI 16 (level, low) -> IRQ 16
Oct 18 12:45:55 pink kernel: ioatdma 0000:00:16.5: PCI INT B -> GSI 17 (level, low) -> IRQ 17
Oct 18 12:45:55 pink kernel: ioatdma 0000:00:16.6: PCI INT C -> GSI 18 (level, low) -> IRQ 18
Oct 18 12:45:55 pink kernel: ioatdma 0000:00:16.7: PCI INT D -> GSI 19 (level, low) -> IRQ 19
Oct 18 12:45:55 pink kernel: iTCO_vendor_support: vendor-support=0
Oct 18 12:45:55 pink kernel: iTCO_wdt: Intel TCO WatchDog Timer Driver v1.05
Oct 18 12:45:55 pink kernel: iTCO_wdt: Found a ICH10R TCO device (Version=2, TCOBASE=0x0860)
Oct 18 12:45:55 pink kernel: iTCO_wdt: initialized. heartbeat=30 sec (nowayout=0)
Oct 18 12:45:55 pink kernel: i801_smbus 0000:00:1f.3: PCI INT C -> GSI 18 (level, low) -> IRQ 18
Oct 18 12:45:55 pink kernel: sd 0:0:0:0: Attached scsi generic sg0 type 0
Oct 18 12:45:55 pink kernel: input: PC Speaker as /devices/platform/pcspkr/input/input3
Oct 18 12:45:55 pink kernel: EXT4-fs (sda1): mounted filesystem with ordered data mode
Oct 18 12:45:55 pink kernel: Adding 16932856k swap on /dev/sda2.  Priority:-1 extents:1 across:16932856k
Oct 18 12:45:55 pink kernel: NET: Registered protocol family 10
Oct 18 12:45:55 pink kernel: lo: Disabled Privacy Extensions
Oct 18 12:45:55 pink kernel: ip6_tables: (C) 2000-2006 Netfilter Core Team
Oct 18 12:45:55 pink kernel: ip_tables: (C) 2000-2006 Netfilter Core Team
Oct 18 12:45:55 pink kernel: ADDRCONF(NETDEV_UP): eth0: link is not ready
Oct 18 12:45:55 pink kernel: e1000e: eth0 NIC Link is Up 100 Mbps Full Duplex, Flow Control: None
Oct 18 12:45:55 pink kernel: e1000e 0000:01:00.0: eth0: 10/100 speed: disabling TSO
Oct 18 12:45:55 pink kernel: ADDRCONF(NETDEV_CHANGE): eth0: link becomes ready
Oct 18 12:45:55 pink kernel: pcieport 0000:00:01.0: AER: Multiple Corrected error received: id=0000
Oct 18 12:45:55 pink kernel: e1000e 0000:01:00.0: PCIe Bus Error: severity=Corrected, type=Physical Layer, id=0100(Receiver ID)
Oct 18 12:45:55 pink kernel: e1000e 0000:01:00.0:   device [8086:10d3] error status/mask=00000041/00002000
Oct 18 12:45:55 pink kernel: e1000e 0000:01:00.0:    [ 0] Receiver Error         (First)
Oct 18 12:45:55 pink kernel: e1000e 0000:01:00.0:    [ 6] Bad TLP
Oct 18 12:45:55 pink kdump: No crashkernel parameter specified for running kernel

Comment 2 Vasily Averin 2011-11-11 07:02:28 UTC
use "pcie_aspm=off" in kernel commandline to workaround this issue.

Comment 3 evcz 2011-11-11 14:02:53 UTC
Thanks!
that fixed it.

At this point I suppose it was something in the bios settings as I got multiple other boxes with the same motherboard and none of them were showing this issue.

Thank you again :)


Note You need to log in before you can comment on or make changes to this bug.