Bug 580836 - EDAC driver error on system with bad memory [rhel-5.5.z]
Summary: EDAC driver error on system with bad memory [rhel-5.5.z]
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Enterprise Linux 5
Classification: Red Hat
Component: kernel
Version: 5.6
Hardware: All
OS: Linux
Priority: urgent
Severity: high
Target Milestone: rc
Target Release: ---
Assignee: Jiri Pirko
QA Contact: Red Hat Kernel QE team
URL:
Whiteboard:
Depends On: 569938
Blocks: 590691
 
Reported: 2010-04-09 08:06 UTC by RHEL Program Management
Modified: 2018-11-30 20:37 UTC

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Clone Of:
: 590691
Environment:
Last Closed: 2010-05-06 18:49:57 UTC
Target Upstream Version:
Embargoed:


Links
System | ID | Private | Priority | Status | Summary | Last Updated
Red Hat Product Errata | RHSA-2010:0398 | 0 | normal | SHIPPED_LIVE | Important: kernel security and bug fix update | 2010-05-06 18:49:33 UTC

Description RHEL Program Management 2010-04-09 08:06:07 UTC
This bug has been copied from bug #569938 and has been proposed
to be backported to 5.5 z-stream (EUS).

Comment 4 Jiri Pirko 2010-04-13 20:55:28 UTC
in 2.6.18-194.1.1.el5

Comment 9 errata-xmlrpc 2010-05-06 18:49:57 UTC
An advisory has been issued which should help the problem
described in this bug report. This report is therefore being
closed with a resolution of ERRATA. For more information
on the solution and/or where to find the updated files,
please follow the link below. You may reopen this bug report
if the solution does not work for you.

http://rhn.redhat.com/errata/RHSA-2010-0398.html

Comment 10 Dave Love 2010-06-24 10:25:39 UTC
I can't see the original bug report, but the problem reported in the erratum
appears not to be fixed in the 2.6.18-194.3.1.el5 kernel, like this (one of
many reports on multiple Barcelona nodes in our cluster):

Jun 24 00:32:39 lvgig025 kernel: Northbridge Error, node 1, core: -1
Jun 24 00:32:39 lvgig025 kernel: K8 ECC error.
Jun 24 00:32:39 lvgig025 kernel: EDAC amd64 MC1: CE ERROR_ADDRESS= 0x375a77cc0
Jun 24 00:32:39 lvgig025 kernel: EDAC MC1: CE page 0x375a77, offset 0xcc0, grain 0, syndrome 0x4951, row 7, channel 0, label "": amd64_edac
Jun 24 00:32:39 lvgig025 kernel: EDAC MC1: CE - no information available: amd64_edacError Overflow

Comment 13 Paul Lowrie 2014-04-16 20:36:04 UTC
The problem is not fixed with 2.6.18-274.7.1.el5 either.
Linux racdbmc1ldv 2.6.18-274.7.1.el5 #1 SMP Mon Oct 17 11:57:14 EDT 2011 x86_64 x86_64 x86_64 GNU/Linux


Apr 15 16:05:54 racdbmc1ldv kernel: EDAC amd64 MC0: CE ERROR_ADDRESS= 0x404b9f3a0
Apr 15 16:05:54 racdbmc1ldv kernel: EDAC MC0: CE page 0x404b9f, offset 0x3a0, grain 0, syndrome 0x2b8, row 4, channel 0, label "": amd64_edac
Apr 15 16:05:58 racdbmc1ldv kernel: EDAC amd64 MC0: CE ERROR_ADDRESS= 0x28ea0dd40
Apr 15 16:05:58 racdbmc1ldv kernel: EDAC MC0: CE page 0x28ea0d, offset 0xd40, grain 0, syndrome 0x2b8, row 4, channel 0, label "": amd64_edac
Apr 15 16:06:04 racdbmc1ldv kernel: EDAC amd64 MC0: CE ERROR_ADDRESS= 0x40382bfa0
Apr 15 16:06:04 racdbmc1ldv kernel: EDAC MC0: CE page 0x40382b, offset 0xfa0, grain 0, syndrome 0x2b8, row 4, channel 0, label "": amd64_edac
Apr 15 16:06:18 racdbmc1ldv kernel: EDAC amd64 MC0: CE ERROR_ADDRESS= 0x212788140
Apr 15 16:06:18 racdbmc1ldv kernel: EDAC MC0: CE page 0x212788, offset 0x140, grain 0, syndrome 0x2b8, row 4, channel 0, label "": amd64_edac
Apr 15 16:06:34 racdbmc1ldv kernel: EDAC amd64 MC0: CE ERROR_ADDRESS= 0x2d222c6e0
Apr 15 16:06:34 racdbmc1ldv kernel: EDAC MC0: CE page 0x2d222c, offset 0x6e0, grain 0, syndrome 0x2b8, row 4, channel 0, label "": amd64_edac
Apr 15 16:07:34 racdbmc1ldv kernel: EDAC amd64 MC0: CE ERROR_ADDRESS= 0x3d491c8f0
Apr 15 16:07:34 racdbmc1ldv kernel: EDAC MC0: CE page 0x3d491c, offset 0x8f0, grain 0, syndrome 0x2b8, row 4, channel 0, label "": amd64_edac
Apr 15 16:07:46 racdbmc1ldv kernel: EDAC amd64 MC0: CE ERROR_ADDRESS= 0x330d2d000
Apr 15 16:07:46 racdbmc1ldv kernel: EDAC MC0: CE page 0x330d2d, offset 0x0, grain 0, syndrome 0x2b8, row 4, channel 0, label "": amd64_edac
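
For anyone triaging logs like the ones above, here is a minimal Python sketch that splits an ERROR_ADDRESS into the page and offset shown on the following line and counts corrected errors per memory controller, row, channel and syndrome. It assumes 4 KiB pages, and the names split_address, summarize and SAMPLE are hypothetical, not part of the EDAC driver or any Red Hat tool. The regular expression only covers the "CE page ..." message format quoted in this report.

```python
import re
from collections import Counter

PAGE_SHIFT = 12  # assumption: 4 KiB pages, as the page/offset pairs in the logs suggest

# Matches the "EDAC MCn: CE page ..." lines quoted in this bug report.
CE_INFO = re.compile(
    r"EDAC (MC\d+): CE page (0x[0-9a-f]+), offset (0x[0-9a-f]+), "
    r"grain \d+, syndrome (0x[0-9a-f]+), row (\d+), channel (\d+)"
)


def split_address(addr):
    """Split a physical error address into (page, offset within page)."""
    return addr >> PAGE_SHIFT, addr & ((1 << PAGE_SHIFT) - 1)


def summarize(lines):
    """Count corrected errors per (controller, row, channel, syndrome)."""
    counts = Counter()
    for line in lines:
        match = CE_INFO.search(line)
        if match:
            mc, _page, _offset, syndrome, row, channel = match.groups()
            counts[(mc, int(row), int(channel), syndrome)] += 1
    return counts


# Two message pairs copied from the log excerpt above.
SAMPLE = [
    'Apr 15 16:05:54 racdbmc1ldv kernel: EDAC amd64 MC0: CE ERROR_ADDRESS= 0x404b9f3a0',
    'Apr 15 16:05:54 racdbmc1ldv kernel: EDAC MC0: CE page 0x404b9f, offset 0x3a0, '
    'grain 0, syndrome 0x2b8, row 4, channel 0, label "": amd64_edac',
    'Apr 15 16:05:58 racdbmc1ldv kernel: EDAC amd64 MC0: CE ERROR_ADDRESS= 0x28ea0dd40',
    'Apr 15 16:05:58 racdbmc1ldv kernel: EDAC MC0: CE page 0x28ea0d, offset 0xd40, '
    'grain 0, syndrome 0x2b8, row 4, channel 0, label "": amd64_edac',
]

if __name__ == "__main__":
    page, offset = split_address(0x404b9f3a0)
    print(hex(page), hex(offset))
    for key, count in summarize(SAMPLE).items():
        print(key, count)
```

Every entry in the excerpt above reports row 4, channel 0 and syndrome 0x2b8 on MC0, so a tally like this would put them all under one key. That pattern is consistent with a single faulty DIMM, but the logs alone do not prove it.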

Comment 14 Jan Gerrit Kootstra 2014-12-02 16:06:10 UTC
Kernel 2.6.18-398.el5 on RHEL 5.11 shows the same kind of messages.

