Version-Release number of selected component:
kernel-2.6.18-53.el5

Drivers or hardware or architecture dependency:
ia64 only

Description of Problem:
The CMCI (Corrected Machine Check Interrupt) is one of the mechanisms that make up the ia64 Machine Check Architecture. The processor raises this interrupt when it detects a minor error in itself (e.g. a cache error) and corrects it successfully. CMCI is therefore a necessary feature for hardware error prediction.

At boot time, CMCI is masked (disabled) on all processors, and later it is unmasked (enabled) on all of them at once. The issue is that CMCI is also masked on hot-added processors, but nothing ever unmasks it.

How reproducible:
Always

Steps to Reproduce:
Do a CPU hot-add (including onlining a CPU after offlining it).

Actual Results:
CMCI is disabled on the onlined processor.

Expected Results:
CMCI is enabled on the onlined processor.

Summary of actions taken to resolve issue:
A patch is available and has already been posted upstream.
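The masking sequence in the description can be illustrated with a toy model. This is only a sketch in Python, not kernel code and not the attached patch: the `Cpu` and `System` classes, the `apply_fix` flag, and the rule that a newly onlined CPU unmasks its own CMCI once CMCI has been enabled system-wide are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class Cpu:
    cpu_id: int
    online: bool = False
    cmci_masked: bool = True  # assumption: a CPU always comes up with CMCI masked


class System:
    """Toy model of per-CPU CMCI masking (hypothetical, for illustration only)."""

    def __init__(self, boot_cpus):
        self.cpus = {}
        self.cmci_globally_enabled = False
        for cpu_id in boot_cpus:
            self.bring_online(cpu_id, apply_fix=False)

    def bring_online(self, cpu_id, apply_fix):
        cpu = self.cpus.setdefault(cpu_id, Cpu(cpu_id))
        cpu.online = True
        cpu.cmci_masked = True  # per-CPU init masks CMCI, as at boot
        # Assumed fix: unmask on the onlined CPU if the one-time
        # "unmask all" step has already run.
        if apply_fix and self.cmci_globally_enabled:
            cpu.cmci_masked = False

    def take_offline(self, cpu_id):
        self.cpus[cpu_id].online = False

    def late_init_unmask_all(self):
        # Runs once after boot: unmask CMCI on every CPU online at that moment.
        self.cmci_globally_enabled = True
        for cpu in self.cpus.values():
            if cpu.online:
                cpu.cmci_masked = False

    def masked_online_cpus(self):
        return sorted(c.cpu_id for c in self.cpus.values()
                      if c.online and c.cmci_masked)


def simulate(apply_fix):
    system = System(boot_cpus=[0, 1])
    system.late_init_unmask_all()
    system.bring_online(2, apply_fix)   # hot-add a new CPU
    system.take_offline(1)
    system.bring_online(1, apply_fix)   # online after offline
    return system.masked_online_cpus()


print(simulate(apply_fix=False))  # [1, 2]
print(simulate(apply_fix=True))   # []
```

Without the assumed fix, both the hot-added CPU and the re-onlined CPU stay masked, which matches the "Actual Results" above; with it, no online CPU is left masked.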
Created attachment 290406 [details] Upstream patch ported to recent RHEL-5 kernel
Created attachment 290407 [details] Log of test session by partner
Flagging for 5.3. Or do we want to fight for an exception for 5.2?
Posted to RHKL. The patch still applies and builds cleanly against 2.6.18-71.el5. Current build information is here: http://brewweb.devel.redhat.com/brew/taskinfo?taskID=1124031
This request was evaluated by Red Hat Product Management for inclusion in a Red Hat Enterprise Linux maintenance release. Product Management has requested further review of this request by Red Hat Engineering, for potential inclusion in a Red Hat Enterprise Linux Update release for currently deployed products. This request is not yet committed for inclusion in an Update release.
From Fujitsu, about testing with kernel-2.6.18-71.el5.it142698.2.ia64.rpm on C#6:
---------------------------------------------------------
I tested the uploaded test kernel and confirmed that the included fix works fine.

- cmc-mask-log-080122.txt: result of today's test.

Thanks,

This event sent from IssueTracker by mmatsuya issue 142698
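For anyone repeating the "online after offline" reproduction step, the cycle can be driven through the standard sysfs CPU hotplug files. The sketch below is an assumption about how one might script it in Python, not the partner's actual test procedure; the helper names are hypothetical, and the `root` parameter exists only so the functions can be pointed at a scratch directory. On a real system the writes need root privileges, and checking whether CMCI is actually unmasked afterwards is outside the scope of this sketch.

```python
from pathlib import Path

# Standard sysfs location of the per-CPU hotplug control files.
SYSFS_CPU_ROOT = Path("/sys/devices/system/cpu")


def set_cpu_online(cpu_id, online, root=SYSFS_CPU_ROOT):
    """Write 1 (online) or 0 (offline) to cpu<N>/online."""
    control = Path(root) / f"cpu{cpu_id}" / "online"
    control.write_text("1" if online else "0")


def is_cpu_online(cpu_id, root=SYSFS_CPU_ROOT):
    """Read back cpu<N>/online and report whether it is 1."""
    control = Path(root) / f"cpu{cpu_id}" / "online"
    return control.read_text().strip() == "1"


def cycle_cpu(cpu_id, root=SYSFS_CPU_ROOT):
    """Offline and then online one CPU; return True if it ends up online."""
    set_cpu_online(cpu_id, False, root)
    set_cpu_online(cpu_id, True, root)
    return is_cpu_online(cpu_id, root)
```

Calling `cycle_cpu(1)` as root would take CPU 1 offline and bring it back, which is the point at which the unfixed kernel leaves CMCI masked.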
Created attachment 292457 [details] positive result log from Fujitsu on C#10
in 2.6.18-74.el5

You can download this test kernel from http://people.redhat.com/dzickus/el5
Added to RHEL5.2 release notes under "Kernel-Related Updates":

<quote>
(ia64) Added support for Corrected Machine Check Interrupt (CMCI). This feature issues an interrupt when the processor detects and successfully rectifies trivial errors (such as a cache error).
</quote>

Please advise if any further revisions are required. Thanks!
This note is wrong, since CMCI itself has been supported since RHEL4 (and in RHEL5.0 as well). What was not supported was CMCI on hot-added CPUs.

Thanks,
H.Seto

This event sent from IssueTracker by mmatsuya issue 142698
release note corrected. thanks!
Hi,

The RHEL5.2 release notes will be dropped to translation on April 15, 2008, at which point no further additions or revisions will be entertained.

A mockup of the RHEL5.2 release notes can be viewed at the following link: http://intranet.corp.redhat.com/ic/intranet/RHEL5u2relnotesmockup.html

Please use the aforementioned link to verify whether your bugzilla is already in the release notes (if it needs to be). Each item in the release notes contains a link to its original bug; as such, you can search through the release notes by bug number.

Cheers,
Don
An advisory has been issued which should help the problem described in this bug report. This report is therefore being closed with a resolution of ERRATA. For more information on the solution and/or where to find the updated files, please follow the link below. You may reopen this bug report if the solution does not work for you. http://rhn.redhat.com/errata/RHBA-2008-0314.html