Bug 1352308 - mcelog: Family 6 Model 5e CPU: only decoding architectural errors
Summary: mcelog: Family 6 Model 5e CPU: only decoding architectural errors
Keywords:
Status: CLOSED EOL
Alias: None
Product: Fedora
Classification: Fedora
Component: mcelog
Version: 24
Hardware: x86_64
OS: All
Priority: unspecified
Severity: unspecified
Target Milestone: ---
Assignee: Prarit Bhargava
QA Contact: Fedora Extras Quality Assurance
URL:
Whiteboard:
Depends On:
Blocks:
Reported: 2016-07-03 15:18 UTC by Yevgeny Zaspitsky
Modified: 2017-08-08 15:20 UTC

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of: 712220
Environment:
Last Closed: 2017-08-08 15:20:01 UTC
Type: ---
Embargoed:


Description Yevgeny Zaspitsky 2016-07-03 15:18:04 UTC
Description of problem:
/var/log/messages reports:
   mcelog: mcelog: Family 6 Model 5e CPU: only decoding architectural errors
   mcelog: Hardware event. This is not a software error.

Version-Release number of selected component (if applicable):
mcelog-119-2.fc24.x86_64

How reproducible:
Always. A batch of mcelog messages appears every time the computer runs a CPU-intensive operation.

Steps to Reproduce:
1. Install Fedora 24 (kernel-4.6.3-300.fc24.x86_64) on a machine with an Intel(R) Core(TM) i7-6820HQ CPU @ 2.70GHz.
2. Run a CPU-intensive operation and watch /var/log/messages.

Actual results:
mcelog logs "Family 6 Model 5e CPU: only decoding architectural errors" with every hardware event, i.e. it treats this CPU model as unsupported.

Expected results:
No such message: mcelog should recognize the Family 6 Model 5e CPU.

Additional info:
* From /proc/cpuinfo:
---------------------
processor       : 0
vendor_id       : GenuineIntel
cpu family      : 6
model           : 94
model name      : Intel(R) Core(TM) i7-6820HQ CPU @ 2.70GHz
stepping        : 3
microcode       : 0x8a
cpu MHz         : 799.980
cache size      : 8192 KB
physical id     : 0
siblings        : 8
core id         : 0
cpu cores       : 4
apicid          : 0
initial apicid  : 0
fpu             : yes
fpu_exception   : yes
cpuid level     : 22
wp              : yes
flags           : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc art arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc aperfmperf eagerfpu pni pclmulqdq dtes64 monitor ds_cpl vmx smx est tm2 ssse3 sdbg fma cx16 xtpr pdcm pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm abm 3dnowprefetch epb intel_pt tpr_shadow vnmi flexpriority ept vpid fsgsbase tsc_adjust bmi1 hle avx2 smep bmi2 erms invpcid rtm mpx rdseed adx smap clflushopt xsaveopt xsavec xgetbv1 dtherm ida arat pln pts hwp hwp_notify hwp_act_window hwp_epp
bugs            :
bogomips        : 5424.49
clflush size    : 64
cache_alignment : 64
address sizes   : 39 bits physical, 48 bits virtual
power management:
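
Note that /proc/cpuinfo prints the model number in decimal (94), while the mcelog message prints it in hexadecimal (5e); both refer to the same CPU model. A minimal Python sketch of that conversion follows. The helper name mcelog_model_label is hypothetical, and formatting the family as decimal is an assumption (for family 6 it makes no difference).

```python
import re

def mcelog_model_label(cpuinfo_text):
    """Build a 'Family F Model xx' label for the first CPU in cpuinfo_text.

    Hypothetical helper: the model is rendered in hex to match the
    mcelog message quoted in this report.
    """
    family = int(re.search(r"^cpu family\s*:\s*(\d+)", cpuinfo_text, re.M).group(1))
    # "model name" does not match here because a colon must follow "model".
    model = int(re.search(r"^model\s*:\s*(\d+)", cpuinfo_text, re.M).group(1))
    return f"Family {family} Model {model:x}"

sample = """cpu family      : 6
model           : 94
model name      : Intel(R) Core(TM) i7-6820HQ CPU @ 2.70GHz
"""
print(mcelog_model_label(sample))  # prints "Family 6 Model 5e"
```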

* A typical block of mcelog messages in the journal:
----------------------------------------------------
Jul 03 16:25:09 myhostname mcelog[1085]: mcelog: Family 6 Model 5e CPU: only decoding architectural errors
Jul 03 16:25:09 myhostname mcelog[1085]: Hardware event. This is not a software error.
Jul 03 16:25:09 myhostname mcelog[1085]: MCE 0
Jul 03 16:25:09 myhostname mcelog[1085]: CPU 5 THERMAL EVENT TSC 2b38a49396c2
Jul 03 16:25:09 myhostname mcelog[1085]: TIME 1467552309 Sun Jul  3 16:25:09 2016
Jul 03 16:25:09 myhostname mcelog[1085]: Processor 5 heated above trip temperature. Throttling enabled.
Jul 03 16:25:09 myhostname mcelog[1085]: Please check your system cooling. Performance will be impacted
Jul 03 16:25:09 myhostname mcelog[1085]: STATUS 88030a83 MCGSTATUS 0
Jul 03 16:25:09 myhostname mcelog[1085]: MCGCAP c0a APICID 3 SOCKETID 0
Jul 03 16:25:09 myhostname mcelog[1085]: CPUID Vendor Intel Family 6 Model 94
Jul 03 16:25:09 myhostname mcelog[1085]: mcelog: Family 6 Model 5e CPU: only decoding architectural errors
Jul 03 16:25:09 myhostname mcelog[1085]: Hardware event. This is not a software error.
Jul 03 16:25:09 myhostname mcelog[1085]: MCE 1
Jul 03 16:25:09 myhostname mcelog[1085]: CPU 1 THERMAL EVENT TSC 2b38a493bf64
Jul 03 16:25:09 myhostname mcelog[1085]: TIME 1467552309 Sun Jul  3 16:25:09 2016
Jul 03 16:25:09 myhostname mcelog[1085]: Processor 1 heated above trip temperature. Throttling enabled.
Jul 03 16:25:09 myhostname mcelog[1085]: Please check your system cooling. Performance will be impacted
Jul 03 16:25:09 myhostname mcelog[1085]: STATUS 88030a83 MCGSTATUS 0
Jul 03 16:25:09 myhostname mcelog[1085]: MCGCAP c0a APICID 2 SOCKETID 0
Jul 03 16:25:09 myhostname mcelog[1085]: CPUID Vendor Intel Family 6 Model 94
Jul 03 16:25:09 myhostname mcelog[1085]: mcelog: Family 6 Model 5e CPU: only decoding architectural errors
Jul 03 16:25:09 myhostname mcelog[1085]: Hardware event. This is not a software error.
Jul 03 16:25:09 myhostname mcelog[1085]: MCE 0
Jul 03 16:25:09 myhostname mcelog[1085]: CPU 5 THERMAL EVENT TSC 2b38a4bdbddf
Jul 03 16:25:09 myhostname mcelog[1085]: TIME 1467552309 Sun Jul  3 16:25:09 2016
Jul 03 16:25:09 myhostname mcelog[1085]: Processor 5 below trip temperature. Throttling disabled
Jul 03 16:25:09 myhostname mcelog[1085]: STATUS 88040a82 MCGSTATUS 0
Jul 03 16:25:09 myhostname mcelog[1085]: MCGCAP c0a APICID 3 SOCKETID 0
Jul 03 16:25:09 myhostname mcelog[1085]: CPUID Vendor Intel Family 6 Model 94
Jul 03 16:25:09 myhostname mcelog[1085]: mcelog: Family 6 Model 5e CPU: only decoding architectural errors
Jul 03 16:25:09 myhostname mcelog[1085]: Hardware event. This is not a software error.
Jul 03 16:25:09 myhostname mcelog[1085]: MCE 1
Jul 03 16:25:09 myhostname mcelog[1085]: CPU 1 THERMAL EVENT TSC 2b38a4bdc37a
Jul 03 16:25:09 myhostname mcelog[1085]: TIME 1467552309 Sun Jul  3 16:25:09 2016
Jul 03 16:25:09 myhostname mcelog[1085]: Processor 1 below trip temperature. Throttling disabled
Jul 03 16:25:09 myhostname mcelog[1085]: STATUS 88040a82 MCGSTATUS 0
Jul 03 16:25:09 myhostname mcelog[1085]: MCGCAP c0a APICID 2 SOCKETID 0
Jul 03 16:25:09 myhostname mcelog[1085]: CPUID Vendor Intel Family 6 Model 94
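
The two STATUS values in the thermal events above can be read by hand. The sketch below assumes that, for THERMAL EVENT records, the STATUS field carries the raw IA32_THERM_STATUS register (bit 0: thermal status, bit 1: sticky log, bits 22:16: digital readout in degrees below TjMax, bit 31: reading valid). That mapping is an assumption, not something stated in this report, and the function name is hypothetical.

```python
def decode_therm_status(status):
    """Split a 32-bit value using the IA32_THERM_STATUS bit layout (assumed)."""
    return {
        "throttling_active": bool(status & (1 << 0)),  # bit 0: thermal status
        "throttling_logged": bool(status & (1 << 1)),  # bit 1: sticky log bit
        "degrees_below_tjmax": (status >> 16) & 0x7F,  # bits 22:16: digital readout
        "reading_valid": bool(status & (1 << 31)),     # bit 31: readout is valid
    }

# STATUS values copied from the journal excerpt above.
for raw in ("88030a83", "88040a82"):
    print(raw, decode_therm_status(int(raw, 16)))
```

Under that assumption, 88030a83 decodes as throttling active with a readout of 3, and 88040a82 as throttling inactive with a readout of 4, which is consistent with the "heated above trip temperature" and "below trip temperature" lines.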

Comment 1 Jasper Siero 2017-04-21 10:11:59 UTC
We are having the same problems on our laptops:

cat /proc/cpuinfo
processor    : 0
vendor_id    : GenuineIntel
cpu family    : 6
model        : 142
model name    : Intel(R) Core(TM) i5-7200U CPU @ 2.50GHz
stepping    : 9
microcode    : 0x30
cpu MHz        : 499.987
cache size    : 3072 KB
physical id    : 0
siblings    : 4
core id        : 0
cpu cores    : 2
apicid        : 0
initial apicid    : 0
fpu        : yes
fpu_exception    : yes
cpuid level    : 22
wp        : yes
flags        : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc art arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc aperfmperf eagerfpu pni pclmulqdq dtes64 monitor ds_cpl vmx est tm2 ssse3 sdbg fma cx16 xtpr pdcm pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm abm 3dnowprefetch epb intel_pt tpr_shadow vnmi flexpriority ept vpid fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx rdseed adx smap clflushopt xsaveopt xsavec xgetbv1 xsaves dtherm ida arat pln pts hwp hwp_notify hwp_act_window hwp_epp
bugs        :
bogomips    : 5424.00
clflush size    : 64
cache_alignment    : 64
address sizes    : 39 bits physical, 48 bits virtual
power management:

processor    : 1
vendor_id    : GenuineIntel
cpu family    : 6
model        : 142
model name    : Intel(R) Core(TM) i5-7200U CPU @ 2.50GHz
stepping    : 9
microcode    : 0x30
cpu MHz        : 499.987
cache size    : 3072 KB
physical id    : 0
siblings    : 4
core id        : 1
cpu cores    : 2
apicid        : 2
initial apicid    : 2
fpu        : yes
fpu_exception    : yes
cpuid level    : 22
wp        : yes
flags        : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc art arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc aperfmperf eagerfpu pni pclmulqdq dtes64 monitor ds_cpl vmx est tm2 ssse3 sdbg fma cx16 xtpr pdcm pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm abm 3dnowprefetch epb intel_pt tpr_shadow vnmi flexpriority ept vpid fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx rdseed adx smap clflushopt xsaveopt xsavec xgetbv1 xsaves dtherm ida arat pln pts hwp hwp_notify hwp_act_window hwp_epp
bugs        :
bogomips    : 5428.17
clflush size    : 64
cache_alignment    : 64
address sizes    : 39 bits physical, 48 bits virtual
power management:

processor    : 2
vendor_id    : GenuineIntel
cpu family    : 6
model        : 142
model name    : Intel(R) Core(TM) i5-7200U CPU @ 2.50GHz
stepping    : 9
microcode    : 0x30
cpu MHz        : 499.987
cache size    : 3072 KB
physical id    : 0
siblings    : 4
core id        : 0
cpu cores    : 2
apicid        : 1
initial apicid    : 1
fpu        : yes
fpu_exception    : yes
cpuid level    : 22
wp        : yes
flags        : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc art arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc aperfmperf eagerfpu pni pclmulqdq dtes64 monitor ds_cpl vmx est tm2 ssse3 sdbg fma cx16 xtpr pdcm pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm abm 3dnowprefetch epb intel_pt tpr_shadow vnmi flexpriority ept vpid fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx rdseed adx smap clflushopt xsaveopt xsavec xgetbv1 xsaves dtherm ida arat pln pts hwp hwp_notify hwp_act_window hwp_epp
bugs        :
bogomips    : 5430.21
clflush size    : 64
cache_alignment    : 64
address sizes    : 39 bits physical, 48 bits virtual
power management:

processor    : 3
vendor_id    : GenuineIntel
cpu family    : 6
model        : 142
model name    : Intel(R) Core(TM) i5-7200U CPU @ 2.50GHz
stepping    : 9
microcode    : 0x30
cpu MHz        : 499.987
cache size    : 3072 KB
physical id    : 0
siblings    : 4
core id        : 1
cpu cores    : 2
apicid        : 3
initial apicid    : 3
fpu        : yes
fpu_exception    : yes
cpuid level    : 22
wp        : yes
flags        : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc art arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc aperfmperf eagerfpu pni pclmulqdq dtes64 monitor ds_cpl vmx est tm2 ssse3 sdbg fma cx16 xtpr pdcm pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm abm 3dnowprefetch epb intel_pt tpr_shadow vnmi flexpriority ept vpid fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx rdseed adx smap clflushopt xsaveopt xsavec xgetbv1 xsaves dtherm ida arat pln pts hwp hwp_notify hwp_act_window hwp_epp
bugs        :
bogomips    : 5428.53
clflush size    : 64
cache_alignment    : 64
address sizes    : 39 bits physical, 48 bits virtual
power management:







Apr 21 11:47:38 localhost.localdomain mcelog[1070]: mcelog: Family 6 Model 8e CPU: only decoding architectural errors
Apr 21 11:47:38 localhost.localdomain mcelog[1070]: mcelog: Family 6 Model 8e CPU: only decoding architectural errors
Apr 21 11:47:38 localhost.localdomain mcelog[1070]: Hardware event. This is not a software error.
Apr 21 11:47:38 localhost.localdomain mcelog[1070]: MCE 0
Apr 21 11:47:38 localhost.localdomain mcelog[1070]: CPU 0 BANK 6
Apr 21 11:47:38 localhost.localdomain mcelog[1070]: MISC 3880020086 ADDR fef1cf00
Apr 21 11:47:38 localhost.localdomain mcelog[1070]: TIME 1492768023 Fri Apr 21 11:47:03 2017
Apr 21 11:47:38 localhost.localdomain mcelog[1070]: MCG status:
Apr 21 11:47:38 localhost.localdomain mcelog[1070]: MCi status:
Apr 21 11:47:38 localhost.localdomain mcelog[1070]: Error overflow
Apr 21 11:47:38 localhost.localdomain mcelog[1070]: Uncorrected error
Apr 21 11:47:38 localhost.localdomain mcelog[1070]: MCi_MISC register valid
Apr 21 11:47:38 localhost.localdomain mcelog[1070]: MCi_ADDR register valid
Apr 21 11:47:38 localhost.localdomain mcelog[1070]: Processor context corrupt
Apr 21 11:47:38 localhost.localdomain mcelog[1070]: MCA: corrected filtering (some unreported errors in same region)
Apr 21 11:47:38 localhost.localdomain mcelog[1070]: Generic CACHE Level-2 Generic Error
Apr 21 11:47:38 localhost.localdomain mcelog[1070]: STATUS ee2000000040110a MCGSTATUS 0
Apr 21 11:47:38 localhost.localdomain mcelog[1070]: MCGCAP c08 APICID 0 SOCKETID 0
Apr 21 11:47:38 localhost.localdomain mcelog[1070]: CPUID Vendor Intel Family 6 Model 142
Apr 21 11:47:38 localhost.localdomain mcelog[1070]: mcelog: Family 6 Model 8e CPU: only decoding architectural errors
Apr 21 11:47:38 localhost.localdomain mcelog[1070]: Hardware event. This is not a software error.
Apr 21 11:47:38 localhost.localdomain mcelog[1070]: MCE 1
Apr 21 11:47:38 localhost.localdomain mcelog[1070]: CPU 0 BANK 7
Apr 21 11:47:38 localhost.localdomain mcelog[1070]: MISC 43880020086 ADDR fef1ff00
Apr 21 11:47:38 localhost.localdomain mcelog[1070]: TIME 1492768023 Fri Apr 21 11:47:03 2017
Apr 21 11:47:38 localhost.localdomain mcelog[1070]: MCG status:
Apr 21 11:47:38 localhost.localdomain mcelog[1070]: MCi status:
Apr 21 11:47:38 localhost.localdomain mcelog[1070]: Error overflow
Apr 21 11:47:38 localhost.localdomain mcelog[1070]: Uncorrected error
Apr 21 11:47:38 localhost.localdomain mcelog[1070]: MCi_MISC register valid
Apr 21 11:47:38 localhost.localdomain mcelog[1070]: MCi_ADDR register valid
Apr 21 11:47:38 localhost.localdomain mcelog[1070]: Processor context corrupt
Apr 21 11:47:38 localhost.localdomain mcelog[1070]: MCA: corrected filtering (some unreported errors in same region)
Apr 21 11:47:38 localhost.localdomain mcelog[1070]: Generic CACHE Level-2 Generic Error
Apr 21 11:47:38 localhost.localdomain mcelog[1070]: STATUS ee2000000040110a MCGSTATUS 0
Apr 21 11:47:38 localhost.localdomain mcelog[1070]: MCGCAP c08 APICID 0 SOCKETID 0
Apr 21 11:47:38 localhost.localdomain mcelog[1070]: CPUID Vendor Intel Family 6 Model 142
Apr 21 11:47:38 localhost.localdomain mcelog[1070]: mcelog: warning: 16 bytes ignored in each record
Apr 21 11:47:38 localhost.localdomain mcelog[1070]: mcelog: consider an update
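
For reference, the decoded lines above can be reproduced from STATUS ee2000000040110a. The Python sketch below uses the architectural IA32_MCi_STATUS layout (flag bits 63 to 57, MCA error code in bits 15:0, model-specific code in bits 31:16) and the compound cache-hierarchy code form 000F 0001 RRRR TTLL. The function names are hypothetical, and this is an illustration, not mcelog's own decoder.

```python
# Architectural flag bits of IA32_MCi_STATUS, highest first.
FLAGS = [(63, "VAL"), (62, "OVER"), (61, "UC"), (60, "EN"),
         (59, "MISCV"), (58, "ADDRV"), (57, "PCC")]

def decode_mci_status(status):
    """Return (set flag names, MCA error code, model-specific error code)."""
    flags = [name for bit, name in FLAGS if (status >> bit) & 1]
    return flags, status & 0xFFFF, (status >> 16) & 0xFFFF

def decode_cache_code(mca_code):
    """Decode a cache-hierarchy code (000F 0001 RRRR TTLL); None if another form."""
    if (mca_code & 0xEF00) != 0x0100:  # ignore the F bit (bit 12) when matching
        return None
    return {
        "filtered": bool(mca_code & (1 << 12)),  # corrected-error filtering bit
        "request": (mca_code >> 4) & 0xF,        # RRRR, 0 = generic error
        "transaction": ["instruction", "data", "generic", "reserved"][(mca_code >> 2) & 3],
        "level": ["L0", "L1", "L2", "generic"][mca_code & 3],
    }

flags, mca_code, model_code = decode_mci_status(0xEE2000000040110A)
print(flags, hex(mca_code), hex(model_code))
print(decode_cache_code(mca_code))
```

The flags come out as VAL, OVER, UC, MISCV, ADDRV and PCC, and the MCA code 0x110a decodes as a filtered, generic, level-2 cache error, matching the "Error overflow", "Uncorrected error", "Processor context corrupt" and "Generic CACHE Level-2 Generic Error" lines.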



After upgrading one laptop to Fedora 25, the problem still exists.

Comment 2 Fedora End Of Life 2017-07-25 21:32:23 UTC
This message is a reminder that Fedora 24 is nearing its end of life.
Approximately 2 (two) weeks from now Fedora will stop maintaining
and issuing updates for Fedora 24. It is Fedora's policy to close all
bug reports from releases that are no longer maintained. At that time
this bug will be closed as EOL if it remains open with a Fedora 'version'
of '24'.

Package Maintainer: If you wish for this bug to remain open because you
plan to fix it in a currently maintained version, simply change the 'version'
to a later Fedora version.

Thank you for reporting this issue and we are sorry that we were not
able to fix it before Fedora 24 reaches end of life. If you would still like
to see this bug fixed and are able to reproduce it against a later version
of Fedora, you are encouraged to change the 'version' to a later Fedora
version before this bug is closed as described in the policy above.

Although we aim to fix as many bugs as possible during every release's
lifetime, sometimes those efforts are overtaken by events. Often a
more recent Fedora release includes newer upstream software that fixes
bugs or makes them obsolete.

Comment 3 Fedora End Of Life 2017-08-08 15:20:01 UTC
Fedora 24 changed to end-of-life (EOL) status on 2017-08-08. Fedora 24 is
no longer maintained, which means that it will not receive any further
security or bug fix updates. As a result we are closing this bug.

If you can reproduce this bug against a currently maintained version of
Fedora please feel free to reopen this bug against that version. If you
are unable to reopen this bug, please file a new report against the
current release. If you experience problems, please add a comment to this
bug.

Thank you for reporting this bug and we are sorry it could not be fixed.

