Created attachment 843778 [details]
Description of problem:
From time to time system temperature gets as high as 95C along with power usage of 30W. System load and resources remain normal.
Version-Release number of selected component (if applicable):
rpm -q kernel
It is not clear why specifically triggers the high system temperature but it is always very sudden out of nowhere.
Steps to Reproduce:
No clear steps to reproduce.
Sudden high system temperature.
Please let me know if you want any specific information. It is a new Fedora 20 install and I can't think of any particular software that would trigger such a high temperature. Also both top and powertop fails to point to any particular process.
depending on the type of machine, 30W are not all too much really ;)
Your powertop output does not add up to too much.
The higher temperature might be due to the fans not kicking in?
What system is this?
Can you attach here dmidecode and dmesg output?
commands: level <level> (<level> is 0-7, auto, disengaged, full-speed)
commands: enable, disable
commands: watchdog <timeout> (<timeout> is 0 (off), 1-120 (seconds))
systemctl status thinkfan
thinkfan.service - simple and lightweight fan control program
Loaded: loaded (/usr/lib/systemd/system/thinkfan.service; enabled)
Active: active (running) since Mon 2014-01-27 14:15:20 IST; 7h ago
Main PID: 651 (thinkfan)
Jan 27 14:15:20 fedora systemd: Starting simple and lightweight fan control program...
Jan 27 14:15:20 fedora thinkfan: Daemon PID: 651
Jan 27 14:15:20 fedora systemd: Started simple and lightweight fan control program.
Fan seems to be working fine.
This is Lenovo 4177CTO ThinkPad T420i.
It seems temperature tends to go down if I suspend the system and then wake from sleep.
Linux fedora 3.12.8-300.fc20.x86_64 #1 SMP Thu Jan 16 01:07:50 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux
Please see the attachment for dmesg, dmidecode and powertop.html.
PS:- Sorry for the delay in reply. I will be prompt from now on.
Created attachment 856134 [details]
Created attachment 856135 [details]
Created attachment 856136 [details]
latest powertop results
I've had the same issue on my ThinkPad W520 running Fedora20 (and earlier); here's the mcelog output about the event(s):
1047 CPU 1 THERMAL EVENT TSC c01ca82b78
1048 TIME 1391796070 Fri Feb 7 12:01:10 2014
1049 Processor 1 heated above trip temperature. Throttling enabled.
1050 Please check your system cooling. Performance will be impacted
1051 STATUS 88030003 MCGSTATUS 0
1052 MCGCAP c09 APICID 1 SOCKETID 0
1053 CPUID Vendor Intel Family 6 Model 42
1054 Hardware event. This is not a software error.
1055 MCE 1
1056 CPU 0 THERMAL EVENT TSC c01ca8654a
1057 TIME 1391796070 Fri Feb 7 12:01:10 2014
1058 Processor 0 heated above trip temperature. Throttling enabled.
1059 Please check your system cooling. Performance will be impacted
1060 STATUS 88030003 MCGSTATUS 0
1061 MCGCAP c09 APICID 0 SOCKETID 0
1062 CPUID Vendor Intel Family 6 Model 42
1063 Hardware event. This is not a software error.
1064 MCE 2
1065 CPU 1 THERMAL EVENT TSC c01cc9fbce
1066 TIME 1391796070 Fri Feb 7 12:01:10 2014
1067 Processor 1 below trip temperature. Throttling disabled
1068 STATUS 88040002 MCGSTATUS 0
1069 MCGCAP c09 APICID 1 SOCKETID 0
1070 CPUID Vendor Intel Family 6 Model 42
1071 Hardware event. This is not a software error.
1072 MCE 3
1073 CPU 0 THERMAL EVENT TSC c01cca2f06
1074 TIME 1391796070 Fri Feb 7 12:01:10 2014
1075 Processor 0 below trip temperature. Throttling disabled
1076 STATUS 88040002 MCGSTATUS 0
1077 MCGCAP c09 APICID 0 SOCKETID 0
1078 CPUID Vendor Intel Family 6 Model 42
regarding the thinkfan package: It's weird that there seems to be two thinkfan.conf files; one is /etc/thinkfan.conf; the other, /etc/modprobe.d/thinkfan.conf
*********** MASS BUG UPDATE **************
We apologize for the inconvenience. There is a large number of bugs to go through and several of them have gone stale. Due to this, we are doing a mass bug update across all of the Fedora 20 kernel bugs.
Fedora 20 has now been rebased to 3.13.4-200.fc20. Please test this kernel update and let us know if you issue has been resolved or if it is still present with the newer kernel.
If you experience different issues, please open a new bug report for those.
My issue has been resolved, though it may not necessarily due to the kernel update. The kernel reported correctly that the temperature was at times increasing above threshold levels; however, this was (on my ThinkPad W520) "resolved" after cleaning interior hardware components (primarily case Fan), which had gathered large amount of dust.
Nevertheless, the fact that the mcelog messages were occurring with more frequency when running Fedora, as opposed to Arch or Ubuntu, should still be a cause of concern and focus for Kernel developers. I don't know the extent of Lenovo's cooperation with the Linux community in great detail, but I do remember reading on the kernel's bugzilla reports that it was an ongoing process; is it possible to have the kernel somehow avoid "overloading" individual cores on a multi-core processor, since the messages usually point to Core 0 and/or Core 1?
I no longer seem to have the problem in 3.13.3 so I am closing this bug.