Bug 797579

Summary: Laptop fan not working due to bad read from GPU thermometer
Product: [Fedora] Fedora Reporter: Jan Sedlák <boloomka>
Component: kernelAssignee: Kernel Maintainer List <kernel-maint>
Status: CLOSED INSUFFICIENT_DATA QA Contact: Fedora Extras Quality Assurance <extras-qa>
Severity: high Docs Contact:
Priority: unspecified    
Version: 16CC: airlied, chpatrick, collura, gansalmon, itamar, jfeeney, jforbes, jonathan, jsedlak, kernel-maint, madhu.chinakonda
Target Milestone: ---   
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2012-11-13 15:43:39 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Attachments:
Description Flags
acpidump with kernel-3.5.3-1.fc17.x86_64 none

Description Jan Sedlák 2012-02-26 21:32:02 UTC
Description of problem:
I have Lenovo Edge E520 with Sandy Bridge architecture and two switchable GPUs, integrated Intel and discrete ATI. I'm using kernel feature vgaswitcheroo (http://en.gentoo-wiki.com/wiki/Vga_switcheroo) to switch off discrete GPU. And my laptop is overheating. I have discovered that my laptop fan isn't working, because Linux kernel is reading temperature of whole notebook from thermometer that is on discrete GPU, even when I switch that GPU off. Then system thinks that my laptop has about 30 degrees celsius (even when it has 80 degrees) and it doesn't start my laptop fan.

It is very bad, because overheating can possibly lead to shortening the lifespan of components. This bug is documented on other distros (http://ubuntuforums.org/showthread.php?t=1900942) and even other operating systems (http://forums.lenovo.com/t5/ThinkPad-Edge/E520-Fan-sometimes-not-working-after-switchable-graphic-issue/td-p/586829/page/4) as well.

I think that it's not hardware problem, because when I switch off discrete GPU in BIOS, temperature is right and fan works as expected.

How reproducible:
I think that it is easily reproducible on laptops with two (hybrid) GPUs (at least on Edge E520).

Steps to Reproduce:
1. Get Lenovo Edge E520 (not sure if it is problem with all hybrid GPUs)
2. Switch off discrete GPU (with vgaswitcheroo)
3. Do some work with it. It will overheat. Fan will not work.
  
Actual results:
Fan doesn't start even if the whole laptop has ~80 degrees celsius.

Expected results:
Fan would start spinning when the laptop reaches ~40 degrees celsius and cool it down.

Comment 1 Josh Boyer 2012-02-27 14:51:54 UTC
Dave and Matthew, is there anything that can be done here?

Comment 2 Matthew Garrett 2012-02-27 15:08:22 UTC
Please install the pmtools package and run (as root) acpidump and attach the output.

Comment 3 Jan Sedlák 2012-02-27 16:46:09 UTC
Here is dump when the discrete GPU is on: http://fpaste.org/SiAi/
and here when it's off: http://fpaste.org/hDOR/

Comment 4 Dave Jones 2012-03-22 17:03:11 UTC
[mass update]
kernel-3.3.0-4.fc16 has been pushed to the Fedora 16 stable repository.
Please retest with this update.

Comment 5 Dave Jones 2012-03-22 17:06:25 UTC
[mass update]
kernel-3.3.0-4.fc16 has been pushed to the Fedora 16 stable repository.
Please retest with this update.

Comment 6 Dave Jones 2012-03-22 17:17:25 UTC
[mass update]
kernel-3.3.0-4.fc16 has been pushed to the Fedora 16 stable repository.
Please retest with this update.

Comment 7 collura 2012-09-13 06:56:24 UTC
Created attachment 612359 [details]
acpidump with kernel-3.5.3-1.fc17.x86_64

Comment 8 collura 2012-09-13 07:00:02 UTC
this lack of fan speed seems to happen on occasion not sure why but seems to have gotten worse after some recent updates but not sure which.

http://www.smolts.org/client/show/pub_d71dc9ab-d675-4ee2-a9a9-303efc25df83

Comment 9 Jan Sedlák 2012-09-13 09:51:30 UTC
To [NEEDINFO] - yep, it still doesn't work.

Comment 10 Dave Jones 2012-10-23 15:29:34 UTC
# Mass update to all open bugs.

Kernel 3.6.2-1.fc16 has just been pushed to updates.
This update is a significant rebase from the previous version.

Please retest with this kernel, and let us know if your problem has been fixed.

In the event that you have upgraded to a newer release and the bug you reported
is still present, please change the version field to the newest release you have
encountered the issue with.  Before doing so, please ensure you are testing the
latest kernel update in that release and attach any new and relevant information
you may have gathered.

If you are not the original bug reporter and you still experience this bug,
please file a new report, as it is possible that you may be seeing a
different problem. 
(Please don't clone this bug, a fresh bug referencing this bug in the comment is sufficient).

Comment 11 Justin M. Forbes 2012-11-13 15:43:39 UTC
With no response, we are closing this bug under the assumption that it is no longer an issue. If you still experience this bug, please feel free to reopen the bug report.