Description of problem: With 3.6.7-5.fc18.i686 kernel in dmesg output sometimes occurs message: [22826.654365] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung [22826.654369] [drm] capturing error event; look for more information in /debug/dri/0/i915_error_state Seems this bug already fixed in drm-intel-next. https://bugs.freedesktop.org/show_bug.cgi?id=54226 I want that this patch will be included in Fedora.
(In reply to comment #0) > Description of problem: > With 3.6.7-5.fc18.i686 kernel in dmesg output sometimes occurs message: > [22826.654365] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... > GPU hung > [22826.654369] [drm] capturing error event; look for more information in > /debug/dri/0/i915_error_state > > > Seems this bug already fixed in drm-intel-next. > > https://bugs.freedesktop.org/show_bug.cgi?id=54226 > > I want that this patch will be included in Fedora. Adam, Dave, is this commit: http://cgit.freedesktop.org/~danvet/drm-intel/commit/drivers/gpu/drm?h=drm-intel-next&id=1c8b46fc8c865189f562c9ab163d63863759712f something that is stand-alone that can be cherry-picked? I can't immediately tell from the commit log, and it isn't CC'd to stable so I'm thinking it isn't.
*** Bug 879825 has been marked as a duplicate of this bug. ***
Should be fine to backport yes.
(In reply to comment #3) > Should be fine to backport yes. OK. I'll look at getting that done later today. Mikhail, I'll have a scratch kernel for you to test with and would appreciate your feedback on it once I put the link in this bug.
OK, here's the scratch build. Please test when it completes: http://koji.fedoraproject.org/koji/taskinfo?taskID=4759096
Problem repeated with patched kernel. [118637.439016] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung [118637.439020] [drm] capturing error event; look for more information in /debug/dri/0/i915_error_state [mikhail@localhost ~]$ uname -a Linux localhost.localdomain 3.6.9-4.1.fc18.i686.PAE #1 SMP Wed Dec 5 15:16:33 UTC 2012 i686 i686 i386 GNU/Linux [mikhail@localhost ~]$ sudo cat /sys/kernel/debug/dri/0/i915_error_state > i915_error_state [sudo] password for mikhail: [mikhail@localhost ~]$
Created attachment 659798 [details] i915_error_state
kernel: [193364.241099] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung kernel: [193364.241104] [drm] capturing error event; look for more information in /debug/dri/0/i915_error_state $ uname -a Linux cwawak-rhlaptop 3.6.11-3.fc18.x86_64 #1 SMP Mon Dec 17 21:35:39 UTC 2012 x86_64 x86_64 x86_64 GNU/Linux This is happening roughly every other day or so, I'll try to grab i915_error_state next time.
Josh, can you create new kernel package for test new patch from upstream? https://bugs.freedesktop.org/show_bug.cgi?id=54226#c26 https://bugs.freedesktop.org/attachment.cgi?id=72766
(In reply to comment #9) > Josh, can you create new kernel package for test new patch from upstream? > > https://bugs.freedesktop.org/show_bug.cgi?id=54226#c26 > > https://bugs.freedesktop.org/attachment.cgi?id=72766 There's a different patch that has been submitted to stable for a GPU hung issue. I'm not sure which would be preferred at this point.
[ 9418.403018] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung [ 9418.403023] [drm] capturing error event; look for more information in /debug/dri/0/i915_error_state # uname -a Linux localhost 3.7.1-2.fc18.x86_64 #1 SMP Fri Jan 4 00:10:48 UTC 2013 x86_64 x86_64 x86_64 GNU/Linux $ sudo cat /sys/kernel/debug/dri/0/i915_error_state > i915_error_state cat: /sys/kernel/debug/dri/0/i915_error_state: Cannot allocate memory Couldn't grab i915_error_state.
(In reply to comment #10) > There's a different patch that has been submitted to stable for a GPU hung > issue. I'm not sure which would be preferred at this point. Which patches do you mean? I mean last patch from comment 26.
I have this bug too... quick steps to reproduce: install xbmc (rpmfusion) install libva and libva-intel-driver enable vaapi e start playing a movie with a intel i5-3570K.
What progress is being made on this issue? What can I do to help? It's seriously impeding my ability to use Fedora 18 on a Thinkpad T420s. Thanks!
Christopher apparently this bug has been fixed in the current stable kernel. Please check https://admin.fedoraproject.org/updates/F18/FEDORA-2013-1443 I haven't updated yet, but I'll do it and then I'll report here.
It seems to be fixed in the Fedora18 stable kernel, the problem hasn't occurred here after updating to it for a few days (where it used to happen multiply times each day)
I still receive drm:i915_hangcheck_hung error message in dmesg output: [125692.981672] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung [127544.267347] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung [127966.207633] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung [128452.389972] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung But my kernel is 3.7.4-204 $ uname -a Linux localhost.localdomain 3.7.4-204.fc18.i686.PAE #1 SMP Wed Jan 23 16:58:41 UTC 2013 i686 i686 i386 GNU/Linux This bug still not fixed yet. Which patch are you applied for it?
Please help compile kernel with: This https://bugs.freedesktop.org/attachment.cgi?id=73577 and this https://bugs.freedesktop.org/attachment.cgi?id=72766 patches, for each patch separate kernel for test purpose
Created attachment 695697 [details] my kernel.spec which couldn't compile
drivers/gpu/drm/i915/intel_ringbuffer.c: In function 'gen6_add_request': drivers/gpu/drm/i915/intel_ringbuffer.c:611:3: error: too few arguments to function 'update_mboxes' drivers/gpu/drm/i915/intel_ringbuffer.c:557:1: note: declared here drivers/gpu/drm/i915/intel_ringbuffer.c:612:3: error: too few arguments to function 'update_mboxes' drivers/gpu/drm/i915/intel_ringbuffer.c:557:1: note: declared here make[4]: *** [drivers/gpu/drm/i915/intel_ringbuffer.o] Error 1 make[3]: *** [drivers/gpu/drm/i915] Error 2 make[2]: *** [drivers/gpu/drm] Error 2 make[1]: *** [drivers/gpu] Error 2 make[1]: *** Waiting for unfinished jobs.... make: *** [drivers] Error 2 make: *** Waiting for unfinished jobs.... [reply] [-] Comment 36
I also get this error on Fedora 18 on a Thinkpad T520. dmesg: [ 6934.976050] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung [ 6934.976055] [drm] capturing error event; look for more information in /debug/dri/0/i915_error_state Linux 3.7.6-201.fc18.x86_64 #1 SMP Mon Feb 4 15:54:08 UTC 2013 x86_64 x86_64 x86_64 GNU/Linux
Hello, i have the same Bug with CPU intel G645 and openelec 3.0 when do you thinks the bug fixed? Many Thanks for help Charly Mar 28 19:24:55 openelec user.err kernel: [ 94.646120] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung Mar 28 19:24:55 openelec user.info kernel: [ 94.646124] [drm] capturing error event; look for more information in /debug/dri/0/i915_error_state Mar 28 19:25:01 openelec user.err kernel: [ 101.009793] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung Mar 28 19:26:00 openelec user.err kernel: [ 159.705871] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung Mar 28 19:26:06 openelec user.err kernel: [ 166.348828] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung Mar 28 19:26:13 openelec user.err kernel: [ 172.991785] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung Mar 28 19:26:15 openelec user.err kernel: [ 174.494596] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung Mar 28 19:26:15 openelec user.err kernel: [ 174.494644] [drm:i915_reset] *ERROR* GPU hanging too fast, declaring wedged! Mar 28 19:26:15 openelec user.err kernel: [ 174.494645] [drm:i915_reset] *ERROR* Failed to reset chip. Mar 28 19:26:15 openelec user.info kernel: [ 174.538630] Jobworker[1210]: segfault at 7f07a8067e59 ip 00007f07a8067e59 sp 00007f078abc3378 error 14
Still running into this problem. Linux localhost 3.8.11-200.fc18.x86_64 #1 SMP Wed May 1 19:44:27 UTC 2013 x86_64 x86_64 x86_64 GNU/Linux I didn't notice it for a week when I was not in the dock, but noticed the issue again when I am plugged into my dock with a 24" and 20" LCD plugged in (internal LCD turned off.) May 8 15:36:40 localhost kernel: [92938.244482] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung May 8 15:36:40 localhost kernel: [92938.244487] [drm] capturing error event; look for more information in /debug/dri/0/i915_error_state May 8 15:36:48 localhost kernel: [92946.237923] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung and when I tried to grab the i915_error_state, I get a page allocation error: May 8 15:38:01 localhost kernel: [93018.742214] cat: page allocation failure: order:9, mode:0x1040d0 May 8 15:38:01 localhost kernel: [93018.742218] Pid: 17595, comm: cat Tainted: G W 3.8.11-200.fc18.x86_64 #1 May 8 15:38:01 localhost kernel: [93018.742219] Call Trace: May 8 15:38:01 localhost kernel: [93018.742226] [<ffffffff811380d9>] warn_alloc_failed+0xe9/0x150 May 8 15:38:01 localhost kernel: [93018.742229] [<ffffffff8164b36d>] ? __alloc_pages_direct_compact+0x182/0x194 May 8 15:38:01 localhost kernel: [93018.742231] [<ffffffff8113c2fe>] __alloc_pages_nodemask+0x80e/0xa80 May 8 15:38:01 localhost kernel: [93018.742234] [<ffffffff8117a5e8>] alloc_pages_current+0xb8/0x190 May 8 15:38:01 localhost kernel: [93018.742237] [<ffffffff81136fea>] __get_free_pages+0x2a/0x80 May 8 15:38:01 localhost kernel: [93018.742239] [<ffffffff81185909>] kmalloc_order_trace+0x39/0xb0 May 8 15:38:01 localhost kernel: [93018.742240] [<ffffffff81185b40>] __kmalloc+0x1c0/0x250 May 8 15:38:01 localhost kernel: [93018.742242] [<ffffffff81184747>] ? kfree+0x157/0x170 May 8 15:38:01 localhost kernel: [93018.742244] [<ffffffff811bf42e>] seq_read+0x10e/0x3b0 May 8 15:38:01 localhost kernel: [93018.742247] [<ffffffff8119dce9>] vfs_read+0xa9/0x180 May 8 15:38:01 localhost kernel: [93018.742248] [<ffffffff8119de12>] sys_read+0x52/0xa0 May 8 15:38:01 localhost kernel: [93018.742252] [<ffffffff816577be>] ? do_page_fault+0xe/0x10 May 8 15:38:01 localhost kernel: [93018.742254] [<ffffffff8165be19>] system_call_fastpath+0x16/0x1b May 8 15:38:01 localhost kernel: [93018.742255] Mem-Info: May 8 15:38:01 localhost kernel: [93018.742256] Node 0 DMA per-cpu: May 8 15:38:01 localhost kernel: [93018.742257] CPU 0: hi: 0, btch: 1 usd: 0 May 8 15:38:01 localhost kernel: [93018.742258] CPU 1: hi: 0, btch: 1 usd: 0 May 8 15:38:01 localhost kernel: [93018.742259] CPU 2: hi: 0, btch: 1 usd: 0 May 8 15:38:01 localhost kernel: [93018.742259] CPU 3: hi: 0, btch: 1 usd: 0 May 8 15:38:01 localhost kernel: [93018.742260] Node 0 DMA32 per-cpu: May 8 15:38:01 localhost kernel: [93018.742261] CPU 0: hi: 186, btch: 31 usd: 0 May 8 15:38:01 localhost kernel: [93018.742262] CPU 1: hi: 186, btch: 31 usd: 0 May 8 15:38:01 localhost kernel: [93018.742263] CPU 2: hi: 186, btch: 31 usd: 0 May 8 15:38:01 localhost kernel: [93018.742263] CPU 3: hi: 186, btch: 31 usd: 0 May 8 15:38:01 localhost kernel: [93018.742264] Node 0 Normal per-cpu: May 8 15:38:01 localhost kernel: [93018.742265] CPU 0: hi: 186, btch: 31 usd: 0 May 8 15:38:01 localhost kernel: [93018.742265] CPU 1: hi: 186, btch: 31 usd: 0 May 8 15:38:01 localhost kernel: [93018.742266] CPU 2: hi: 186, btch: 31 usd: 0 May 8 15:38:01 localhost kernel: [93018.742267] CPU 3: hi: 186, btch: 31 usd: 0 May 8 15:38:01 localhost kernel: [93018.742269] active_anon:609781 inactive_anon:119636 isolated_anon:0 May 8 15:38:01 localhost kernel: [93018.742269] active_file:440429 inactive_file:439906 isolated_file:0 May 8 15:38:01 localhost kernel: [93018.742269] unevictable:952 dirty:108 writeback:0 unstable:0 May 8 15:38:01 localhost kernel: [93018.742269] free:254868 slab_reclaimable:53051 slab_unreclaimable:17425 May 8 15:38:01 localhost kernel: [93018.742269] mapped:41776 shmem:73070 pagetables:15919 bounce:0 May 8 15:38:01 localhost kernel: [93018.742269] free_cma:0 May 8 15:38:01 localhost kernel: [93018.742271] Node 0 DMA free:15344kB min:124kB low:152kB high:184kB active_anon:0kB inactive_anon:0kB active_file:0kB inactive_file:0kB unevictable:0kB isolated(anon):0kB isolated(file):0kB present:15104kB managed:15360kB mlocked:0kB dirty:0kB writeback:0kB mapped:0kB shmem:0kB slab_reclaimable:0kB slab_unreclaimable:16kB kernel_stack:0kB pagetables:0kB unstable:0kB bounce:0kB free_cma:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? yes May 8 15:38:01 localhost kernel: [93018.742274] lowmem_reserve[]: 0 3312 7822 7822 May 8 15:38:01 localhost kernel: [93018.742276] Node 0 DMA32 free:809648kB min:28560kB low:35700kB high:42840kB active_anon:837140kB inactive_anon:108356kB active_file:785292kB inactive_file:730236kB unevictable:92kB isolated(anon):0kB isolated(file):0kB present:3391540kB managed:3332476kB mlocked:92kB dirty:64kB writeback:0kB mapped:15836kB shmem:30360kB slab_reclaimable:69548kB slab_unreclaimable:6904kB kernel_stack:336kB pagetables:7844kB unstable:0kB bounce:0kB free_cma:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? no May 8 15:38:01 localhost kernel: [93018.742279] lowmem_reserve[]: 0 0 4510 4510 May 8 15:38:01 localhost kernel: [93018.742280] Node 0 Normal free:194480kB min:38892kB low:48612kB high:58336kB active_anon:1601984kB inactive_anon:370188kB active_file:976424kB inactive_file:1029388kB unevictable:3716kB isolated(anon):0kB isolated(file):0kB present:4618656kB managed:4561328kB mlocked:3716kB dirty:368kB writeback:0kB mapped:151268kB shmem:261920kB slab_reclaimable:142656kB slab_unreclaimable:62780kB kernel_stack:4232kB pagetables:55832kB unstable:0kB bounce:0kB free_cma:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? no May 8 15:38:01 localhost kernel: [93018.742283] lowmem_reserve[]: 0 0 0 0 May 8 15:38:01 localhost kernel: [93018.742285] Node 0 DMA: 0*4kB 0*8kB 1*16kB (U) 1*32kB (U) 1*64kB (U) 1*128kB (U) 1*256kB (U) 1*512kB (U) 0*1024kB 1*2048kB (R) 3*4096kB (M) = 15344kB May 8 15:38:01 localhost kernel: [93018.742291] Node 0 DMA32: 13104*4kB (UEMR) 10711*8kB (UEMR) 5798*16kB (UEMR) 2971*32kB (UEMR) 2445*64kB (UEMR) 848*128kB (UEMR) 366*256kB (UEM) 129*512kB (UEMR) 56*1024kB (UEMR) 1*2048kB (E) 0*4096kB = 810104kB May 8 15:38:01 localhost kernel: [93018.742297] Node 0 Normal: 2375*4kB (UEM) 2239*8kB (UEM) 1617*16kB (UEM) 1709*32kB (UEM) 665*64kB (UEM) 221*128kB (UEM) 53*256kB (UEM) 3*512kB (EM) 1*1024kB (U) 0*2048kB 0*4096kB = 194948kB May 8 15:38:01 localhost kernel: [93018.742303] 953976 total pagecache pages May 8 15:38:01 localhost kernel: [93018.742304] 2 pages in swap cache May 8 15:38:01 localhost kernel: [93018.742305] Swap cache stats: add 1465, delete 1463, find 667/667 May 8 15:38:01 localhost kernel: [93018.742305] Free swap = 7944168kB May 8 15:38:01 localhost kernel: [93018.742306] Total swap = 7944188kB May 8 15:38:01 localhost kernel: [93018.760469] 2057712 pages RAM May 8 15:38:01 localhost kernel: [93018.760471] 71529 pages reserved May 8 15:38:01 localhost kernel: [93018.760472] 1905068 pages shared May 8 15:38:01 localhost kernel: [93018.760472] 1186634 pages non-shared
+1 [ 2653.470231] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung [anton@bandura ~]$ uname -a Linux bandura.laptop 3.9.2-301.fc19.x86_64 #1 SMP Mon May 13 12:36:24 UTC 2013 x86_64 x86_64 x86_64 GNU/Linux
Created attachment 751616 [details] i915_error_state 2
Created attachment 754277 [details] contents of /sys/kernel/debug/dri/0/i915_error_state after GPU hang Since a few weeks I've been having this too on my Lenovo W520. It's very annoying and happens randomly during the day, usually when I'm right in the middle of something. I also have the distinct feeling that having flash and/or skype loaded/running seems to trigger it sooner. But I might be entirely wrong there. mesa-dri-drivers.i686 9.1-3.fc18 mesa-dri-drivers.x86_64 9.1-3.fc18 mesa-dri-filesystem.i686 9.1-3.fc18 mesa-dri-filesystem.x86_64 9.1-3.fc18 xorg-x11-drv-intel.x86_64 2.21.6-1.fc18
forgot the installed kernel rpms... kernel.x86_64 3.8.11-200.fc18 kernel.x86_64 3.9.2-200.fc18 kernel.x86_64 3.9.4-200.fc18 kernel-devel.x86_64 3.8.11-200.fc18 kernel-devel.x86_64 3.9.2-200.fc18 kernel-devel.x86_64 3.9.4-200.fc18 kernel-headers.x86_64 3.9.4-200.fc18 kernel-modules-extra.x86_64 3.8.11-200.fc18 kernel-modules-extra.x86_64 3.9.2-200.fc18 kernel-modules-extra.x86_64 3.9.4-200.fc18
Just received this error too on a Dell Latitude E5420 running Fedora 18 Running KDE4 with desktop effects turned on. All I did was click on a spreadsheet document in Google Docs using Chrome, and it hung. [ 6893.718127] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung [ 6893.718156] [drm] capturing error event; look for more information in/sys/kernel/debug/dri/0/i915 Intersting thing is that I can hit CTRL-ALT-F2 or another virtual terminal, switch to it, and then switch back to GUI with CTRL-ALT-F1 and everything is back working again. It seems to hang until I switch the display back/forth between text and graphical again.
(In reply to Christopher J. from comment #28) > Just received this error too on a Dell Latitude E5420 running Fedora 18 > Running KDE4 with desktop effects turned on. All I did was click on a > spreadsheet document in Google Docs using Chrome, and it hung. > > [ 6893.718127] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... > GPU hung > [ 6893.718156] [drm] capturing error event; look for more information > in/sys/kernel/debug/dri/0/i915 > > Intersting thing is that I can hit CTRL-ALT-F2 or another virtual terminal, > switch to it, and then switch back to GUI with CTRL-ALT-F1 and everything is > back working again. It seems to hang until I switch the display back/forth > between text and graphical again. kernel-3.9.4-200.fc18.x86_64 kernel-modules-extra-3.9.2-200.fc18.x86_64 kernel-modules-extra-3.9.4-200.fc18.x86_64 kernel-modules-extra-3.9.5-201.fc18.x86_64 kernel-3.9.2-200.fc18.x86_64 kernel-3.9.5-201.fc18.x86_64 xorg-x11-drv-intel-2.21.8-1.fc18.x86_64 KDE Platform Version 4.10.4
I see this issue too. Almost everyday it hangs once or twice. Switch to another tty and then switch back seems to unfreeze my Thinkpad T520 Jun 21 20:42:20 dhcp-0-126 kernel: [ 5615.933753] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung Jun 21 20:42:20 dhcp-0-126 kernel: [ 5615.933764] [drm] capturing error event; look for more information in/sys/kernel/debug/dri/0/i915_error_state
Created attachment 763881 [details] error state file for Thinkpad T520 Fedora 18 GNOME 3
forgot to add env details: F18 latest updates (June 21 2013). 64-bit [rsriniva@valhalla ~]$ uname -a Linux valhalla 3.9.6-200.fc18.x86_64 #1 SMP Thu Jun 13 18:56:55 UTC 2013 x86_64 x86_64 x86_64 GNU/Linux [rsriniva@valhalla ~]$ lspci -vv 00:00.0 Host bridge: Intel Corporation 2nd Generation Core Processor Family DRAM Controller (rev 09) Subsystem: Lenovo Device 21cf Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort+ >SERR- <PERR- INTx- Latency: 0 Capabilities: <access denied> 00:02.0 VGA compatible controller: Intel Corporation 2nd Generation Core Processor Family Integrated Graphics Controller (rev 09) (prog-if 00 [VGA controller]) Subsystem: Lenovo Device 21cf Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+ Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Latency: 0 Interrupt: pin A routed to IRQ 44 Region 0: Memory at f0000000 (64-bit, non-prefetchable) [size=4M] Region 2: Memory at e0000000 (64-bit, prefetchable) [size=256M] Region 4: I/O ports at 4000 [size=64] Expansion ROM at <unassigned> [disabled] Capabilities: <access denied> Kernel driver in use: i915 00:16.0 Communication controller: Intel Corporation 6 Series/C200 Series Chipset Family MEI Controller #1 (rev 04) Subsystem: Lenovo Device 21cf Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+ Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Latency: 0 Interrupt: pin A routed to IRQ 45 Region 0: Memory at f1525000 (64-bit, non-prefetchable) [size=16] Capabilities: <access denied> Kernel driver in use: mei 00:16.3 Serial controller: Intel Corporation 6 Series/C200 Series Chipset Family KT Controller (rev 04) (prog-if 02 [16550]) Subsystem: Lenovo Device 21cf Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap+ 66MHz+ UDF- FastB2B+ ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Latency: 0 Interrupt: pin B routed to IRQ 19 Region 0: I/O ports at 40b0 [size=8] Region 1: Memory at f152c000 (32-bit, non-prefetchable) [size=4K] Capabilities: <access denied> Kernel driver in use: serial 00:19.0 Ethernet controller: Intel Corporation 82579LM Gigabit Network Connection (rev 04) Subsystem: Lenovo Device 21ce Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+ Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Latency: 0 Interrupt: pin A routed to IRQ 43 Region 0: Memory at f1500000 (32-bit, non-prefetchable) [size=128K] Region 1: Memory at f152b000 (32-bit, non-prefetchable) [size=4K] Region 2: I/O ports at 4080 [size=32] Capabilities: <access denied> Kernel driver in use: e1000e 00:1a.0 USB controller: Intel Corporation 6 Series/C200 Series Chipset Family USB Enhanced Host Controller #2 (rev 04) (prog-if 20 [EHCI]) Subsystem: Lenovo Device 21cf Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Latency: 0 Interrupt: pin A routed to IRQ 16 Region 0: Memory at f152a000 (32-bit, non-prefetchable) [size=1K] Capabilities: <access denied> Kernel driver in use: ehci-pci 00:1b.0 Audio device: Intel Corporation 6 Series/C200 Series Chipset Family High Definition Audio Controller (rev 04) Subsystem: Lenovo Device 21cf Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+ Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Latency: 0, Cache Line Size: 64 bytes Interrupt: pin A routed to IRQ 47 Region 0: Memory at f1520000 (64-bit, non-prefetchable) [size=16K] Capabilities: <access denied> Kernel driver in use: snd_hda_intel 00:1c.0 PCI bridge: Intel Corporation 6 Series/C200 Series Chipset Family PCI Express Root Port 1 (rev b4) (prog-if 00 [Normal decode]) Control: I/O- Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Latency: 0, Cache Line Size: 64 bytes Bus: primary=00, secondary=02, subordinate=02, sec-latency=0 Secondary status: 66MHz- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort+ <SERR- <PERR- BridgeCtl: Parity- SERR- NoISA- VGA- MAbort- >Reset- FastB2B- PriDiscTmr- SecDiscTmr- DiscTmrStat- DiscTmrSERREn- Capabilities: <access denied> Kernel driver in use: pcieport 00:1c.1 PCI bridge: Intel Corporation 6 Series/C200 Series Chipset Family PCI Express Root Port 2 (rev b4) (prog-if 00 [Normal decode]) Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Latency: 0, Cache Line Size: 64 bytes Bus: primary=00, secondary=03, subordinate=03, sec-latency=0 Memory behind bridge: f1400000-f14fffff Secondary status: 66MHz- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- <SERR- <PERR- BridgeCtl: Parity- SERR- NoISA- VGA- MAbort- >Reset- FastB2B- PriDiscTmr- SecDiscTmr- DiscTmrStat- DiscTmrSERREn- Capabilities: <access denied> Kernel driver in use: pcieport 00:1c.4 PCI bridge: Intel Corporation 6 Series/C200 Series Chipset Family PCI Express Root Port 5 (rev b4) (prog-if 00 [Normal decode]) Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Latency: 0, Cache Line Size: 64 bytes Bus: primary=00, secondary=0d, subordinate=0d, sec-latency=0 I/O behind bridge: 00003000-00003fff Memory behind bridge: f0c00000-f13fffff Prefetchable memory behind bridge: 00000000f0400000-00000000f0bfffff Secondary status: 66MHz- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- <SERR- <PERR- BridgeCtl: Parity- SERR- NoISA- VGA- MAbort- >Reset- FastB2B- PriDiscTmr- SecDiscTmr- DiscTmrStat- DiscTmrSERREn- Capabilities: <access denied> Kernel driver in use: pcieport 00:1d.0 USB controller: Intel Corporation 6 Series/C200 Series Chipset Family USB Enhanced Host Controller #1 (rev 04) (prog-if 20 [EHCI]) Subsystem: Lenovo Device 21cf Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Latency: 0 Interrupt: pin A routed to IRQ 23 Region 0: Memory at f1529000 (32-bit, non-prefetchable) [size=1K] Capabilities: <access denied> Kernel driver in use: ehci-pci 00:1f.0 ISA bridge: Intel Corporation QM67 Express Chipset Family LPC Controller (rev 04) Subsystem: Lenovo Device 21cf Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Latency: 0 Capabilities: <access denied> Kernel driver in use: lpc_ich 00:1f.2 SATA controller: Intel Corporation 6 Series/C200 Series Chipset Family 6 port SATA AHCI Controller (rev 04) (prog-if 01 [AHCI 1.0]) Subsystem: Lenovo Device 21cf Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+ Status: Cap+ 66MHz+ UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Latency: 0 Interrupt: pin B routed to IRQ 42 Region 0: I/O ports at 40a8 [size=8] Region 1: I/O ports at 40bc [size=4] Region 2: I/O ports at 40a0 [size=8] Region 3: I/O ports at 40b8 [size=4] Region 4: I/O ports at 4060 [size=32] Region 5: Memory at f1528000 (32-bit, non-prefetchable) [size=2K] Capabilities: <access denied> Kernel driver in use: ahci 00:1f.3 SMBus: Intel Corporation 6 Series/C200 Series Chipset Family SMBus Controller (rev 04) Subsystem: Lenovo Device 21cf Control: I/O+ Mem+ BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap- 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Interrupt: pin C routed to IRQ 18 Region 0: Memory at f1524000 (64-bit, non-prefetchable) [size=256] Region 4: I/O ports at efa0 [size=32] Kernel driver in use: i801_smbus 03:00.0 Network controller: Intel Corporation Centrino Ultimate-N 6300 (rev 3e) Subsystem: Intel Corporation Centrino Ultimate-N 6300 3x3 AGN Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+ Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Latency: 0, Cache Line Size: 64 bytes Interrupt: pin A routed to IRQ 46 Region 0: Memory at f1400000 (64-bit, non-prefetchable) [size=8K] Capabilities: <access denied> Kernel driver in use: iwlwifi 0d:00.0 System peripheral: Ricoh Co Ltd PCIe SDXC/MMC Host Controller (rev 08) (prog-if 01) Subsystem: Lenovo Device 21cf Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Latency: 0, Cache Line Size: 64 bytes Interrupt: pin A routed to IRQ 16 Region 0: Memory at f0c00000 (32-bit, non-prefetchable) [size=256] Capabilities: <access denied> Kernel driver in use: sdhci-pci [rsriniva@valhalla ~]$ rpm -qa | grep intel xorg-x11-drv-intel-2.21.8-1.fc18.x86_64 [rsriniva@valhalla ~]$ rpm -qa | grep mesa mesa-libxatracker-9.2-0.7.20130528.fc18.x86_64 mesa-libEGL-9.2-0.7.20130528.fc18.x86_64 mesa-libGL-9.2-0.7.20130528.fc18.i686 mesa-libGLU-9.0.0-1.fc18.x86_64 mesa-filesystem-9.2-0.7.20130528.fc18.x86_64 mesa-libGLES-9.2-0.7.20130528.fc18.x86_64 mesa-libGL-9.2-0.7.20130528.fc18.x86_64 mesa-libglapi-9.2-0.7.20130528.fc18.i686 mesa-libgbm-9.2-0.7.20130528.fc18.i686 mesa-libglapi-9.2-0.7.20130528.fc18.x86_64 mesa-libgbm-9.2-0.7.20130528.fc18.x86_64 mesa-libGLU-9.0.0-1.fc18.i686 mesa-libEGL-9.2-0.7.20130528.fc18.i686 mesa-dri-drivers-9.2-0.7.20130528.fc18.x86_64 [rsriniva@valhalla ~]$ rpm -qa | grep kernel kernel-devel-3.9.6-200.fc18.x86_64 abrt-addon-kerneloops-2.1.4-3.fc18.x86_64 kernel-modules-extra-3.9.6-200.fc18.x86_64 libreport-plugin-kerneloops-2.1.4-4.fc18.x86_64 kernel-3.9.4-200.fc18.x86_64 kernel-headers-3.9.6-200.fc18.x86_64 kernel-3.9.6-200.fc18.x86_64 kernel-modules-extra-3.9.4-200.fc18.x86_64
Created attachment 765278 [details] Fedora 19 i915_error_state Thinkpad E420 kernel-3.9.5-301.fc19.x86_64 xorg-x11-drv-intel-2.21.8-1.fc19.x86_64 Most often GPU freeze happens during google hangout video call, i.e. display freezes but sound still works. Attempt to switch to another VT doesn't work, i.e. screen stays frozen. Host is accessible via ssh and killing Xorg restores VT console and after that GDM restarts and new X session works ok for some time. There is backtrace in Xorg.log which will be attached with next attachment.
Created attachment 765280 [details] Fedora 19 Xorg.0.log Thinkpad E420 complete log is in attachment --- cut --- (EE) [mi] EQ overflowing. Additional events will be discarded until existing events are processed. (EE) (EE) Backtrace: (EE) 0: /usr/bin/Xorg (mieqEnqueue+0x22b) [0x575fab] (EE) 1: /usr/bin/Xorg (QueuePointerEvents+0x52) [0x44d612] (EE) 2: /usr/lib64/xorg/modules/input/synaptics_drv.so (_init+0x29c8) [0x7f14783ed968] (EE) 3: /usr/lib64/xorg/modules/input/synaptics_drv.so (_init+0x46ea) [0x7f14783f1d8a] (EE) 4: /usr/bin/Xorg (DPMSSupported+0xe8) [0x485978] (EE) 5: /usr/bin/Xorg (xf86SerialModemClearBits+0x230) [0x4adf30] (EE) 6: /lib64/libpthread.so.0 (__restore_rt+0x0) [0x3ccc20ef9f] (EE) 7: /lib64/libpthread.so.0 (__read_nocancel+0x24) [0x3ccc20e0c4] (EE) 8: /usr/lib64/xorg/modules/drivers/intel_drv.so (_init+0x57397) [0x7f147a6ea767] (EE) 9: /usr/lib64/xorg/modules/drivers/intel_drv.so (_init+0x57dd0) [0x7f147a6eb930] (EE) 10: /usr/lib64/xorg/modules/drivers/intel_drv.so (_init+0x47c50) [0x7f147a6cb5a0] (EE) 11: /usr/bin/Xorg (BlockHandler+0x44) [0x43ae44] (EE) 12: /usr/bin/Xorg (WaitForSomething+0x124) [0x4660b4] (EE) 13: /usr/bin/Xorg (SendErrorToClient+0xe1) [0x436b31] (EE) 14: /usr/bin/Xorg (_init+0x3ab2) [0x429ae2] (EE) 15: /lib64/libc.so.6 (__libc_start_main+0xf5) [0x3ccbe21b75] (EE) 16: /usr/bin/Xorg (_start+0x29) [0x426741] (EE) 17: ? (?+0x29) [0x29] (EE) (EE) [mi] These backtraces from mieqEnqueue may point to a culprit higher up the stack. (EE) [mi] mieq is *NOT* the cause. It is a victim. --- cut ---
I did a "yum downgrade xorg-x11-drv-intel" and T520 is much more stable now. It rolled back to version 2.20.x of the Intel Driver.
Same here on T410. 2.21 seems completely unstable as I get horrible performance and a hung GPU very easily (This seems unrelated to the semaphore problem mentioned by the original report: I used sysfs to switch off semaphores. So I am guessing some of us are seeing a different bug). 2.20.14 is fine.
I ran on 2.20.14 for a while too but that one was also unstable for me, causing just a many GPU hangs. this morning I upgraded to F19 and haven't seen the GPU hang yet. kernel 3.9.8-300.fc19 xorg 2.21.8-1.fc19
Ah, it's not fixed. The thing I now see is that the (drawn) mouse cursor doesn't move but that the actual cursor still moves (I can see that because of the changing highlights in my frippery favorites on the top of the screen). so the full GPU hang has become a hang of the cursor planes
After a some days working with F19 on my W520, I can say that the hangs of the cursor planes happens extremely often. At least the full GPU doesn't hang, but it's still very annoying ;-)
I am having the same problem too after upgrading to Fedora 19. It most often happens when using Flash (eg. YouTube (in Chrome)) but sometimes happens when i'm doing other things, even just using gnome-shell with no windows open! The whole Xorg display 'freezes' (the video appears stuck and the display will not respond to keyboard/mouse input. Although I can still move the mouse pointer, the display does not respond to mouse clicks. I am unable to switch to a TTY). I know the system hasn't totally crashed, as I can still move the cursor and hear any audio playing normally (this would suggest an Xorg/video driver error). This goes on for around 10 seconds (sometimes longer). My system will usually TOTALLY CRASH - the cursor stops moving, my system totally locks up, and I hear a 1 second loop of any audio playing. I have to force power off my laptop and lose all my work. The last few times this has happened, after 10 seconds or so, my machine has become responsive again, and I can investigate the error (which has lead me here). I wouldn't blame this entirely on flash, however it does happen more often when flash is being used. It's very frustrating when it happens almost once every day and I have to lose all of my work and reboot my laptop. DMESG Shows: ===================================================== [ 1510.714479] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung [ 1510.714496] [drm] capturing error event; look for more information in/sys/kernel/debug/dri/0/i915_error_state [ 2746.409651] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung ===================================================== I get the following in /var/log/messages: ===================================================== Jul 10 02:11:52 cobby-aspire-5750 kernel: [ 1510.714479] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung Jul 10 02:11:52 cobby-aspire-5750 kernel: [ 1510.714496] [drm] capturing error event; look for more information in/sys/kernel/debug/dri/0/i915_error_state Jul 10 02:32:27 cobby-aspire-5750 kernel: [ 2746.409651] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung Jul 10 02:32:27 cobby-aspire-5750 /etc/gdm/Xsession[1110]: Window manager warning: CurrentTime used to choose focus window; focus window may not be correct. Jul 10 02:32:27 cobby-aspire-5750 /etc/gdm/Xsession[1110]: Window manager warning: Got a request to focus 0x1a00007 (Desktop) with a timestamp of 0. This shouldn't happen! ===================================================== My CPU is "Intel(R) Core(TM) i3-2310M CPU @ 2.10GHz" (/proc/cpuinfo) Integrated graphics are Intel HD 3000 Integrated graphics (VGA compatible controller: Intel Corporation 2nd Generation Core Processor Family Integrated Graphics Controller (rev 09)) I will be so glad when Xorg is gone and Flash player has died. I will also attach /sys/kernel/debug/dri/0/i915_error_state
Created attachment 771341 [details] i915_error_state
Created attachment 774483 [details] i915_error_state I'm suffering this error around 6 times per day, my computer is becoming unusable. It happens anytime, I'm using KDE and suddenly the display becames corrupted, with a lot of weird color pixels. The versions I'm using are: kernel-3.9.9-302.fc19.x86_64 xorg-x11-drv-intel-2.21.8-1.fc19.x86_64 My hardware: Intel Core i7-2600K Asus P8Z68-V LE motherboard 00:02.0 VGA compatible controller: Intel Corporation 2nd Generation Core Processor Family Integrated Graphics Controller (rev 09) (prog-if 00 [VGA controller]) Subsystem: ASUSTeK Computer Inc. Device 844d Flags: bus master, fast devsel, latency 0, IRQ 58 Memory at f7800000 (64-bit, non-prefetchable) [size=4M] Memory at e0000000 (64-bit, prefetchable) [size=256M] I/O ports at f000 [size=64] Expansion ROM at <unassigned> [disabled] Capabilities: [90] MSI: Enable+ Count=1/1 Maskable- 64bit- Capabilities: [d0] Power Management version 2 Capabilities: [a4] PCI Advanced Features Kernel driver in use: i915
Same issue here: Jul 26 08:33:53 t520 kernel: [ 1135.615221] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung I tried updating to xorg-x11-drv-intel-2.21.12-1.fc19.x86_64 from updates-testing, no luck, problem still happening. Thinkpad T520 3.9.9-302.fc19.x86_64
Same here. Last lines of dmesg: [15399.626850] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung [15502.735410] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung [15502.738388] [drm:__gen6_gt_force_wake_get] *ERROR* Timed out waiting for forcewake old ack to clear. [15582.825907] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung [17040.485125] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung [18273.874969] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung [18358.971136] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung Occurs mostly when I play games (native and via Wine). After the hang period I can switch to VT and back and the video becomes correct again. Hardware: 00:02.0 VGA compatible controller: Intel Corporation 2nd Generation Core Processor Family Integrated Graphics Controller (rev 09) (prog-if 00 [VGA controller]) Subsystem: Dell Device 0502 Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+ Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Latency: 0 Interrupt: pin A routed to IRQ 42 Region 0: Memory at f6800000 (64-bit, non-prefetchable) [size=4M] Region 2: Memory at e0000000 (64-bit, prefetchable) [size=256M] Region 4: I/O ports at f000 [size=64] Expansion ROM at <unassigned> [disabled] Capabilities: [90] MSI: Enable+ Count=1/1 Maskable- 64bit- Address: fee0f00c Data: 41a1 Capabilities: [d0] Power Management version 2 Flags: PMEClk- DSI+ D1- D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-) Status: D0 NoSoftRst- PME-Enable- DSel=0 DScale=0 PME- Capabilities: [a4] PCI Advanced Features AFCap: TP+ FLR+ AFCtrl: FLR- AFStatus: TP- Kernel driver in use: i915
I am also experiencing this GPU hang problem while playing games and more rarely when watching HD video. I am not having the cursor hang problem others are describing, but the full GPU hang. From journalctl: Jul 30 21:39:32 localhost kernel: [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung Jul 30 21:39:32 localhost kernel: [drm] capturing error event; look for more information in /sys/kernel/debug/dri/0/i915_error_state Versions: kernel.3.10.3-300.fc19.x86_64 xorg-x11-drv-intel.2.21.12-1.fc19.x86_64 Hardware: Dell Inspiron 15R laptop Intel(R) Core(TM) i5-2450M CPU @ 2.50GHz 00:02.0 VGA compatible controller: Intel Corporation 2nd Generation Core Processor Family Integrated Graphics Controller (rev 09) (prog-if 00 [VGA controller]) Subsystem: Dell Device 04b0 Flags: bus master, fast devsel, latency 0, IRQ 51 Memory at f6800000 (64-bit, non-prefetchable) [size=4M] Memory at e0000000 (64-bit, prefetchable) [size=256M] I/O ports at f000 [size=64] Expansion ROM at <unassigned> [disabled] Capabilities: [90] MSI: Enable+ Count=1/1 Maskable- 64bit- Capabilities: [d0] Power Management version 2 Capabilities: [a4] PCI Advanced Features Kernel driver in use: i915
Created attachment 780915 [details] The i915 error state after my most recent GPU hang up
I've been finding that this happens quite reproducibly on my F19 T530 ever since upgrading to kernel 3.10. Any time I suspend, I get a very corrupted screen when resuming, and /var/log/messages will have the GPU hung message in it.
Created attachment 781172 [details] Photo of corruption when GPU hangs
Problem still persists on recent kernel and xorg-x11-drv-intel updates. kernel.x86_64.3.10.4-300.fc19 xorg-x11-drv-intel.x86_64.2.21.12-2.fc19 xorg-x11-server-Xorg.x86_64.1.14.2-9.fc19 Would I be better served taking this to the freedesktop.org bugzilla?
Created attachment 784787 [details] fresh i915 error state
My situation (on fully updated software) continues to be 'interesting' I have (listed in priority of occurrence) 1- very frequent cursor/mouse pointer plane hangs. usually the GPU 'unlocks' itself within a few seconds. 2- 'infrequent' GPU hangs. usually the GPU 'unlocks' itself after about 30 seconds 3- very infrequent GPU hangs while still able to move the cursor/mouse pointer. usually the GPU 'unlocks' itself after about 30 seconds Isn't it time that this got escalated to the Intel guys? It's persisting over several kernel versions Lenovo W520 machine 4284-4MG, 16GB memory
*** Bug 901687 has been marked as a duplicate of this bug. ***
I'm getting this too on my Acer netbook with an i915. I can attach any logs, configs, etc. fc19, kernel 3.10.9-200 x86_64, xorg-x11-drv-intel 2.21.12-2. Usually happens while playing a game and often listening to music. dosbox, foobillard or OpenTTD so far, they freeze, totem's audio loops over a few seconds, I can't get a virtual tty and the machine is unresponsive.
Still happening with latest F19 updates applied on Thinkpad T520 Oct 7 13:57:58 localhost kernel: [10343.338442] [drm:i915_hangcheck_elapsed] *ERROR* stuck on render ring Oct 7 13:57:58 localhost kernel: [10343.338454] [drm] capturing error event; look for more information in /sys/kernel/debug/dri/0/i915_error_state Oct 7 13:57:58 localhost kernel: [10343.347326] [drm:i915_set_reset_status] *ERROR* render ring hung inside bo (0x24e3000 ctx 1) at 0x24e31c8 Linux localhost 3.11.3-201.fc19.x86_64 #1 SMP Thu Oct 3 00:47:03 UTC 2013 x86_64 x86_64 x86_64 GNU/Linux xorg-x11-drv-intel-2.21.12-2.fc19.x86_64 mesa-dri-drivers-9.2-1.20130919.fc19.x86_64 mesa-libGL-9.2-1.20130919.fc19.x86_64 mesa-libwayland-egl-9.2-1.20130919.fc19.x86_64 mesa-libglapi-9.2-1.20130919.fc19.i686 mesa-libEGL-9.2-1.20130919.fc19.x86_64 mesa-filesystem-9.2-1.20130919.fc19.x86_64 mesa-libGLU-9.0.0-2.fc19.x86_64 mesa-libGLU-9.0.0-2.fc19.i686 mesa-libEGL-9.2-1.20130919.fc19.i686 mesa-libgbm-9.2-1.20130919.fc19.i686 mesa-libglapi-9.2-1.20130919.fc19.x86_64 mesa-libxatracker-9.2-1.20130919.fc19.x86_64 mesa-libgbm-9.2-1.20130919.fc19.x86_64 mesa-libGL-9.2-1.20130919.fc19.i686
This message is a reminder that Fedora 18 is nearing its end of life. Approximately 4 (four) weeks from now Fedora will stop maintaining and issuing updates for Fedora 18. It is Fedora's policy to close all bug reports from releases that are no longer maintained. At that time this bug will be closed as WONTFIX if it remains open with a Fedora 'version' of '18'. Package Maintainer: If you wish for this bug to remain open because you plan to fix it in a currently maintained version, simply change the 'version' to a later Fedora version prior to Fedora 18's end of life. Thank you for reporting this issue and we are sorry that we may not be able to fix it before Fedora 18 is end of life. If you would still like to see this bug fixed and are able to reproduce it against a later version of Fedora, you are encouraged change the 'version' to a later Fedora version prior to Fedora 18's end of life. Although we aim to fix as many bugs as possible during every release's lifetime, sometimes those efforts are overtaken by events. Often a more recent Fedora release includes newer upstream software that fixes bugs or makes them obsolete.
Happening on F20 with latest BIOS and Fedora updates (including 3.15.4 kernel).
This bug appears to have been reported against 'rawhide' during the Fedora 23 development cycle. Changing version to '23'. (As we did not run this process for some time, it could affect also pre-Fedora 23 development cycle bugs. We are very sorry. It will help us with cleanup during Fedora 23 End Of Life. Thank you.) More information and reason for this action is here: https://fedoraproject.org/wiki/BugZappers/HouseKeeping/Fedora23
This message is a reminder that Fedora 23 is nearing its end of life. Approximately 4 (four) weeks from now Fedora will stop maintaining and issuing updates for Fedora 23. It is Fedora's policy to close all bug reports from releases that are no longer maintained. At that time this bug will be closed as EOL if it remains open with a Fedora 'version' of '23'. Package Maintainer: If you wish for this bug to remain open because you plan to fix it in a currently maintained version, simply change the 'version' to a later Fedora version. Thank you for reporting this issue and we are sorry that we were not able to fix it before Fedora 23 is end of life. If you would still like to see this bug fixed and are able to reproduce it against a later version of Fedora, you are encouraged change the 'version' to a later Fedora version prior this bug is closed as described in the policy above. Although we aim to fix as many bugs as possible during every release's lifetime, sometimes those efforts are overtaken by events. Often a more recent Fedora release includes newer upstream software that fixes bugs or makes them obsolete.
Fedora 23 changed to end-of-life (EOL) status on 2016-12-20. Fedora 23 is no longer maintained, which means that it will not receive any further security or bug fix updates. As a result we are closing this bug. If you can reproduce this bug against a currently maintained version of Fedora please feel free to reopen this bug against that version. If you are unable to reopen this bug, please file a new report against the current release. If you experience problems, please add a comment to this bug. Thank you for reporting this bug and we are sorry it could not be fixed.