Bug 669382

Summary: KMS: RV610 X crashed with GPU lockup
Product: [Fedora] Fedora Reporter: Tom Horsley <horsley1953>
Component: xorg-x11-drv-atiAssignee: Jérôme Glisse <jglisse>
Status: CLOSED WONTFIX QA Contact: Fedora Extras Quality Assurance <extras-qa>
Severity: high Docs Contact:
Priority: low    
Version: 14CC: xgl-maint
Target Milestone: ---Keywords: Triaged
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2012-08-16 18:26:32 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Attachments:
Description Flags
Xorg.0.log.old from just before the crash none

Description Tom Horsley 2011-01-13 14:32:39 UTC
Description of problem:

I doing nothing more exotic than reading a posting on a message board in
firefox, when suddenly my screen flickered off, came back, then a minute
later flickered off and stayed off, the monitor saying it was getting no signal.

I found this at the end of /var/log/messages:

Jan 13 09:06:07 tomh kernel: [178989.816035] radeon 0000:01:00.0: GPU lockup CP stall for more than 1000msec
Jan 13 09:06:07 tomh kernel: [178989.816040] ------------[ cut here ]------------
Jan 13 09:06:07 tomh kernel: [178989.816082] WARNING: at drivers/gpu/drm/radeon/radeon_fence.c:235 radeon_fence_wait+0x22e/0x2cd [radeon]()
Jan 13 09:06:07 tomh kernel: [178989.816100] Hardware name: OptiPlex 755                 
Jan 13 09:06:07 tomh kernel: [178989.816104] GPU lockup (waiting for 0x001B19D3 last fence id 0x001B19C0)
Jan 13 09:06:07 tomh kernel: [178989.816108] Modules linked in: nfs lockd fscache nfs_acl auth_rpcgss sunrpc cpufreq_ondemand acpi_cpufreq freq_table mperf ipv6 uinput snd_hda_codec_analog snd_hda_intel snd_hda_codec snd_hwdep snd_seq snd_seq_device snd_pcm snd_timer ppdev snd parport_pc parport serio_raw microcode e1000e iTCO_wdt iTCO_vendor_support i2c_i801 soundcore joydev dcdbas snd_page_alloc usb_storage pata_acpi ata_generic radeon ttm drm_kms_helper drm i2c_algo_bit i2c_core [last unloaded: scsi_wait_scan]
Jan 13 09:06:07 tomh kernel: [178989.816181] Pid: 30722, comm: X Not tainted 2.6.35.10-74.fc14.x86_64 #1
Jan 13 09:06:07 tomh kernel: [178989.816185] Call Trace:
Jan 13 09:06:07 tomh kernel: [178989.816197]  [<ffffffff8104d999>] warn_slowpath_common+0x85/0x9d
Jan 13 09:06:07 tomh kernel: [178989.816205]  [<ffffffff8104da54>] warn_slowpath_fmt+0x46/0x48
Jan 13 09:06:07 tomh kernel: [178989.816235]  [<ffffffffa009a8e0>] radeon_fence_wait+0x22e/0x2cd [radeon]
Jan 13 09:06:07 tomh kernel: [178989.816244]  [<ffffffff810665cf>] ? autoremove_wake_function+0x0/0x39
Jan 13 09:06:07 tomh kernel: [178989.816252]  [<ffffffff81468695>] ? _cond_resched+0xe/0x22
Jan 13 09:06:07 tomh kernel: [178989.816289]  [<ffffffffa00aa61a>] radeon_ib_get+0x141/0x221 [radeon]
Jan 13 09:06:07 tomh kernel: [178989.816325]  [<ffffffffa00abad8>] radeon_cs_ioctl+0x96/0x1a6 [radeon]
Jan 13 09:06:07 tomh kernel: [178989.816335]  [<ffffffff8103c15a>] ? should_resched+0xe/0x2e
Jan 13 09:06:07 tomh kernel: [178989.816353]  [<ffffffffa00192b8>] drm_ioctl+0x28b/0x389 [drm]
Jan 13 09:06:07 tomh kernel: [178989.816389]  [<ffffffffa00aba42>] ? radeon_cs_ioctl+0x0/0x1a6 [radeon]
Jan 13 09:06:07 tomh kernel: [178989.816398]  [<ffffffff81116cfe>] ? do_sync_read+0xcb/0x108
Jan 13 09:06:07 tomh kernel: [178989.816405]  [<ffffffff8112419b>] vfs_ioctl+0x36/0xa7
Jan 13 09:06:07 tomh kernel: [178989.816411]  [<ffffffff81124afc>] do_vfs_ioctl+0x468/0x49b
Jan 13 09:06:07 tomh kernel: [178989.816420]  [<ffffffff811dc621>] ? security_file_permission+0x16/0x18
Jan 13 09:06:07 tomh kernel: [178989.816427]  [<ffffffff81124b85>] sys_ioctl+0x56/0x79
Jan 13 09:06:07 tomh kernel: [178989.816433]  [<ffffffff81117482>] ? sys_read+0x61/0x6e
Jan 13 09:06:07 tomh kernel: [178989.816440]  [<ffffffff81009cf2>] system_call_fastpath+0x16/0x1b
Jan 13 09:06:07 tomh kernel: [178989.816446] ---[ end trace a53d6875cd136db9 ]---
Jan 13 09:06:07 tomh kernel: [178989.816456] [drm] Disabling audio support
Jan 13 09:06:07 tomh kernel: [178989.817566] radeon 0000:01:00.0: GPU softreset 
Jan 13 09:06:07 tomh kernel: [178989.817570] radeon 0000:01:00.0:   R_008010_GRBM_STATUS=0xA0003030
Jan 13 09:06:07 tomh kernel: [178989.817575] radeon 0000:01:00.0:   R_008014_GRBM_STATUS2=0x00000003
Jan 13 09:06:07 tomh kernel: [178989.817579] radeon 0000:01:00.0:   R_000E50_SRBM_STATUS=0x200010C0
Jan 13 09:06:07 tomh kernel: [178989.817590] radeon 0000:01:00.0:   R_008020_GRBM_SOFT_RESET=0x00007FEE
Jan 13 09:06:07 tomh kernel: [178989.832600] radeon 0000:01:00.0: R_008020_GRBM_SOFT_RESET=0x00000001
Jan 13 09:06:07 tomh kernel: [178989.848609] radeon 0000:01:00.0:   R_008010_GRBM_STATUS=0x00003030
Jan 13 09:06:07 tomh kernel: [178989.848613] radeon 0000:01:00.0:   R_008014_GRBM_STATUS2=0x00000003
Jan 13 09:06:07 tomh kernel: [178989.848618] radeon 0000:01:00.0:   R_000E50_SRBM_STATUS=0x200000C0
Jan 13 09:06:07 tomh kernel: [178989.849623] radeon 0000:01:00.0: GPU reset succeed
Jan 13 09:06:07 tomh kernel: [178989.852288] [drm] Clocks initialized !
Jan 13 09:06:07 tomh kernel: [178989.886171] [drm] ring test succeeded in 1 usecs
Jan 13 09:06:07 tomh kernel: [178989.886182] [drm] ib test succeeded in 1 usecs
Jan 13 09:09:16 tomh kernel: [178989.886187] [drm] Enabling audio support
Jan 13 09:09:16 tomh kernel: [179179.173036] radeon 0000:01:00.0: GPU lockup CP stall for more than 1000msec
Jan 13 09:09:16 tomh kernel: [179179.173042] ------------[ cut here ]------------
Jan 13 09:09:16 tomh kernel: [179179.173086] WARNING: at drivers/gpu/drm/radeon/radeon_fence.c:235 radeon_fence_wait+0x22e/0x2cd [radeon]()
Jan 13 09:09:16 tomh kernel: [179179.173105] Hardware name: OptiPlex 755                 
Jan 13 09:09:16 tomh kernel: [179179.173109] GPU lockup (waiting for 0x001B785C last fence id 0x001B785A)
Jan 13 09:09:16 tomh kernel: [179179.173113] Modules linked in: nfs lockd fscache nfs_acl auth_rpcgss sunrpc cpufreq_ondemand acpi_cpufreq freq_table mperf ipv6 uinput snd_hda_codec_analog snd_hda_intel snd_hda_codec snd_hwdep snd_seq snd_seq_device snd_pcm snd_timer ppdev snd parport_pc parport serio_raw microcode e1000e iTCO_wdt iTCO_vendor_support i2c_i801 soundcore joydev dcdbas snd_page_alloc usb_storage pata_acpi ata_generic radeon ttm drm_kms_helper drm i2c_algo_bit i2c_core [last unloaded: scsi_wait_scan]
Jan 13 09:09:16 tomh kernel: [179179.173185] Pid: 30722, comm: X Tainted: G        W   2.6.35.10-74.fc14.x86_64 #1
Jan 13 09:09:16 tomh kernel: [179179.173190] Call Trace:
Jan 13 09:09:16 tomh kernel: [179179.173202]  [<ffffffff8104d999>] warn_slowpath_common+0x85/0x9d
Jan 13 09:09:16 tomh kernel: [179179.173209]  [<ffffffff8104da54>] warn_slowpath_fmt+0x46/0x48
Jan 13 09:09:16 tomh kernel: [179179.173240]  [<ffffffffa009a8e0>] radeon_fence_wait+0x22e/0x2cd [radeon]
Jan 13 09:09:16 tomh kernel: [179179.173249]  [<ffffffff810665cf>] ? autoremove_wake_function+0x0/0x39
Jan 13 09:09:16 tomh kernel: [179179.173280]  [<ffffffffa009b0f0>] radeon_sync_obj_wait+0x11/0x13 [radeon]
Jan 13 09:09:16 tomh kernel: [179179.173292]  [<ffffffffa0061839>] ttm_bo_wait+0xa5/0x15c [ttm]
Jan 13 09:09:16 tomh kernel: [179179.173301]  [<ffffffff81469d93>] ? _raw_write_unlock_irqrestore+0x1a/0x1d
Jan 13 09:09:16 tomh kernel: [179179.173314]  [<ffffffffa00641f7>] ttm_bo_move_accel_cleanup+0x8d/0x2aa [ttm]
Jan 13 09:09:16 tomh kernel: [179179.173351]  [<ffffffffa00c0e01>] ? r600_copy_blit+0x30/0xaa [radeon]
Jan 13 09:09:16 tomh kernel: [179179.173382]  [<ffffffffa009b22b>] radeon_move_blit+0x129/0x147 [radeon]
Jan 13 09:09:16 tomh kernel: [179179.173412]  [<ffffffffa009b6d5>] radeon_bo_move+0x102/0x12a [radeon]
Jan 13 09:09:16 tomh kernel: [179179.173425]  [<ffffffffa0062a90>] ttm_bo_handle_move_mem+0x171/0x267 [ttm]
Jan 13 09:09:16 tomh kernel: [179179.173437]  [<ffffffffa0063043>] ttm_bo_evict+0x2d0/0x342 [ttm]
Jan 13 09:09:16 tomh kernel: [179179.173458]  [<ffffffffa00200a7>] ? drm_class_suspend+0x2f/0x42 [drm]
Jan 13 09:09:16 tomh kernel: [179179.173466]  [<ffffffff81469bf6>] ? _raw_spin_lock+0xe/0x10
Jan 13 09:09:16 tomh kernel: [179179.173478]  [<ffffffffa00631bb>] ttm_mem_evict_first+0x106/0x134 [ttm]
Jan 13 09:09:16 tomh kernel: [179179.173490]  [<ffffffffa0063980>] ttm_bo_mem_space+0x39f/0x42f [ttm]
Jan 13 09:09:16 tomh kernel: [179179.173504]  [<ffffffffa0063ac2>] ttm_bo_move_buffer+0xb2/0x123 [ttm]
Jan 13 09:09:16 tomh kernel: [179179.173523]  [<ffffffffa00200a7>] ? drm_class_suspend+0x2f/0x42 [drm]
Jan 13 09:09:16 tomh kernel: [179179.173535]  [<ffffffffa0063bec>] ttm_bo_validate+0xb9/0xf6 [ttm]
Jan 13 09:09:16 tomh kernel: [179179.173547]  [<ffffffffa0063efb>] ttm_bo_init+0x2d2/0x30b [ttm]
Jan 13 09:09:16 tomh kernel: [179179.173578]  [<ffffffffa009c068>] radeon_bo_create+0x172/0x23b [radeon]
Jan 13 09:09:16 tomh kernel: [179179.173608]  [<ffffffffa009bd92>] ? radeon_ttm_bo_destroy+0x0/0x69 [radeon]
Jan 13 09:09:16 tomh kernel: [179179.173645]  [<ffffffffa00a9c17>] radeon_gem_object_create+0x7d/0xda [radeon]
Jan 13 09:09:16 tomh kernel: [179179.173682]  [<ffffffffa00a9eac>] radeon_gem_create_ioctl+0x4d/0xab [radeon]
Jan 13 09:09:16 tomh kernel: [179179.173690]  [<ffffffff8103c142>] ? need_resched+0x23/0x2d
Jan 13 09:09:16 tomh kernel: [179179.173697]  [<ffffffff8103c15a>] ? should_resched+0xe/0x2e
Jan 13 09:09:16 tomh kernel: [179179.173714]  [<ffffffffa00192b8>] drm_ioctl+0x28b/0x389 [drm]
Jan 13 09:09:16 tomh kernel: [179179.173721]  [<ffffffff81125e23>] ? core_sys_select+0x1ce/0x213
Jan 13 09:09:16 tomh kernel: [179179.173757]  [<ffffffffa00a9e5f>] ? radeon_gem_create_ioctl+0x0/0xab [radeon]
Jan 13 09:09:16 tomh kernel: [179179.173766]  [<ffffffff81010207>] ? paravirt_read_tsc+0x9/0xd
Jan 13 09:09:16 tomh kernel: [179179.173773]  [<ffffffff81469c5f>] ? _raw_spin_unlock_irqrestore+0x17/0x19
Jan 13 09:09:16 tomh kernel: [179179.173780]  [<ffffffff8112419b>] vfs_ioctl+0x36/0xa7
Jan 13 09:09:16 tomh kernel: [179179.173786]  [<ffffffff81124afc>] do_vfs_ioctl+0x468/0x49b
Jan 13 09:09:16 tomh kernel: [179179.173794]  [<ffffffff81052209>] ? do_setitimer+0x19a/0x1c4
Jan 13 09:09:16 tomh kernel: [179179.173800]  [<ffffffff81124b85>] sys_ioctl+0x56/0x79
Jan 13 09:09:16 tomh kernel: [179179.173806]  [<ffffffff81009cf2>] system_call_fastpath+0x16/0x1b
Jan 13 09:09:16 tomh kernel: [179179.173812] ---[ end trace a53d6875cd136dba ]---
Jan 13 09:09:16 tomh kernel: [179179.173822] [drm] Disabling audio support
Jan 13 09:09:16 tomh kernel: [179179.174836] radeon 0000:01:00.0: ffff880037874800 unpin not necessary
Jan 13 09:09:16 tomh kernel: [179179.174910] radeon 0000:01:00.0: GPU softreset 
Jan 13 09:09:16 tomh kernel: [179179.174913] radeon 0000:01:00.0:   R_008010_GRBM_STATUS=0xA0003030
Jan 13 09:09:16 tomh kernel: [179179.174916] radeon 0000:01:00.0:   R_008014_GRBM_STATUS2=0x00000003
Jan 13 09:09:16 tomh kernel: [179179.174919] radeon 0000:01:00.0:   R_000E50_SRBM_STATUS=0x200000C0
Jan 13 09:09:16 tomh kernel: [179179.174927] radeon 0000:01:00.0:   R_008020_GRBM_SOFT_RESET=0x00007FEE
Jan 13 09:09:16 tomh kernel: [179179.189933] radeon 0000:01:00.0: R_008020_GRBM_SOFT_RESET=0x00000001
Jan 13 09:09:16 tomh kernel: [179179.205942] radeon 0000:01:00.0:   R_008010_GRBM_STATUS=0x00003030
Jan 13 09:09:16 tomh kernel: [179179.205947] radeon 0000:01:00.0:   R_008014_GRBM_STATUS2=0x00000003
Jan 13 09:09:16 tomh kernel: [179179.205951] radeon 0000:01:00.0:   R_000E50_SRBM_STATUS=0x200000C0
Jan 13 09:09:16 tomh kernel: [179179.206956] radeon 0000:01:00.0: GPU reset succeed
Jan 13 09:09:16 tomh kernel: [179179.209486] [drm] Clocks initialized !
Jan 13 09:09:16 tomh kernel: [179179.243435] [drm] ring test succeeded in 1 usecs
Jan 13 09:09:16 tomh kernel: [179179.243450] [drm] ib test succeeded in 1 usecs
Jan 13 09:10:16 tomh init: tty (/dev/tty2) main process (1878) killed by TERM signal
Jan 13 09:10:16 tomh init: tty (/dev/tty3) main process (1881) killed by TERM signal
Jan 13 09:10:16 tomh init: tty (/dev/tty4) main process (1883) killed by TERM signal
Jan 13 09:10:16 tomh init: tty (/dev/tty5) main process (1885) killed by TERM signal
Jan 13 09:10:16 tomh init: tty (/dev/tty6) main process (1887) killed by TERM signal
Jan 13 09:10:16 tomh init: system-setup-keyboard main process (851) killed by TERM signal
Jan 13 09:10:21 tomh init: prefdm main process (1874) killed by KILL signal

The video card is:

01:00.0 VGA compatible controller: ATI Technologies Inc RV610 video device [Radeon HD 2400 PRO]

I have no xorg.conf file, I'm using the server defaults.

Version-Release number of selected component (if applicable):
xorg-x11-drv-ati-6.13.1-0.3.20100705git37b348059.fc14.x86_64
mesa-libGL-7.9-5.fc14.i686
mesa-dri-drivers-7.9-5.fc14.x86_64
mesa-libGL-7.9-5.fc14.x86_64
mesa-libGLU-devel-7.9-5.fc14.x86_64
mesa-libGL-devel-7.9-5.fc14.x86_64
mesa-libGLU-7.9-5.fc14.i686
mesa-libGLU-7.9-5.fc14.x86_64
mesa-dri-drivers-7.9-5.fc14.i686

How reproducible:

Been running f14 since it was released, and this is the first time I've had
a problem on this machine (which I use all day at work every day).

Steps to Reproduce:
1. use machine, suddenly it happens
  
Actual results:
crash

Expected results:
not crash

Additional info:
I'll attach the xorg log (there was nothing in it about the crash though).

Comment 1 Tom Horsley 2011-01-13 14:33:34 UTC
Created attachment 473336 [details]
Xorg.0.log.old from just before the crash

Comment 2 Fedora End Of Life 2012-08-16 18:26:35 UTC
This message is a notice that Fedora 14 is now at end of life. Fedora 
has stopped maintaining and issuing updates for Fedora 14. It is 
Fedora's policy to close all bug reports from releases that are no 
longer maintained.  At this time, all open bugs with a Fedora 'version'
of '14' have been closed as WONTFIX.

(Please note: Our normal process is to give advanced warning of this 
occurring, but we forgot to do that. A thousand apologies.)

Package Maintainer: If you wish for this bug to remain open because you
plan to fix it in a currently maintained version, feel free to reopen 
this bug and simply change the 'version' to a later Fedora version.

Bug Reporter: Thank you for reporting this issue and we are sorry that 
we were unable to fix it before Fedora 14 reached end of life. If you 
would still like to see this bug fixed and are able to reproduce it 
against a later version of Fedora, you are encouraged to click on 
"Clone This Bug" (top right of this page) and open it against that 
version of Fedora.

Although we aim to fix as many bugs as possible during every release's 
lifetime, sometimes those efforts are overtaken by events.  Often a 
more recent Fedora release includes newer upstream software that fixes 
bugs or makes them obsolete.

The process we are following is described here: 
http://fedoraproject.org/wiki/BugZappers/HouseKeeping