Bug 673740 - RS480:X200M GPU lockup (couldn't schedule IB)
Summary: RS480:X200M GPU lockup (couldn't schedule IB)
Keywords:
Status: CLOSED WONTFIX
Alias: None
Product: Fedora
Classification: Fedora
Component: xorg-x11-drv-ati
Version: 14
Hardware: x86_64
OS: Linux
Priority: unspecified
Severity: high
Target Milestone: ---
Assignee: Jérôme Glisse
QA Contact: Fedora Extras Quality Assurance
URL:
Whiteboard:
Depends On:
Blocks:
Reported: 2011-01-30 10:08 UTC by Joshua Covington
Modified: 2012-08-16 13:31 UTC (History)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed: 2012-08-16 13:31:44 UTC
Type: ---


Attachments
messages - Lockup back-traces (122.94 KB, text/plain)
2011-01-30 10:08 UTC, Joshua Covington
Xorg.0.log (44.06 KB, text/plain)
2011-01-30 10:09 UTC, Joshua Covington
installed packages (380 bytes, text/plain)
2011-01-30 10:10 UTC, Joshua Covington
device-id (1.02 KB, text/plain)
2011-01-30 10:10 UTC, Joshua Covington

Description Joshua Covington 2011-01-30 10:08:47 UTC
Created attachment 475997 [details]
messages - Lockup back-traces

Description of problem:

After the desktop has been left idle for some time, the xorg-x11-drv-ati driver locks up and crashes. Restarting the X server does not help. The log is full of

[drm:radeon_ib_schedule] *ERROR* radeon: couldn't schedule IB

messages.
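
To get a rough measure of how often this message appears, the log can be scanned with a short script. The following is only a sketch: the function name count_ib_errors is made up for illustration, and the optional "(N)" suffix is assumed from the form the message takes in the kernel log excerpt later in this report.

```python
import re
from collections import Counter

# Matches the kernel message quoted above. Some log lines carry an IB index,
# e.g. "couldn't schedule IB(6).", so the "(N)" part is optional.
IB_ERROR = re.compile(
    r"\[drm:radeon_ib_schedule\] \*ERROR\* radeon: couldn't schedule IB(?:\((\d+)\))?"
)


def count_ib_errors(lines):
    """Return (total, per_index) for "couldn't schedule IB" messages.

    per_index maps the logged IB index (as a string) to its count;
    the key is None for lines that carry no index.
    """
    per_index = Counter()
    total = 0
    for line in lines:
        match = IB_ERROR.search(line)
        if match:
            total += 1
            per_index[match.group(1)] += 1
    return total, per_index
```

The function accepts any iterable of lines, so an open file handle for the attached messages file can be passed in directly.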


Version-Release number of selected component (if applicable):

kernel-headers-2.6.35.10-74.fc14.x86_64
kernel-2.6.35.10-74.fc14.x86_64
xorg-x11-drv-ati-6.13.1-0.4.20100705git37b348059.fc14.x86_64
xorg-x11-server-utils-7.4-19.fc14.x86_64
xorg-x11-server-common-1.9.3-4.fc14.x86_64
xorg-x11-server-Xorg-1.9.3-4.fc14.x86_64
libdrm-2.4.22-1.fc14.x86_64
mesa-libGL-7.9-5.fc14.x86_64
mesa-libGLU-7.9-5.fc14.x86_64
mesa-dri-drivers-7.9-5.fc14.x86_64

The IGP is:

01:05.0 VGA compatible controller: ATI Technologies Inc RS482 [Radeon Xpress 200M] (prog-if 00 [VGA controller])
        Subsystem: Acer Incorporated [ALI] Device 010f
        Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
        Status: Cap+ 66MHz+ UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
        Latency: 66 (2000ns min), Cache Line Size: 32 bytes
        Interrupt: pin A routed to IRQ 17
        Region 0: Memory at c8000000 (32-bit, prefetchable) [size=128M]
        Region 1: I/O ports at 9000 [size=256]
        Region 2: Memory at c0100000 (32-bit, non-prefetchable) [size=64K]
        [virtual] Expansion ROM at c0120000 [disabled] [size=128K]
        Capabilities: [50] Power Management version 2
                Flags: PMEClk- DSI- D1+ D2+ AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-)
                Status: D0 NoSoftRst- PME-Enable- DSel=0 DScale=0 PME-
        Kernel driver in use: radeon
        Kernel modules: radeon, radeonfb



How reproducible:

Leave the desktop idle and wait for the screen to be locked. I can't find any specific pattern that triggers this crash.


Steps to Reproduce:
1. See above.
  
Actual results:

The driver locks up and only a hardware restart fixes it.


Expected results:
It shouldn't lock up on this old IGP.

Additional info:

I've also attached the messages file. It shows three driver back-traces and the error message. Xorg.0.log is also attached.

Comment 1 Joshua Covington 2011-01-30 10:09:36 UTC
Created attachment 475998 [details]
Xorg.0.log

Comment 2 Joshua Covington 2011-01-30 10:10:13 UTC
Created attachment 475999 [details]
installed packages

Comment 3 Joshua Covington 2011-01-30 10:10:35 UTC
Created attachment 476000 [details]
device-id

Comment 4 Joshua Covington 2011-01-30 10:18:32 UTC
Excerpt of the back-traces:

[ 3663.897081] radeon 0000:01:05.0: GPU lockup CP stall for more than 1020msec
[ 3663.897096] ------------[ cut here ]------------
[ 3663.897233] WARNING: at drivers/gpu/drm/radeon/radeon_fence.c:235 radeon_fence_wait+0x22e/0x2cd [radeon]()
[ 3663.897242] Hardware name: YYpire 3050     
[ 3663.897249] GPU lockup (waiting for 0x0003B0BD last fence id 0x0003B0BC)
[ 3663.897255] Modules linked in: cryptd aes_x86_64 aes_generic cpufreq_ondemand powernow_k8 freq_table mperf ip6t_REJECT nf_conntrack_ipv6 ip6table_filter ip6_tables ipv6 fuse snd_hda_codec_realtek snd_hda_intel arc4 usblp snd_hda_codec snd_hwdep ecb snd_seq ath5k mac80211 snd_seq_device 8139too ath cfg80211 snd_pcm 8139cp rfkill mii snd_timer snd soundcore k8temp edac_core i2c_piix4 shpchp snd_page_alloc edac_mce_amd serio_raw wmi joydev microcode sdhci_pci ata_generic pata_acpi sdhci mmc_core yenta_socket pata_atiixp sata_sil usb_storage radeon ttm drm_kms_helper drm i2c_algo_bit i2c_core [last unloaded: scsi_wait_scan]
[ 3663.897369] Pid: 1338, comm: X Not tainted 2.6.35.10-74.fc14.x86_64 #1
[ 3663.897376] Call Trace:
[ 3663.897400]  [<ffffffff8104d999>] warn_slowpath_common+0x85/0x9d
[ 3663.897413]  [<ffffffff8104da54>] warn_slowpath_fmt+0x46/0x48
[ 3663.897466]  [<ffffffffa009a8e0>] radeon_fence_wait+0x22e/0x2cd [radeon]
[ 3663.897481]  [<ffffffff810665cf>] ? autoremove_wake_function+0x0/0x39
[ 3663.897535]  [<ffffffffa009b0f0>] radeon_sync_obj_wait+0x11/0x13 [radeon]
[ 3663.897565]  [<ffffffffa0061839>] ttm_bo_wait+0xa5/0x15c [ttm]
[ 3663.897626]  [<ffffffffa00a9aaa>] radeon_bo_wait+0xb6/0xd7 [radeon]
[ 3663.897687]  [<ffffffffa00aa068>] radeon_gem_wait_idle_ioctl+0x40/0x77 [radeon]
[ 3663.897741]  [<ffffffffa00192b8>] drm_ioctl+0x28b/0x389 [drm]
[ 3663.897803]  [<ffffffffa00aa028>] ? radeon_gem_wait_idle_ioctl+0x0/0x77 [radeon]
[ 3663.897820]  [<ffffffff811e3718>] ? inode_has_perm.clone.20+0x79/0x8f
[ 3663.897835]  [<ffffffff8103c142>] ? need_resched+0x23/0x2d
[ 3663.897846]  [<ffffffff8103c15a>] ? should_resched+0xe/0x2e
[ 3663.897859]  [<ffffffff81468695>] ? _cond_resched+0xe/0x22
[ 3663.897869]  [<ffffffff8103c142>] ? need_resched+0x23/0x2d
[ 3663.897879]  [<ffffffff8103c15a>] ? should_resched+0xe/0x2e
[ 3663.897894]  [<ffffffff8105c755>] ? recalc_sigpending+0x7c/0x81
[ 3663.897905]  [<ffffffff81469c46>] ? _raw_spin_lock_irq+0x1f/0x21
[ 3663.897916]  [<ffffffff8112419b>] vfs_ioctl+0x36/0xa7
[ 3663.897925]  [<ffffffff81124afc>] do_vfs_ioctl+0x468/0x49b
[ 3663.897938]  [<ffffffff81099cd2>] ? audit_syscall_exit+0x130/0x14c
[ 3663.897947]  [<ffffffff81124b85>] sys_ioctl+0x56/0x79
[ 3663.897958]  [<ffffffff81009fa5>] ? int_check_syscall_exit_work+0x34/0x3d
[ 3663.897968]  [<ffffffff81009cf2>] system_call_fastpath+0x16/0x1b
[ 3663.897977] ---[ end trace cb28df46bbf5a566 ]---
[ 3664.055221] Failed to wait GUI idle while programming pipes. Bad things might happen.
[ 3664.056239] radeon 0000:01:05.0: (r300_asic_reset:415) RBBM_STATUS=0x80010140
[ 3664.561476] radeon 0000:01:05.0: (r300_asic_reset:434) RBBM_STATUS=0x80010140
[ 3665.062688] radeon 0000:01:05.0: (r300_asic_reset:446) RBBM_STATUS=0x00000140
[ 3665.062729] radeon 0000:01:05.0: GPU reset succeed
[ 3665.062735] radeon 0000:01:05.0: GPU reset succeed
[ 3665.073094] [drm] radeon: 2 quad pipes, 1 z pipes initialized.
[ 3665.074413] [drm] radeon: ring at 0x00000000A6000000
[ 3665.074484] [drm] ring test succeeded in 0 usecs
[ 3665.074509] [drm] ib test succeeded in 1 usecs
[ 3666.250335] radeon 0000:01:05.0: GPU lockup CP stall for more than 1020msec
[ 3666.250350] ------------[ cut here ]------------
[ 3666.250475] WARNING: at drivers/gpu/drm/radeon/radeon_fence.c:235 radeon_fence_wait+0x22e/0x2cd [radeon]()
[ 3666.250484] Hardware name: YYpire 3050     
[ 3666.250490] GPU lockup (waiting for 0x0003B0C0 last fence id 0x0003B0BF)
[ 3666.250495] Modules linked in: cryptd aes_x86_64 aes_generic cpufreq_ondemand powernow_k8 freq_table mperf ip6t_REJECT nf_conntrack_ipv6 ip6table_filter ip6_tables ipv6 fuse snd_hda_codec_realtek snd_hda_intel arc4 usblp snd_hda_codec snd_hwdep ecb snd_seq ath5k mac80211 snd_seq_device 8139too ath cfg80211 snd_pcm 8139cp rfkill mii snd_timer snd soundcore k8temp edac_core i2c_piix4 shpchp snd_page_alloc edac_mce_amd serio_raw wmi joydev microcode sdhci_pci ata_generic pata_acpi sdhci mmc_core yenta_socket pata_atiixp sata_sil usb_storage radeon ttm drm_kms_helper drm i2c_algo_bit i2c_core [last unloaded: scsi_wait_scan]
[ 3666.250606] Pid: 1338, comm: X Tainted: G        W   2.6.35.10-74.fc14.x86_64 #1
[ 3666.250613] Call Trace:
[ 3666.250636]  [<ffffffff8104d999>] warn_slowpath_common+0x85/0x9d
[ 3666.250647]  [<ffffffff8104da54>] warn_slowpath_fmt+0x46/0x48
[ 3666.250701]  [<ffffffffa009a8e0>] radeon_fence_wait+0x22e/0x2cd [radeon]
[ 3666.250716]  [<ffffffff810665cf>] ? autoremove_wake_function+0x0/0x39
[ 3666.250770]  [<ffffffffa009b0f0>] radeon_sync_obj_wait+0x11/0x13 [radeon]
[ 3666.250801]  [<ffffffffa0061839>] ttm_bo_wait+0xa5/0x15c [ttm]
[ 3666.250862]  [<ffffffffa00a9aaa>] radeon_bo_wait+0xb6/0xd7 [radeon]
[ 3666.250923]  [<ffffffffa00aa068>] radeon_gem_wait_idle_ioctl+0x40/0x77 [radeon]
[ 3666.250983]  [<ffffffffa00192b8>] drm_ioctl+0x28b/0x389 [drm]
[ 3666.251044]  [<ffffffffa00aa028>] ? radeon_gem_wait_idle_ioctl+0x0/0x77 [radeon]
[ 3666.251059]  [<ffffffff811e3718>] ? inode_has_perm.clone.20+0x79/0x8f
[ 3666.251075]  [<ffffffff8103c142>] ? need_resched+0x23/0x2d
[ 3666.251085]  [<ffffffff8103c15a>] ? should_resched+0xe/0x2e
[ 3666.251097]  [<ffffffff81468695>] ? _cond_resched+0xe/0x22
[ 3666.251106]  [<ffffffff8103c142>] ? need_resched+0x23/0x2d
[ 3666.251116]  [<ffffffff8103c15a>] ? should_resched+0xe/0x2e
[ 3666.251131]  [<ffffffff8105c755>] ? recalc_sigpending+0x7c/0x81
[ 3666.251141]  [<ffffffff81469c46>] ? _raw_spin_lock_irq+0x1f/0x21
[ 3666.251153]  [<ffffffff8112419b>] vfs_ioctl+0x36/0xa7
[ 3666.251161]  [<ffffffff81124afc>] do_vfs_ioctl+0x468/0x49b
[ 3666.251173]  [<ffffffff81099cd2>] ? audit_syscall_exit+0x130/0x14c
[ 3666.251182]  [<ffffffff81124b85>] sys_ioctl+0x56/0x79
[ 3666.251193]  [<ffffffff81009fa5>] ? int_check_syscall_exit_work+0x34/0x3d
[ 3666.251203]  [<ffffffff81009cf2>] system_call_fastpath+0x16/0x1b
[ 3666.251212] ---[ end trace cb28df46bbf5a567 ]---
[ 3666.395890] Failed to wait GUI idle while programming pipes. Bad things might happen.
[ 3666.396899] radeon 0000:01:05.0: (r300_asic_reset:415) RBBM_STATUS=0x80010140
[ 3666.901982] radeon 0000:01:05.0: (r300_asic_reset:434) RBBM_STATUS=0x80010140
[ 3667.403060] radeon 0000:01:05.0: (r300_asic_reset:446) RBBM_STATUS=0x00000140
[ 3667.403079] radeon 0000:01:05.0: GPU reset succeed
[ 3667.403082] radeon 0000:01:05.0: GPU reset succeed
[ 3667.413433] [drm] radeon: 2 quad pipes, 1 z pipes initialized.
[ 3667.413958] [drm] radeon: ring at 0x00000000A6000000
[ 3667.413999] [drm] ring test succeeded in 0 usecs
[ 3667.414027] [drm] ib test succeeded in 1 usecs
[ 3667.549301] radeon 0000:01:05.0: GPU lockup CP stall for more than 2319msec
[ 3667.549308] ------------[ cut here ]------------
[ 3667.549359] WARNING: at drivers/gpu/drm/radeon/radeon_fence.c:235 radeon_fence_wait+0x22e/0x2cd [radeon]()
[ 3667.549363] Hardware name: YYpire 3050     
[ 3667.549366] GPU lockup (waiting for 0x0003B0C3 last fence id 0x0003B0C2)
[ 3667.549369] Modules linked in: cryptd aes_x86_64 aes_generic cpufreq_ondemand powernow_k8 freq_table mperf ip6t_REJECT nf_conntrack_ipv6 ip6table_filter ip6_tables ipv6 fuse snd_hda_codec_realtek snd_hda_intel arc4 usblp snd_hda_codec snd_hwdep ecb snd_seq ath5k mac80211 snd_seq_device 8139too ath cfg80211 snd_pcm 8139cp rfkill mii snd_timer snd soundcore k8temp edac_core i2c_piix4 shpchp snd_page_alloc edac_mce_amd serio_raw wmi joydev microcode sdhci_pci ata_generic pata_acpi sdhci mmc_core yenta_socket pata_atiixp sata_sil usb_storage radeon ttm drm_kms_helper drm i2c_algo_bit i2c_core [last unloaded: scsi_wait_scan]
[ 3667.549422] Pid: 1338, comm: X Tainted: G        W   2.6.35.10-74.fc14.x86_64 #1
[ 3667.549426] Call Trace:
[ 3667.549436]  [<ffffffff8104d999>] warn_slowpath_common+0x85/0x9d
[ 3667.549442]  [<ffffffff8104da54>] warn_slowpath_fmt+0x46/0x48
[ 3667.549468]  [<ffffffffa009a8e0>] radeon_fence_wait+0x22e/0x2cd [radeon]
[ 3667.549475]  [<ffffffff810665cf>] ? autoremove_wake_function+0x0/0x39
[ 3667.549501]  [<ffffffffa009b0f0>] radeon_sync_obj_wait+0x11/0x13 [radeon]
[ 3667.549515]  [<ffffffffa0061839>] ttm_bo_wait+0xa5/0x15c [ttm]
[ 3667.549545]  [<ffffffffa00a9aaa>] radeon_bo_wait+0xb6/0xd7 [radeon]
[ 3667.549575]  [<ffffffffa00aa068>] radeon_gem_wait_idle_ioctl+0x40/0x77 [radeon]
[ 3667.549599]  [<ffffffffa00192b8>] drm_ioctl+0x28b/0x389 [drm]
[ 3667.549630]  [<ffffffffa00aa028>] ? radeon_gem_wait_idle_ioctl+0x0/0x77 [radeon]
[ 3667.549638]  [<ffffffff811e3718>] ? inode_has_perm.clone.20+0x79/0x8f
[ 3667.549645]  [<ffffffff8103c142>] ? need_resched+0x23/0x2d
[ 3667.549650]  [<ffffffff8103c15a>] ? should_resched+0xe/0x2e
[ 3667.549655]  [<ffffffff81468695>] ? _cond_resched+0xe/0x22
[ 3667.549660]  [<ffffffff8103c142>] ? need_resched+0x23/0x2d
[ 3667.549665]  [<ffffffff8103c15a>] ? should_resched+0xe/0x2e
[ 3667.549672]  [<ffffffff8105c755>] ? recalc_sigpending+0x7c/0x81
[ 3667.549677]  [<ffffffff81469c46>] ? _raw_spin_lock_irq+0x1f/0x21
[ 3667.549682]  [<ffffffff8112419b>] vfs_ioctl+0x36/0xa7
[ 3667.549687]  [<ffffffff81124afc>] do_vfs_ioctl+0x468/0x49b
[ 3667.549692]  [<ffffffff81099cd2>] ? audit_syscall_exit+0x130/0x14c
[ 3667.549697]  [<ffffffff81124b85>] sys_ioctl+0x56/0x79
[ 3667.549702]  [<ffffffff81009fa5>] ? int_check_syscall_exit_work+0x34/0x3d
[ 3667.549707]  [<ffffffff81009cf2>] system_call_fastpath+0x16/0x1b
[ 3667.549711] ---[ end trace cb28df46bbf5a568 ]---
[ 3667.698200] Failed to wait GUI idle while programming pipes. Bad things might happen.
[ 3667.699209] radeon 0000:01:05.0: (r300_asic_reset:415) RBBM_STATUS=0x84110140
[ 3668.204374] radeon 0000:01:05.0: (r300_asic_reset:434) RBBM_STATUS=0x84010140
[ 3668.705441] radeon 0000:01:05.0: (r300_asic_reset:446) RBBM_STATUS=0x84000140
[ 3668.705460] radeon 0000:01:05.0: failed to reset GPU
[ 3668.705463] radeon 0000:01:05.0: GPU reset failed
[ 3668.705937] [drm:radeon_ib_schedule] *ERROR* radeon: couldn't schedule IB(6).
[ 3668.705943] [drm:radeon_cs_ioctl] *ERROR* Faild to schedule IB !
[ 3668.706471] [drm:radeon_ib_schedule] *ERROR* radeon: couldn't schedule IB(7).
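
The excerpt above contains three lockup/reset cycles. In each one the awaited fence is exactly one ahead of the last fence id logged, and the third reset ends with RBBM_STATUS=0x84000140 instead of the 0x00000140 seen after the two successful resets. A small parser can pull those values out of a longer log for comparison. This is a sketch only: summarize_lockups and changed_bits are hypothetical helper names, the regular expressions are taken from the message text quoted here, and the meaning of individual RBBM_STATUS bits is not stated in this report, so the code only shows which bits differ.

```python
import re

# Patterns copied from the kernel messages quoted in this comment.
FENCE = re.compile(
    r"GPU lockup \(waiting for (0x[0-9A-Fa-f]+) last fence id (0x[0-9A-Fa-f]+)\)"
)
STATUS = re.compile(r"\(r300_asic_reset:\d+\) RBBM_STATUS=(0x[0-9A-Fa-f]+)")
OUTCOME = re.compile(r"GPU reset (succeed|failed)")


def summarize_lockups(lines):
    """Build one record per 'GPU lockup (waiting for ...)' line.

    RBBM_STATUS values and the reset outcome that follow a lockup line
    are attached to the most recent record.
    """
    records = []
    for line in lines:
        match = FENCE.search(line)
        if match:
            records.append({
                "waiting": int(match.group(1), 16),
                "last": int(match.group(2), 16),
                "status": [],
                "reset_ok": None,  # stays None if no outcome line is seen
            })
            continue
        if not records:
            continue  # ignore anything logged before the first lockup
        match = STATUS.search(line)
        if match:
            records[-1]["status"].append(int(match.group(1), 16))
            continue
        match = OUTCOME.search(line)
        if match:
            records[-1]["reset_ok"] = match.group(1) == "succeed"
    return records


def changed_bits(record):
    """XOR of the first and last RBBM_STATUS values of one reset attempt."""
    status = record["status"]
    return status[0] ^ status[-1] if status else 0
```

For each record, record["waiting"] - record["last"] gives the fence gap, and format(changed_bits(record), "#010x") shows which status bits changed during the reset attempt.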

Comment 5 Jérôme Glisse 2011-01-31 14:34:58 UTC
Sadly, this is a GPU lockup. There is not much we can do besides hoping that some change will fix it at some point.

Comment 6 Joshua Covington 2011-01-31 19:11:37 UTC
I thought this was a driver issue. Is there anything (besides hoping) that can be done in this case? Should I report it to the x.org mailing list?

Comment 7 Jérôme Glisse 2011-01-31 19:20:50 UTC
This is a driver issue, but GPU lockups are tedious to debug and pretty much require having the hw in front of a dev. Even with the hw in front of a dev, one needs a simple test case that triggers the lockup, as waiting X amount of time is not practicable for debugging.

Comment 8 Joshua Covington 2011-01-31 19:28:24 UTC
I see.

I'll try to figure out how to reproduce it and report back if something comes up. Thanks for the fast reply.

Comment 9 Joshua Covington 2011-01-31 19:30:57 UTC
With the back-trace isn't it possible to find what locks the GPU?

Comment 10 Mike Hanafey 2011-08-01 16:52:51 UTC
Very similar problem on Fedora 15 with a HD 6850 card. Easily reproducible by dragging the scrollbar in Firefox up and down. I submitted the following: https://bugs.freedesktop.org/show_bug.cgi?id=39576

Comment 11 Jérôme Glisse 2011-08-01 17:40:49 UTC
When it comes to GPUs, please follow the rule: 1 person = 1 bug. We, the developers, will mark bugs as duplicates if we think they are.

Comment 12 Patrick Dubois 2011-08-23 18:45:45 UTC
This appears to affect my card as well.
--
product: RV630 [Radeon HD 2600 Series]
--

Both F14 and F15 x86_64 show symptoms.

Symptoms include being unable to use OpenGL features, including my KDE desktop. Most of the time the machine appears to lock up, with <alt>-F4 showing the 'scheduling IB' error.

Also showing up today:
--
Aug 23 14:41:19 blackcat kernel: [  785.245653] [drm] wait for fifo failed status : 0xE57C34E0 0x00113303
--

Is there anything I can do to provide further debugging information?

Comment 13 Jérôme Glisse 2011-08-23 18:53:15 UTC
When it comes to GPUs, 1 user -> 1 bug. We (devs) will mark bugs as duplicates if we think they are. In this case, it definitely isn't one.

Comment 14 Fedora End Of Life 2012-08-16 13:31:47 UTC
This message is a notice that Fedora 14 is now at end of life. Fedora 
has stopped maintaining and issuing updates for Fedora 14. It is 
Fedora's policy to close all bug reports from releases that are no 
longer maintained.  At this time, all open bugs with a Fedora 'version'
of '14' have been closed as WONTFIX.

(Please note: Our normal process is to give advance warning of this 
occurring, but we forgot to do that. A thousand apologies.)

Package Maintainer: If you wish for this bug to remain open because you
plan to fix it in a currently maintained version, feel free to reopen 
this bug and simply change the 'version' to a later Fedora version.

Bug Reporter: Thank you for reporting this issue and we are sorry that 
we were unable to fix it before Fedora 14 reached end of life. If you 
would still like to see this bug fixed and are able to reproduce it 
against a later version of Fedora, you are encouraged to click on 
"Clone This Bug" (top right of this page) and open it against that 
version of Fedora.

Although we aim to fix as many bugs as possible during every release's 
lifetime, sometimes those efforts are overtaken by events.  Often a 
more recent Fedora release includes newer upstream software that fixes 
bugs or makes them obsolete.

The process we are following is described here: 
http://fedoraproject.org/wiki/BugZappers/HouseKeeping

