| Summary: | RS480:X200M GPU lockup (couldn't schedule IB) | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Product: | [Fedora] Fedora | Reporter: | Joshua Covington <joshuacov> | ||||||||||
| Component: | xorg-x11-drv-ati | Assignee: | Jérôme Glisse <jglisse> | ||||||||||
| Status: | CLOSED WONTFIX | QA Contact: | Fedora Extras Quality Assurance <extras-qa> | ||||||||||
| Severity: | high | Docs Contact: | |||||||||||
| Priority: | unspecified | ||||||||||||
| Version: | 14 | CC: | collura, mike.hanafey, pat, xgl-maint | ||||||||||
| Target Milestone: | --- | Keywords: | Triaged | ||||||||||
| Target Release: | --- | ||||||||||||
| Hardware: | x86_64 | ||||||||||||
| OS: | Linux | ||||||||||||
| Whiteboard: | |||||||||||||
| Fixed In Version: | Doc Type: | Bug Fix | |||||||||||
| Doc Text: | Story Points: | --- | |||||||||||
| Clone Of: | Environment: | ||||||||||||
| Last Closed: | 2012-08-16 13:31:44 UTC | Type: | --- | ||||||||||
| Regression: | --- | Mount Type: | --- | ||||||||||
| Documentation: | --- | CRM: | |||||||||||
| Verified Versions: | Category: | --- | |||||||||||
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||||||||
| Cloudforms Team: | --- | Target Upstream Version: | |||||||||||
| Attachments: |
|
||||||||||||
Created attachment 475998 [details]
Xorg.0.log
Created attachment 475999 [details]
installed packages
Created attachment 476000 [details]
device-id
Some back-trace [ 3663.897081] radeon 0000:01:05.0: GPU lockup CP stall for more than 1020msec [ 3663.897096] ------------[ cut here ]------------ [ 3663.897233] WARNING: at drivers/gpu/drm/radeon/radeon_fence.c:235 radeon_fence_wait+0x22e/0x2cd [radeon]() [ 3663.897242] Hardware name: YYpire 3050 [ 3663.897249] GPU lockup (waiting for 0x0003B0BD last fence id 0x0003B0BC) [ 3663.897255] Modules linked in: cryptd aes_x86_64 aes_generic cpufreq_ondemand powernow_k8 freq_table mperf ip6t_REJECT nf_conntrack_ipv6 ip6table_filter ip6_tables ipv6 fuse snd_hda_codec_realtek snd_hda_intel arc4 usblp snd_hda_codec snd_hwdep ecb snd_seq ath5k mac80211 snd_seq_device 8139too ath cfg80211 snd_pcm 8139cp rfkill mii snd_timer snd soundcore k8temp edac_core i2c_piix4 shpchp snd_page_alloc edac_mce_amd serio_raw wmi joydev microcode sdhci_pci ata_generic pata_acpi sdhci mmc_core yenta_socket pata_atiixp sata_sil usb_storage radeon ttm drm_kms_helper drm i2c_algo_bit i2c_core [last unloaded: scsi_wait_scan] [ 3663.897369] Pid: 1338, comm: X Not tainted 2.6.35.10-74.fc14.x86_64 #1 [ 3663.897376] Call Trace: [ 3663.897400] [<ffffffff8104d999>] warn_slowpath_common+0x85/0x9d [ 3663.897413] [<ffffffff8104da54>] warn_slowpath_fmt+0x46/0x48 [ 3663.897466] [<ffffffffa009a8e0>] radeon_fence_wait+0x22e/0x2cd [radeon] [ 3663.897481] [<ffffffff810665cf>] ? autoremove_wake_function+0x0/0x39 [ 3663.897535] [<ffffffffa009b0f0>] radeon_sync_obj_wait+0x11/0x13 [radeon] [ 3663.897565] [<ffffffffa0061839>] ttm_bo_wait+0xa5/0x15c [ttm] [ 3663.897626] [<ffffffffa00a9aaa>] radeon_bo_wait+0xb6/0xd7 [radeon] [ 3663.897687] [<ffffffffa00aa068>] radeon_gem_wait_idle_ioctl+0x40/0x77 [radeon] [ 3663.897741] [<ffffffffa00192b8>] drm_ioctl+0x28b/0x389 [drm] [ 3663.897803] [<ffffffffa00aa028>] ? radeon_gem_wait_idle_ioctl+0x0/0x77 [radeon] [ 3663.897820] [<ffffffff811e3718>] ? inode_has_perm.clone.20+0x79/0x8f [ 3663.897835] [<ffffffff8103c142>] ? need_resched+0x23/0x2d [ 3663.897846] [<ffffffff8103c15a>] ? should_resched+0xe/0x2e [ 3663.897859] [<ffffffff81468695>] ? _cond_resched+0xe/0x22 [ 3663.897869] [<ffffffff8103c142>] ? need_resched+0x23/0x2d [ 3663.897879] [<ffffffff8103c15a>] ? should_resched+0xe/0x2e [ 3663.897894] [<ffffffff8105c755>] ? recalc_sigpending+0x7c/0x81 [ 3663.897905] [<ffffffff81469c46>] ? _raw_spin_lock_irq+0x1f/0x21 [ 3663.897916] [<ffffffff8112419b>] vfs_ioctl+0x36/0xa7 [ 3663.897925] [<ffffffff81124afc>] do_vfs_ioctl+0x468/0x49b [ 3663.897938] [<ffffffff81099cd2>] ? audit_syscall_exit+0x130/0x14c [ 3663.897947] [<ffffffff81124b85>] sys_ioctl+0x56/0x79 [ 3663.897958] [<ffffffff81009fa5>] ? int_check_syscall_exit_work+0x34/0x3d [ 3663.897968] [<ffffffff81009cf2>] system_call_fastpath+0x16/0x1b [ 3663.897977] ---[ end trace cb28df46bbf5a566 ]--- [ 3664.055221] Failed to wait GUI idle while programming pipes. Bad things might happen. [ 3664.056239] radeon 0000:01:05.0: (r300_asic_reset:415) RBBM_STATUS=0x80010140 [ 3664.561476] radeon 0000:01:05.0: (r300_asic_reset:434) RBBM_STATUS=0x80010140 [ 3665.062688] radeon 0000:01:05.0: (r300_asic_reset:446) RBBM_STATUS=0x00000140 [ 3665.062729] radeon 0000:01:05.0: GPU reset succeed [ 3665.062735] radeon 0000:01:05.0: GPU reset succeed [ 3665.073094] [drm] radeon: 2 quad pipes, 1 z pipes initialized. [ 3665.074413] [drm] radeon: ring at 0x00000000A6000000 [ 3665.074484] [drm] ring test succeeded in 0 usecs [ 3665.074509] [drm] ib test succeeded in 1 usecs [ 3666.250335] radeon 0000:01:05.0: GPU lockup CP stall for more than 1020msec [ 3666.250350] ------------[ cut here ]------------ [ 3666.250475] WARNING: at drivers/gpu/drm/radeon/radeon_fence.c:235 radeon_fence_wait+0x22e/0x2cd [radeon]() [ 3666.250484] Hardware name: YYpire 3050 [ 3666.250490] GPU lockup (waiting for 0x0003B0C0 last fence id 0x0003B0BF) [ 3666.250495] Modules linked in: cryptd aes_x86_64 aes_generic cpufreq_ondemand powernow_k8 freq_table mperf ip6t_REJECT nf_conntrack_ipv6 ip6table_filter ip6_tables ipv6 fuse snd_hda_codec_realtek snd_hda_intel arc4 usblp snd_hda_codec snd_hwdep ecb snd_seq ath5k mac80211 snd_seq_device 8139too ath cfg80211 snd_pcm 8139cp rfkill mii snd_timer snd soundcore k8temp edac_core i2c_piix4 shpchp snd_page_alloc edac_mce_amd serio_raw wmi joydev microcode sdhci_pci ata_generic pata_acpi sdhci mmc_core yenta_socket pata_atiixp sata_sil usb_storage radeon ttm drm_kms_helper drm i2c_algo_bit i2c_core [last unloaded: scsi_wait_scan] [ 3666.250606] Pid: 1338, comm: X Tainted: G W 2.6.35.10-74.fc14.x86_64 #1 [ 3666.250613] Call Trace: [ 3666.250636] [<ffffffff8104d999>] warn_slowpath_common+0x85/0x9d [ 3666.250647] [<ffffffff8104da54>] warn_slowpath_fmt+0x46/0x48 [ 3666.250701] [<ffffffffa009a8e0>] radeon_fence_wait+0x22e/0x2cd [radeon] [ 3666.250716] [<ffffffff810665cf>] ? autoremove_wake_function+0x0/0x39 [ 3666.250770] [<ffffffffa009b0f0>] radeon_sync_obj_wait+0x11/0x13 [radeon] [ 3666.250801] [<ffffffffa0061839>] ttm_bo_wait+0xa5/0x15c [ttm] [ 3666.250862] [<ffffffffa00a9aaa>] radeon_bo_wait+0xb6/0xd7 [radeon] [ 3666.250923] [<ffffffffa00aa068>] radeon_gem_wait_idle_ioctl+0x40/0x77 [radeon] [ 3666.250983] [<ffffffffa00192b8>] drm_ioctl+0x28b/0x389 [drm] [ 3666.251044] [<ffffffffa00aa028>] ? radeon_gem_wait_idle_ioctl+0x0/0x77 [radeon] [ 3666.251059] [<ffffffff811e3718>] ? inode_has_perm.clone.20+0x79/0x8f [ 3666.251075] [<ffffffff8103c142>] ? need_resched+0x23/0x2d [ 3666.251085] [<ffffffff8103c15a>] ? should_resched+0xe/0x2e [ 3666.251097] [<ffffffff81468695>] ? _cond_resched+0xe/0x22 [ 3666.251106] [<ffffffff8103c142>] ? need_resched+0x23/0x2d [ 3666.251116] [<ffffffff8103c15a>] ? should_resched+0xe/0x2e [ 3666.251131] [<ffffffff8105c755>] ? recalc_sigpending+0x7c/0x81 [ 3666.251141] [<ffffffff81469c46>] ? _raw_spin_lock_irq+0x1f/0x21 [ 3666.251153] [<ffffffff8112419b>] vfs_ioctl+0x36/0xa7 [ 3666.251161] [<ffffffff81124afc>] do_vfs_ioctl+0x468/0x49b [ 3666.251173] [<ffffffff81099cd2>] ? audit_syscall_exit+0x130/0x14c [ 3666.251182] [<ffffffff81124b85>] sys_ioctl+0x56/0x79 [ 3666.251193] [<ffffffff81009fa5>] ? int_check_syscall_exit_work+0x34/0x3d [ 3666.251203] [<ffffffff81009cf2>] system_call_fastpath+0x16/0x1b [ 3666.251212] ---[ end trace cb28df46bbf5a567 ]--- [ 3666.395890] Failed to wait GUI idle while programming pipes. Bad things might happen. [ 3666.396899] radeon 0000:01:05.0: (r300_asic_reset:415) RBBM_STATUS=0x80010140 [ 3666.901982] radeon 0000:01:05.0: (r300_asic_reset:434) RBBM_STATUS=0x80010140 [ 3667.403060] radeon 0000:01:05.0: (r300_asic_reset:446) RBBM_STATUS=0x00000140 [ 3667.403079] radeon 0000:01:05.0: GPU reset succeed [ 3667.403082] radeon 0000:01:05.0: GPU reset succeed [ 3667.413433] [drm] radeon: 2 quad pipes, 1 z pipes initialized. [ 3667.413958] [drm] radeon: ring at 0x00000000A6000000 [ 3667.413999] [drm] ring test succeeded in 0 usecs [ 3667.414027] [drm] ib test succeeded in 1 usecs [ 3667.549301] radeon 0000:01:05.0: GPU lockup CP stall for more than 2319msec [ 3667.549308] ------------[ cut here ]------------ [ 3667.549359] WARNING: at drivers/gpu/drm/radeon/radeon_fence.c:235 radeon_fence_wait+0x22e/0x2cd [radeon]() [ 3667.549363] Hardware name: YYpire 3050 [ 3667.549366] GPU lockup (waiting for 0x0003B0C3 last fence id 0x0003B0C2) [ 3667.549369] Modules linked in: cryptd aes_x86_64 aes_generic cpufreq_ondemand powernow_k8 freq_table mperf ip6t_REJECT nf_conntrack_ipv6 ip6table_filter ip6_tables ipv6 fuse snd_hda_codec_realtek snd_hda_intel arc4 usblp snd_hda_codec snd_hwdep ecb snd_seq ath5k mac80211 snd_seq_device 8139too ath cfg80211 snd_pcm 8139cp rfkill mii snd_timer snd soundcore k8temp edac_core i2c_piix4 shpchp snd_page_alloc edac_mce_amd serio_raw wmi joydev microcode sdhci_pci ata_generic pata_acpi sdhci mmc_core yenta_socket pata_atiixp sata_sil usb_storage radeon ttm drm_kms_helper drm i2c_algo_bit i2c_core [last unloaded: scsi_wait_scan] [ 3667.549422] Pid: 1338, comm: X Tainted: G W 2.6.35.10-74.fc14.x86_64 #1 [ 3667.549426] Call Trace: [ 3667.549436] [<ffffffff8104d999>] warn_slowpath_common+0x85/0x9d [ 3667.549442] [<ffffffff8104da54>] warn_slowpath_fmt+0x46/0x48 [ 3667.549468] [<ffffffffa009a8e0>] radeon_fence_wait+0x22e/0x2cd [radeon] [ 3667.549475] [<ffffffff810665cf>] ? autoremove_wake_function+0x0/0x39 [ 3667.549501] [<ffffffffa009b0f0>] radeon_sync_obj_wait+0x11/0x13 [radeon] [ 3667.549515] [<ffffffffa0061839>] ttm_bo_wait+0xa5/0x15c [ttm] [ 3667.549545] [<ffffffffa00a9aaa>] radeon_bo_wait+0xb6/0xd7 [radeon] [ 3667.549575] [<ffffffffa00aa068>] radeon_gem_wait_idle_ioctl+0x40/0x77 [radeon] [ 3667.549599] [<ffffffffa00192b8>] drm_ioctl+0x28b/0x389 [drm] [ 3667.549630] [<ffffffffa00aa028>] ? radeon_gem_wait_idle_ioctl+0x0/0x77 [radeon] [ 3667.549638] [<ffffffff811e3718>] ? inode_has_perm.clone.20+0x79/0x8f [ 3667.549645] [<ffffffff8103c142>] ? need_resched+0x23/0x2d [ 3667.549650] [<ffffffff8103c15a>] ? should_resched+0xe/0x2e [ 3667.549655] [<ffffffff81468695>] ? _cond_resched+0xe/0x22 [ 3667.549660] [<ffffffff8103c142>] ? need_resched+0x23/0x2d [ 3667.549665] [<ffffffff8103c15a>] ? should_resched+0xe/0x2e [ 3667.549672] [<ffffffff8105c755>] ? recalc_sigpending+0x7c/0x81 [ 3667.549677] [<ffffffff81469c46>] ? _raw_spin_lock_irq+0x1f/0x21 [ 3667.549682] [<ffffffff8112419b>] vfs_ioctl+0x36/0xa7 [ 3667.549687] [<ffffffff81124afc>] do_vfs_ioctl+0x468/0x49b [ 3667.549692] [<ffffffff81099cd2>] ? audit_syscall_exit+0x130/0x14c [ 3667.549697] [<ffffffff81124b85>] sys_ioctl+0x56/0x79 [ 3667.549702] [<ffffffff81009fa5>] ? int_check_syscall_exit_work+0x34/0x3d [ 3667.549707] [<ffffffff81009cf2>] system_call_fastpath+0x16/0x1b [ 3667.549711] ---[ end trace cb28df46bbf5a568 ]--- [ 3667.698200] Failed to wait GUI idle while programming pipes. Bad things might happen. [ 3667.699209] radeon 0000:01:05.0: (r300_asic_reset:415) RBBM_STATUS=0x84110140 [ 3668.204374] radeon 0000:01:05.0: (r300_asic_reset:434) RBBM_STATUS=0x84010140 [ 3668.705441] radeon 0000:01:05.0: (r300_asic_reset:446) RBBM_STATUS=0x84000140 [ 3668.705460] radeon 0000:01:05.0: failed to reset GPU [ 3668.705463] radeon 0000:01:05.0: GPU reset failed [ 3668.705937] [drm:radeon_ib_schedule] *ERROR* radeon: couldn't schedule IB(6). [ 3668.705943] [drm:radeon_cs_ioctl] *ERROR* Faild to schedule IB ! [ 3668.706471] [drm:radeon_ib_schedule] *ERROR* radeon: couldn't schedule IB(7). Sadly this is GPU lockup not much we can do beside hopping that one change at one point will fix it. I thought this is a driver issue. Is there anything (besides hopping) that can be done in this case? Should I report it at x.org mailing list? This is a driver issue but GPU lockup are tedious to debug and pretty much need to have the hw in front a of dev. Even with hw in front of a dev one need a simple test case that trigger the lockup, as waiting X amount of time is no practicable for debugging. I see. I'll try to figure out how to reproduce it and report back if something comes up. Thanks for the fast reply. With the back-trace isn't it possible to find what locks the GPU? Very similar problem on Fedora 15 with a HD 6850 card. Easily reproducible by dragging the scrollbar in Firefox up and down. I submitted the following: https://bugs.freedesktop.org/show_bug.cgi?id=39576 When it comes to gpu please follow the rules 1 person = 1 bug. We, developer, will mark bug as duplicate if we, developer, think they are. This appears to affect my card as well. -- product: RV630 [Radeon HD 2600 Series] -- Both F14 and F15 x86_64 show symptoms. Symptoms include no ability to use OpenGL features including my KDE desktop. The majority of time the machine appears to lock up with <alt>-F4 showing the 'scheduling IB' error. Also showing up today : -- Aug 23 14:41:19 blackcat kernel: [ 785.245653] [drm] wait for fifo failed status : 0xE57C34E0 0x00113303 -- Anything I can do to provide further debugging information ? When it comes to gpu, 1 user -> 1 bug, we (dev) will mark duplicate if we think they are. In this case this definitly isn't. This message is a notice that Fedora 14 is now at end of life. Fedora has stopped maintaining and issuing updates for Fedora 14. It is Fedora's policy to close all bug reports from releases that are no longer maintained. At this time, all open bugs with a Fedora 'version' of '14' have been closed as WONTFIX. (Please note: Our normal process is to give advanced warning of this occurring, but we forgot to do that. A thousand apologies.) Package Maintainer: If you wish for this bug to remain open because you plan to fix it in a currently maintained version, feel free to reopen this bug and simply change the 'version' to a later Fedora version. Bug Reporter: Thank you for reporting this issue and we are sorry that we were unable to fix it before Fedora 14 reached end of life. If you would still like to see this bug fixed and are able to reproduce it against a later version of Fedora, you are encouraged to click on "Clone This Bug" (top right of this page) and open it against that version of Fedora. Although we aim to fix as many bugs as possible during every release's lifetime, sometimes those efforts are overtaken by events. Often a more recent Fedora release includes newer upstream software that fixes bugs or makes them obsolete. The process we are following is described here: http://fedoraproject.org/wiki/BugZappers/HouseKeeping |
Created attachment 475997 [details] messages - Lockup back-traces Description of problem: After leaving the desktop idle for some time the xorg-x11-drv-ati locks up and crashes. Restarting the xserver can't help. The log is full of [drm:radeon_ib_schedule] *ERROR* radeon: couldn't schedule IB messages. Version-Release number of selected component (if applicable): kernel-headers-2.6.35.10-74.fc14.x86_64 kernel-2.6.35.10-74.fc14.x86_64 xorg-x11-drv-ati-6.13.1-0.4.20100705git37b348059.fc14.x86_64 xorg-x11-server-utils-7.4-19.fc14.x86_64 xorg-x11-server-common-1.9.3-4.fc14.x86_64 xorg-x11-server-Xorg-1.9.3-4.fc14.x86_64 libdrm-2.4.22-1.fc14.x86_64 mesa-libGL-7.9-5.fc14.x86_64 mesa-libGLU-7.9-5.fc14.x86_64 mesa-dri-drivers-7.9-5.fc14.x86_64 The IGP is: 01:05.0 VGA compatible controller: ATI Technologies Inc RS482 [Radeon Xpress 200M] (prog-if 00 [VGA controller]) Subsystem: Acer Incorporated [ALI] Device 010f Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap+ 66MHz+ UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Latency: 66 (2000ns min), Cache Line Size: 32 bytes Interrupt: pin A routed to IRQ 17 Region 0: Memory at c8000000 (32-bit, prefetchable) [size=128M] Region 1: I/O ports at 9000 [size=256] Region 2: Memory at c0100000 (32-bit, non-prefetchable) [size=64K] [virtual] Expansion ROM at c0120000 [disabled] [size=128K] Capabilities: [50] Power Management version 2 Flags: PMEClk- DSI- D1+ D2+ AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-) Status: D0 NoSoftRst- PME-Enable- DSel=0 DScale=0 PME- Kernel driver in use: radeon Kernel modules: radeon, radeonfb How reproducible: Leave the desktop idle and wait for the screen to be locked. I can't find any special pattern that can trigger this crash. Steps to Reproduce: 1. see above 2. 3. Actual results: The driver locks up and only hardware restart can fix it. Expected results: It shouldn't lock up on this old igp. Additional info: I've also attached the message file. It shows 3 driver back-traces and the error message. Xorg.0.log is also attached.