Bug 741500

Summary: [REDWOOD] Xorg stuck in infinite loop
Product: [Fedora] Fedora
Reporter: Rik van Riel <riel>
Component: xorg-x11-server
Assignee: Jérôme Glisse <jglisse>
Status: CLOSED WONTFIX
QA Contact: Fedora Extras Quality Assurance <extras-qa>
Severity: high
Docs Contact:
Priority: unspecified
Version: 15
CC: jglisse, mcepl, xgl-maint
Target Milestone: ---
Keywords: Triaged
Target Release: ---
Hardware: Unspecified
OS: Unspecified
Whiteboard: [cat:lockup]
Fixed In Version:
Doc Type: Bug Fix
Last Closed: 2012-08-07 16:32:20 UTC
Type: ---
Attachments:
Xorg.0.log of the session that crashed (flags: none)

Description Rik van Riel 2011-09-27 04:12:29 UTC
Created attachment 525024
Xorg.0.log of the session that crashed

Every once in a while Xorg gets stuck in an infinite loop. Killing it with -SEGV makes it exit and restart.

Version-Release number of selected component (if applicable):

xorg-x11-drv-ati-6.14.1-2.20110525gitfe5c42f51.fc15.x86_64
kernel-3.1.0-0.rc7.git0.2.compact.fc17.x86_64
mesa-dri-drivers-7.11-1.fc15.x86_64

How reproducible:

Not sure.  I have a Radeon 5570 with 3 monitors attached.

Additional info:

Xorg.0.log attached.  Here are some possibly related lines from my dmesg. I have not correlated the timing, so these could come from an earlier lockup from which X recovered after a few seconds:

[33691.948725] radeon 0000:02:00.0: GPU lockup CP stall for more than 10000msec
[33691.948736] ------------[ cut here ]------------
[33691.948770] WARNING: at drivers/gpu/drm/radeon/radeon_fence.c:267 radeon_fence_wait+0x296/0x33d [radeon]()
[33691.948776] Hardware name: Precision WorkStation T3500  
[33691.948780] GPU lockup (waiting for 0x00EFC6D5 last fence id 0x00EFC6D2)
[33691.948782] Modules linked in: tcp_lp bnep bluetooth rfkill fuse tun ip6table_filter ip6_tables ebtable_nat ebtables ipt_MASQUERADE iptable_nat nf_nat nf_conntrack_ipv4 nf_defrag_ipv4 xt_state nf_conntrack xt_CHECKSUM iptable_mangle netconsole configfs nfsd lockd nfs_acl auth_rpcgss sunrpc bridge stp llc max6650 coretemp snd_hda_codec_hdmi snd_hda_codec_analog snd_hda_intel snd_hda_codec snd_ens1371 gameport snd_ac97_codec snd_usb_audio ppdev tg3 ac97_bus snd_hwdep snd_usbmidi_lib iTCO_wdt snd_seq i7core_edac edac_core virtio_net iTCO_vendor_support snd_pcm kvm_intel parport_pc snd_timer snd_rawmidi kvm i2c_i801 pl2303 parport microcode dcdbas dell_wmi snd_seq_device snd soundcore snd_page_alloc sparse_keymap serio_raw wmi radeon ttm drm_kms_helper drm i2c_algo_bit i2c_core [last unloaded: scsi_wait_scan]
[33691.948825] Pid: 9645, comm: Xorg Tainted: G        W   3.1.0-0.rc7.git0.2.compact.fc17.x86_64 #1
[33691.948827] Call Trace:
[33691.948834]  [<ffffffff81057afa>] warn_slowpath_common+0x83/0x9b
[33691.948837]  [<ffffffff81057bb5>] warn_slowpath_fmt+0x46/0x48
[33691.948857]  [<ffffffffa00d270c>] ? evergreen_gpu_is_lockup+0xbb/0xc3 [radeon]
[33691.948875]  [<ffffffffa0098e01>] radeon_fence_wait+0x296/0x33d [radeon]
[33691.948881]  [<ffffffff8107348e>] ? remove_wait_queue+0x3a/0x3a
[33691.948897]  [<ffffffffa00993f5>] radeon_sync_obj_wait+0x11/0x13 [radeon]
[33691.948904]  [<ffffffffa00605fe>] ttm_bo_wait+0xbf/0x17a [ttm]
[33691.948910]  [<ffffffffa0061031>] ? ttm_bo_list_ref_sub+0x29/0x2b [ttm]
[33691.948928]  [<ffffffffa00a9e47>] radeon_bo_wait+0x7b/0x9f [radeon]
[33691.948947]  [<ffffffffa00aa3a9>] radeon_gem_wait_idle_ioctl+0x3d/0x70 [radeon]
[33691.948957]  [<ffffffffa001577c>] drm_ioctl+0x29e/0x37b [drm]
[33691.948976]  [<ffffffffa00aa36c>] ? radeon_gem_busy_ioctl+0x86/0x86 [radeon]
[33691.948980]  [<ffffffff811de9c4>] ? inode_has_perm+0x32/0x34
[33691.948982]  [<ffffffff811dea6d>] ? file_has_perm+0xa7/0xc9
[33691.948986]  [<ffffffff81136ff3>] do_vfs_ioctl+0x452/0x493
[33691.948989]  [<ffffffff810a3a8d>] ? audit_syscall_exit+0x12e/0x14a
[33691.948992]  [<ffffffff8113708a>] sys_ioctl+0x56/0x7c
[33691.948996]  [<ffffffff814bc2f5>] ? int_check_syscall_exit_work+0x34/0x3d
[33691.948999]  [<ffffffff814bc042>] system_call_fastpath+0x16/0x1b
[33691.949001] ---[ end trace 115b516a41aa4c16 ]---
[33691.950120] radeon 0000:02:00.0: GPU softreset 
[33691.950129] radeon 0000:02:00.0:   GRBM_STATUS=0xA0003828
[33691.950136] radeon 0000:02:00.0:   GRBM_STATUS_SE0=0x00000007
[33691.950142] radeon 0000:02:00.0:   GRBM_STATUS_SE1=0x00000007
[33691.950148] radeon 0000:02:00.0:   SRBM_STATUS=0x20000AC0
[33691.950173] radeon 0000:02:00.0:   GRBM_SOFT_RESET=0x00007F6B
[33691.950281] radeon 0000:02:00.0:   GRBM_STATUS=0x00003828
[33691.950288] radeon 0000:02:00.0:   GRBM_STATUS_SE0=0x00000007
[33691.950294] radeon 0000:02:00.0:   GRBM_STATUS_SE1=0x00000007
[33691.950300] radeon 0000:02:00.0:   SRBM_STATUS=0x200000C0
[33691.951311] radeon 0000:02:00.0: GPU reset succeed
[33692.006375] radeon 0000:02:00.0: WB enabled
[33692.023019] [drm] ring test succeeded in 1 usecs
[33692.023039] [drm] ib test succeeded in 1 usecs
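
As a rough aid for reading the dmesg excerpt above, the following Python sketch computes how far the last signalled fence was behind the one Xorg was waiting for, and which status-register bits differ before and after the soft reset. The numeric values are copied from the log. The helper names are made up for illustration, the 32-bit fence counter width is an assumption, and the bit positions are reported without attributing any hardware meaning to them.

```python
# Values are copied from the dmesg excerpt; helper names are illustrative only.

def fence_gap(waiting_for: int, last_signalled: int) -> int:
    """Distance between the awaited fence id and the last signalled one."""
    # Assumption: fence ids are 32-bit counters, so mask to handle wraparound.
    return (waiting_for - last_signalled) & 0xFFFFFFFF


def changed_bits(before: int, after: int) -> list[int]:
    """Bit positions that differ between two 32-bit register snapshots."""
    diff = before ^ after
    return [bit for bit in range(32) if diff & (1 << bit)]


if __name__ == "__main__":
    print("fences outstanding:", fence_gap(0x00EFC6D5, 0x00EFC6D2))
    print("GRBM_STATUS bits changed:", changed_bits(0xA0003828, 0x00003828))
    print("SRBM_STATUS bits changed:", changed_bits(0x20000AC0, 0x200000C0))
```

The GRBM_STATUS_SE0 and GRBM_STATUS_SE1 values are identical before and after the reset, so the same helper would return an empty list for them.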

Comment 1 Matěj Cepl 2011-09-27 20:28:23 UTC
Created attachment 525200
Xorg.0.log from the archive

Comment 2 Matěj Cepl 2011-09-27 20:28:30 UTC
Created attachment 525201
Xorg.0.log.old from the archive

Comment 3 Matěj Cepl 2011-09-27 20:28:37 UTC
Created attachment 525202
Xorg.1.log from the archive

Comment 4 Matěj Cepl 2011-09-27 20:28:45 UTC
Created attachment 525203
Xorg.1.log.old from the archive

Comment 5 Matěj Cepl 2011-09-27 20:31:16 UTC
Sorry, those attachments should have gone to a different bug.

Comment 6 Matěj Cepl 2011-09-27 20:46:48 UTC
[ 33866.976] 0: /usr/bin/Xorg (xorg_backtrace+0x2f) [0x45e84f]
[ 33866.976] 1: /usr/bin/Xorg (mieqEnqueue+0x1e9) [0x458dc9]
[ 33866.976] 2: /usr/bin/Xorg (xf86PostMotionEventM+0xa3) [0x47e763]
[ 33866.976] 3: /usr/bin/Xorg (xf86PostMotionEventP+0x52) [0x47e882]
[ 33866.976] 4: /usr/lib64/xorg/modules/input/evdev_drv.so (0x7f120f1f2000+0x4b1b) [0x7f120f1f6b1b]
[ 33866.976] 5: /usr/bin/Xorg (0x400000+0x6c148) [0x46c148]
[ 33866.976] 6: /usr/bin/Xorg (0x400000+0x11a130) [0x51a130]
[ 33866.976] 7: /lib64/libpthread.so.0 (0x334c200000+0xeef0) [0x334c20eef0]
[ 33866.976] 8: /lib64/libc.so.6 (ioctl+0x7) [0x334bad8af7]
[ 33866.976] 9: /usr/lib64/libdrm.so.2 (drmIoctl+0x28) [0x335de03358]
[ 33866.976] 10: /usr/lib64/libdrm.so.2 (drmCommandWriteRead+0x1c) [0x335de054bc]
[ 33866.976] 11: /usr/lib64/libdrm_radeon.so.1 (0x7f1211208000+0x1be9) [0x7f1211209be9]
[ 33866.976] 12: /usr/lib64/libdrm_radeon.so.1 (0x7f1211208000+0x1e14) [0x7f1211209e14]
[ 33866.976] 13: /usr/lib64/xorg/modules/drivers/radeon_drv.so (0x7f121140e000+0xc3554) [0x7f12114d1554]
[ 33866.976] 14: /usr/lib64/xorg/modules/libexa.so (0x7f1210ff0000+0x616e) [0x7f1210ff616e]
[ 33866.976] 15: /usr/lib64/xorg/modules/libexa.so (0x7f1210ff0000+0x8ba9) [0x7f1210ff8ba9]
[ 33866.976] 16: /usr/lib64/xorg/modules/libexa.so (0x7f1210ff0000+0x11df2) [0x7f1211001df2]
[ 33866.976] 17: /usr/lib64/xorg/modules/libexa.so (0x7f1210ff0000+0x9826) [0x7f1210ff9826]
[ 33866.976] 18: /usr/bin/Xorg (0x400000+0xdc4a9) [0x4dc4a9]
[ 33866.976] 19: /usr/bin/Xorg (0x400000+0x2babd) [0x42babd]
[ 33866.976] 20: /usr/bin/Xorg (0x400000+0x2edb1) [0x42edb1]
[ 33866.976] 21: /usr/bin/Xorg (0x400000+0x22e1a) [0x422e1a]
[ 33866.977] 22: /lib64/libc.so.6 (__libc_start_main+0xed) [0x334ba2139d]
[ 33866.977] 23: /usr/bin/Xorg (0x400000+0x23101) [0x423101]
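
To make the backtrace above easier to work with, here is a minimal Python sketch that splits each frame line into its index, module, symbol (if any) and address, and checks that the "base+offset" form of an unsymbolised frame adds up to the bracketed address. The regular expression and the function names are assumptions written for this log format, not part of any Xorg tooling.

```python
import re

# Matches lines such as:
#   "[ 33866.976] 1: /usr/bin/Xorg (mieqEnqueue+0x1e9) [0x458dc9]"
FRAME_RE = re.compile(
    r"\[\s*[\d.]+\]\s+(?P<index>\d+):\s+(?P<module>\S+)\s+"
    r"\((?P<location>[^)]*)\)\s+\[(?P<address>0x[0-9a-fA-F]+)\]"
)


def parse_frame(line: str):
    """Return (index, module, symbol or None, address), or None if no match."""
    match = FRAME_RE.search(line)
    if match is None:
        return None
    location = match.group("location")
    # Symbolised frames look like "name+0xoff"; the others like "0xbase+0xoff".
    symbol = None if location.startswith("0x") else location.split("+")[0]
    return (
        int(match.group("index")),
        match.group("module"),
        symbol,
        int(match.group("address"), 16),
    )


def base_plus_offset(location: str) -> int:
    """For an unsymbolised "0xbase+0xoff" location, return base + offset."""
    base, offset = location.split("+")
    return int(base, 16) + int(offset, 16)
```

Read this way, frames 8 to 13 show Xorg inside an ioctl issued through libdrm, and frames 1 to 7 show the input signal handler running on top of that call.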

Comment 7 Matěj Cepl 2011-09-27 21:03:09 UTC
I would need the EVR (epoch:version-release) of libdrm and xorg-x11-drv-evdev as well, please.

Thank you

Comment 8 Rik van Riel 2011-09-27 23:56:32 UTC
libdrm-2.4.26-1.fc15.x86_64
xorg-x11-drv-evdev-2.6.0-4.fc15.x86_64

Also, today Dave Airlie gave me a newer version of mesa to test things with. This new version does not appear to change things though:

mesa-dri-drivers-7.12-0.1.fc15.x86_64

Comment 9 Matěj Cepl 2011-10-01 22:45:05 UTC
Frame 1: /usr/bin/Xorg (mieqEnqueue+0x1e9) [0x458dc9]
	/usr/src/debug/xorg-server-1.10.4/mi/mieq.c:179
	mieqEnqueue
Frame 2: /usr/bin/Xorg (xf86PostMotionEventM+0xa3) [0x47e763]
	/usr/src/debug/xorg-server-1.10.4/hw/xfree86/common/xf86Xinput.c:1057
	xf86PostMotionEventM
Frame 3: /usr/bin/Xorg (xf86PostMotionEventP+0x52) [0x47e882]
	/usr/src/debug/xorg-server-1.10.4/hw/xfree86/common/xf86Xinput.c:1009
	xf86PostMotionEventP
Frame 5: /usr/bin/Xorg (0x400000+0x6c148) [0x46c148]
	/usr/src/debug/xorg-server-1.10.4/hw/xfree86/common/xf86Events.c:300
	xf86SigioReadInput
Frame 6: /usr/bin/Xorg (0x400000+0x11a130) [0x51a130]
	/usr/src/debug/xorg-server-1.10.4/hw/xfree86/os-support/linux/../shared/sigio.c:106
	xf86SIGIO
Frame 18: /usr/bin/Xorg (0x400000+0xdc4a9) [0x4dc4a9]
	/usr/src/debug/xorg-server-1.10.4/miext/damage/damage.c:880
	damagePutImage
Frame 19: /usr/bin/Xorg (0x400000+0x2babd) [0x42babd]
	/usr/src/debug/xorg-server-1.10.4/dix/dispatch.c:1990
	ProcPutImage
Frame 20: /usr/bin/Xorg (0x400000+0x2edb1) [0x42edb1]
	/usr/src/debug/xorg-server-1.10.4/dix/dispatch.c:431
	Dispatch
Frame 21: /usr/bin/Xorg (0x400000+0x22e1a) [0x422e1a]
	/usr/src/debug/xorg-server-1.10.4/dix/main.c:296
	main
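
The thread does not say how the frame-to-source mapping above was produced. One common way to get a similar mapping, assuming the matching debuginfo package is installed, is binutils' addr2line. The sketch below only builds the command line; the function name is hypothetical.

```python
def addr2line_command(binary: str, addresses: list[int]) -> list[str]:
    """Build an addr2line invocation for the given binary and addresses.

    -e selects the executable to inspect, -f also prints function names.
    """
    return ["addr2line", "-f", "-e", binary] + [hex(a) for a in addresses]


if __name__ == "__main__":
    # Addresses of frames 1 and 2 from the backtrace in comment 6.
    print(" ".join(addr2line_command("/usr/bin/Xorg", [0x458DC9, 0x47E763])))
```

The resulting list can be passed to subprocess.run. This uses the absolute addresses, which fits the Xorg frames here because they are listed against a base of 0x400000. For frames inside shared objects, the offset from the load base would be the value to pass instead.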

Comment 10 Fedora End Of Life 2012-08-07 16:32:22 UTC
This message is a notice that Fedora 15 is now at end of life. Fedora
has stopped maintaining and issuing updates for Fedora 15. It is
Fedora's policy to close all bug reports from releases that are no
longer maintained. At this time, all open bugs with a Fedora 'version'
of '15' have been closed as WONTFIX.

(Please note: Our normal process is to give advance warning of this
occurring, but we forgot to do that. A thousand apologies.)

Package Maintainer: If you wish for this bug to remain open because you
plan to fix it in a currently maintained version, feel free to reopen
this bug and simply change the 'version' to a later Fedora version.

Bug Reporter: Thank you for reporting this issue and we are sorry that
we were unable to fix it before Fedora 15 reached end of life. If you
would still like to see this bug fixed and are able to reproduce it
against a later version of Fedora, you are encouraged to click on
"Clone This Bug" (top right of this page) and open it against that
version of Fedora.

Although we aim to fix as many bugs as possible during every release's
lifetime, sometimes those efforts are overtaken by events. Often a
more recent Fedora release includes newer upstream software that fixes
bugs or makes them obsolete.

The process we are following is described here:
http://fedoraproject.org/wiki/BugZappers/HouseKeeping