Bug 690865
| Summary: | kernel BUG at drivers/gpu/drm/i915/i915_gem.c:4238! | |||
|---|---|---|---|---|
| Product: | Red Hat Enterprise Linux 6 | Reporter: | Mike Khusid <mkhusid> | |
| Component: | kernel | Assignee: | Dave Airlie <airlied> | |
| Status: | CLOSED ERRATA | QA Contact: | Red Hat Kernel QE team <kernel-qe> | |
| Severity: | high | Docs Contact: | ||
| Priority: | unspecified | |||
| Version: | 6.1 | CC: | airlied, borgan, ddumas, jofernan, snagar | |
| Target Milestone: | rc | Keywords: | Regression | |
| Target Release: | --- | |||
| Hardware: | Unspecified | |||
| OS: | Linux | |||
| Whiteboard: | ||||
| Fixed In Version: | kernel-2.6.32-131.0.1.el6 | Doc Type: | Bug Fix | |
| Doc Text: | Story Points: | --- | ||
| Clone Of: | ||||
| : | 784901 (view as bug list) | Environment: | ||
| Last Closed: | 2011-05-19 12:00:11 UTC | Type: | --- | |
| Regression: | --- | Mount Type: | --- | |
| Documentation: | --- | CRM: | ||
| Verified Versions: | Category: | --- | ||
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | ||
| Cloudforms Team: | --- | Target Upstream Version: | ||
| Embargoed: | ||||
| Bug Depends On: | ||||
| Bug Blocks: | 609355, 784901 | |||
I had another kernel crash on my computer, and it looks similar. I believe this is a regression from 6.0, since I have not observed the same crashes with earlier kernels.
SYSTEM MAP: /boot/System.map-2.6.32-122.el6.x86_64
DEBUG KERNEL: /usr/lib/debug/lib/modules/2.6.32-122.el6.x86_64/vmlinux (2.6.32-122.el6.x86_64)
DUMPFILE: vmcore
CPUS: 4
DATE: Tue Mar 29 13:01:41 2011
UPTIME: 13:32:07
LOAD AVERAGE: 2.00, 1.71, 1.36
TASKS: 374
NODENAME: mkhusid
RELEASE: 2.6.32-122.el6.x86_64
VERSION: #1 SMP Wed Mar 9 23:54:34 EST 2011
MACHINE: x86_64 (2526 Mhz)
MEMORY: 3.8 GB
PANIC: "kernel BUG at drivers/gpu/drm/i915/i915_gem.c:4190!"
PID: 155
COMMAND: "kslowd001"
TASK: ffff880132da40c0 [THREAD_INFO: ffff88012df3e000]
CPU: 3
STATE: TASK_RUNNING (PANIC)
crash> bt -l
bt: line numbers are not available
PID: 155 TASK: ffff880132da40c0 CPU: 3 COMMAND: "kslowd001"
#0 [ffff88012df3f510] machine_kexec at ffffffff81030c0b
#1 [ffff88012df3f570] crash_kexec at ffffffff810b6142
#2 [ffff88012df3f640] oops_end at ffffffff814dcdc0
#3 [ffff88012df3f670] die at ffffffff8100f2fb
#4 [ffff88012df3f6a0] do_trap at ffffffff814dc6b4
#5 [ffff88012df3f700] do_invalid_op at ffffffff8100ceb5
#6 [ffff88012df3f7a0] invalid_op at ffffffff8100bf5b
[exception RIP: i915_gem_object_pin+424]
RIP: ffffffffa00b9bb8 RSP: ffff88012df3f850 RFLAGS: 00010246
RAX: 000000000003c47e RBX: ffff88012eeaa200 RCX: 0000000000000000
RDX: 000000000003c000 RSI: 0000000000001000 RDI: ffff88012eeaa200
RBP: ffff88012df3f870 R8: ffff88012df3e000 R9: 00000000ffffffff
R10: 0000000000000000 R11: 0000000000000000 R12: ffff88012e610000
R13: 0000000000001000 R14: 0000000000000000 R15: 0000000000000000
ORIG_RAX: ffffffffffffffff CS: 0010 SS: 0018
#7 [ffff88012df3f878] intel_pin_and_fence_fb_obj at ffffffffa00c37d0 [i915]
#8 [ffff88012df3f8a8] intel_pipe_set_base at ffffffffa00c6791 [i915]
#9 [ffff88012df3f938] intel_crtc_mode_set at ffffffffa00cf784 [i915]
#10 [ffff88012df3fa78] drm_crtc_helper_set_mode at ffffffffa0096632 [drm_kms_helper]
#11 [ffff88012df3fc18] drm_crtc_helper_set_config at ffffffffa009776a [drm_kms_helper]
#12 [ffff88012df3fcd8] drm_fb_helper_set_par at ffffffffa009509c [drm_kms_helper]
#13 [ffff88012df3fd18] drm_fb_helper_single_fb_probe at ffffffffa0095260 [drm_kms_helper]
#14 [ffff88012df3fd68] drm_fb_helper_hotplug_event at ffffffffa0094fe4 [drm_kms_helper]
#15 [ffff88012df3fda8] intel_fb_output_poll_changed at ffffffffa00de2cc [i915]
#16 [ffff88012df3fdb8] output_poll_execute at ffffffffa00962bc [drm_kms_helper]
#17 [ffff88012df3fe28] slow_work_execute at ffffffff811018a3
#18 [ffff88012df3fe68] slow_work_thread at ffffffff81101ad7
#19 [ffff88012df3fee8] kthread at ffffffff8108db96
#20 [ffff88012df3ff48] kernel_thread at ffffffff8100c1ca
crash>
http://download.devel.redhat.com/brewroot/scratch/airlied/task_3218241/
Please give this a go and see if you can reproduce either oops. It should definitely fix at least one of them.
Reproduced on the first try. I can't control whether it crashes on pin or unpin.
SYSTEM MAP: /boot/System.map-2.6.32-128.el6.test.x86_64
DEBUG KERNEL: /usr/lib/debug/lib/modules/2.6.32-128.el6.test.x86_64/vmlinux (2.6.32-128.el6.test.x86_64)
DUMPFILE: vmcore
CPUS: 4
DATE: Fri Apr 1 16:55:23 2011
UPTIME: 06:18:53
LOAD AVERAGE: 1.58, 1.19, 1.28
TASKS: 387
NODENAME: mkhusid
RELEASE: 2.6.32-128.el6.test.x86_64
VERSION: #1 SMP Wed Mar 30 23:13:42 EDT 2011
MACHINE: x86_64 (2527 Mhz)
MEMORY: 3.8 GB
PANIC: "kernel BUG at drivers/gpu/drm/i915/i915_gem.c:4192!"
PID: 11259
COMMAND: "Xorg"
TASK: ffff88012a792a80 [THREAD_INFO: ffff880130e54000]
CPU: 3
STATE: TASK_RUNNING (PANIC)
crash> bt -l
bt: line numbers are not available
PID: 11259 TASK: ffff88012a792a80 CPU: 3 COMMAND: "Xorg"
#0 [ffff880130e55140] machine_kexec at ffffffff8103107b
#1 [ffff880130e551a0] crash_kexec at ffffffff810b63c2
#2 [ffff880130e55270] oops_end at ffffffff814dd650
#3 [ffff880130e552a0] die at ffffffff8100f2fb
#4 [ffff880130e552d0] do_trap at ffffffff814dcf44
#5 [ffff880130e55330] do_invalid_op at ffffffff8100ceb5
#6 [ffff880130e553d0] invalid_op at ffffffff8100bf5b
[exception RIP: i915_gem_object_pin+424]
RIP: ffffffffa00b9c28 RSP: ffff880130e55488 RFLAGS: 00010246
RAX: 000000000003c47e RBX: ffff88012ef21e00 RCX: 0000000000000000
RDX: 000000000003c000 RSI: 0000000000001000 RDI: ffff88012ef21e00
RBP: ffff880130e554a8 R8: ffff880130e54000 R9: 0000000000000000
R10: 0000000000000000 R11: 0000000000000000 R12: ffff88012e69c000
R13: 0000000000001000 R14: 0000000000000000 R15: 0000000000000000
ORIG_RAX: ffffffffffffffff CS: 0010 SS: 0018
#7 [ffff880130e554b0] intel_pin_and_fence_fb_obj at ffffffffa00c3840 [i915]
#8 [ffff880130e554e0] intel_pipe_set_base at ffffffffa00c6801 [i915]
#9 [ffff880130e55570] intel_crtc_mode_set at ffffffffa00cf804 [i915]
#10 [ffff880130e556b0] drm_crtc_helper_set_mode at ffffffffa0096632 [drm_kms_helper]
#11 [ffff880130e55850] drm_crtc_helper_set_config at ffffffffa0097652 [drm_kms_helper]
#12 [ffff880130e55910] drm_fb_helper_set_par at ffffffffa009509c [drm_kms_helper]
#13 [ffff880130e55950] drm_fb_helper_single_fb_probe at ffffffffa0095260 [drm_kms_helper]
#14 [ffff880130e559a0] drm_fb_helper_hotplug_event at ffffffffa0094fe4 [drm_kms_helper]
#15 [ffff880130e559e0] drm_fb_helper_set_par at ffffffffa00950e6 [drm_kms_helper]
#16 [ffff880130e55a20] fb_set_var at ffffffff8129fc42
#17 [ffff880130e55bf0] fbcon_blank at ffffffff812aea0c
#18 [ffff880130e55cf0] do_unblank_screen at ffffffff813160d0
#19 [ffff880130e55d10] complete_change_console at ffffffff8130b60f
#20 [ffff880130e55d40] vt_ioctl at ffffffff8130d2ed
#21 [ffff880130e55e00] tty_ioctl at ffffffff81301802
#22 [ffff880130e55e60] vfs_ioctl at ffffffff81183c92
#23 [ffff880130e55ea0] do_vfs_ioctl at ffffffff81183e34
#24 [ffff880130e55f30] sys_ioctl at ffffffff811843b1
#25 [ffff880130e55f80] system_call_fastpath at ffffffff8100b172
RIP: 0000003fcaedde87 RSP: 00007ffff5447b68 RFLAGS: 00013202
RAX: 0000000000000010 RBX: ffffffff8100b172 RCX: 0000000000000001
RDX: 0000000000000001 RSI: 0000000000005605 RDI: 0000000000000005
RBP: 0000000000000008 R8: 0000000000000000 R9: 0000000000000001
R10: 696c6320584c4749 R11: 0000000000003246 R12: 00000000007d38a0
R13: 0000000000000000 R14: 00000000007d1188 R15: 0000000000000001
ORIG_RAX: 0000000000000010 CS: 0033 SS: 002b
Can I get a dmesg with drm.debug=4 on the command line for that machine, using the test kernel?
Oh, also: what userspace xorg-x11-drv-intel package is installed? Can you check whether the one from the EL6 nightly makes any difference, if you aren't running it already?
Are you running compiz or any sort of desktop effects? What window manager etc. are you using? Something is unbalancing the pin count, and I'm having trouble reproducing it on similar hardware here.
Also, please attach /var/log/Xorg.0.log.
Okay, I found another upstream patch that didn't hit stable properly.
http://brewweb.devel.redhat.com/brew/taskinfo?taskID=3225635
Will have it in http://download.devel.redhat.com/brewroot/scratch/airlied/ when it finishes.
Target for Snap 5
http://download.devel.redhat.com/brewroot/scratch/airlied/task_3225635/
has the final build. Please test and get back to me, thanks.
This request was evaluated by Red Hat Product Management for inclusion in a Red Hat Enterprise Linux maintenance release. Product Management has requested further review of this request by Red Hat Engineering, for potential inclusion in a Red Hat Enterprise Linux Update release for currently deployed products. This request is not yet committed for inclusion in an Update release.
(In reply to comment #11)
> http://download.devel.redhat.com/brewroot/scratch/airlied/task_3225635/
>
> has the final build please test and get back to me thanks.
I can't reproduce the crash with this kernel. Been trying for the past week. Thank you.
Patch(es) available on kernel-2.6.32-131.0.1.el6
An advisory has been issued which should help the problem described in this bug report. This report is therefore being closed with a resolution of ERRATA. For more information on the solution and/or where to find the updated files, please follow the link below. You may reopen this bug report if the solution does not work for you.
http://rhn.redhat.com/errata/RHSA-2011-0542.html
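The remark above about the pin count being unbalanced can be illustrated with a minimal sketch. This is not the i915 driver's code: `struct demo_obj`, `demo_pin`, `demo_unpin`, and `DEMO_MAX_PIN_COUNT` are hypothetical names invented for illustration, and the report does not say which assertion sits at i915_gem.c:4190, 4192, or 4238. The sketch only shows the general idea that every pin must be matched by exactly one unpin, and that a check fails when the counter goes out of range on either side.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical upper bound on outstanding pins; not taken from the driver. */
#define DEMO_MAX_PIN_COUNT 15

/* Hypothetical stand-in for a buffer object; not the real i915 structure. */
struct demo_obj {
    int pin_count; /* number of pins not yet released */
};

/*
 * Take one pin reference. Returns false where a kernel driver might
 * instead hit a BUG_ON(), i.e. when the counter is already at its limit
 * because earlier pins were never released.
 */
static bool demo_pin(struct demo_obj *obj)
{
    if (obj->pin_count >= DEMO_MAX_PIN_COUNT)
        return false;
    obj->pin_count++;
    return true;
}

/*
 * Release one pin reference. Returns false on an unbalanced unpin,
 * i.e. when there is no outstanding pin left to release.
 */
static bool demo_unpin(struct demo_obj *obj)
{
    if (obj->pin_count <= 0)
        return false;
    obj->pin_count--;
    return true;
}
```

With balanced callers both functions always succeed. A code path that unpins twice, or pins repeatedly without unpinning, eventually trips one of the two checks, which would be consistent with the observation that the crash shows up sometimes in pin and sometimes in unpin.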
Description of problem:
I see a number of similar bugs for Fedora, but none for RHEL. I have not observed this crash with RHEL 6.0 kernel, but I ran into it within 24 hours of using RHEL 6.1 Beta kernel.
SYSTEM MAP: /boot/System.map-2.6.32-122.el6.x86_64
DEBUG KERNEL: /usr/lib/debug/lib/modules/2.6.32-122.el6.x86_64/vmlinux (2.6.32-122.el6.x86_64)
DUMPFILE: vmcore
CPUS: 4
DATE: Fri Mar 25 11:02:26 2011
UPTIME: 13:54:29
LOAD AVERAGE: 1.52, 1.23, 1.17
TASKS: 444
NODENAME: mkhusid
RELEASE: 2.6.32-122.el6.x86_64
VERSION: #1 SMP Wed Mar 9 23:54:34 EST 2011
MACHINE: x86_64 (2527 Mhz)
MEMORY: 3.8 GB
PANIC: "kernel BUG at drivers/gpu/drm/i915/i915_gem.c:4238!"
PID: 11027
COMMAND: "Xorg"
TASK: ffff88012fd7e100 [THREAD_INFO: ffff88011d026000]
CPU: 1
STATE: TASK_RUNNING (PANIC)
crash> bt -l
bt: line numbers are not available
PID: 11027 TASK: ffff88012fd7e100 CPU: 1 COMMAND: "Xorg"
#0 [ffff88011d027570] machine_kexec at ffffffff81030c0b
#1 [ffff88011d0275d0] crash_kexec at ffffffff810b6142
#2 [ffff88011d0276a0] oops_end at ffffffff814dcdc0
#3 [ffff88011d0276d0] die at ffffffff8100f2fb
#4 [ffff88011d027700] do_trap at ffffffff814dc6b4
#5 [ffff88011d027760] do_invalid_op at ffffffff8100ceb5
#6 [ffff88011d027800] invalid_op at ffffffff8100bf5b
[exception RIP: i915_gem_object_unpin+163]
RIP: ffffffffa00b65a3 RSP: ffff88011d0278b8 RFLAGS: 00010246
RAX: 000000000003c07c RBX: ffff88012df2f200 RCX: ffff88012e6c8000
RDX: 000000000003c000 RSI: 0000000000000003 RDI: ffff88012df2f200
RBP: ffff88011d0278c8 R8: ffff88012e682148 R9: 0000000000000000
R10: ffff88012ee6d438 R11: 0000000000000000 R12: ffff88012e6c8000
R13: ffff88012ee6d450 R14: ffff880131206c00 R15: ffff88012ee6d420
ORIG_RAX: ffffffffffffffff CS: 0010 SS: 0018
#7 [ffff88011d0278b0] mutex_lock at ffffffff814da84e
#8 [ffff88011d0278d0] intel_crtc_disable at ffffffffa00c2d3a [i915]
#9 [ffff88011d0278f0] drm_helper_disable_unused_functions at ffffffffa00960cd [drm_kms_helper]
#10 [ffff88011d027920] drm_crtc_helper_set_config at ffffffffa0097777 [drm_kms_helper]
#11 [ffff88011d0279e0] drm_fb_helper_set_par at ffffffffa009509c [drm_kms_helper]
#12 [ffff88011d027a20] fb_set_var at ffffffff8129f4a2
#13 [ffff88011d027bf0] fbcon_blank at ffffffff812ae26c
#14 [ffff88011d027cf0] do_unblank_screen at ffffffff81315930
#15 [ffff88011d027d10] complete_change_console at ffffffff8130ae6f
#16 [ffff88011d027d40] vt_ioctl at ffffffff8130cb4d
#17 [ffff88011d027e00] tty_ioctl at ffffffff81301062
#18 [ffff88011d027e60] vfs_ioctl at ffffffff811836a2
#19 [ffff88011d027ea0] do_vfs_ioctl at ffffffff81183844
#20 [ffff88011d027f30] sys_ioctl at ffffffff81183dc1
#21 [ffff88011d027f80] system_call_fastpath at ffffffff8100b172
RIP: 00007f9c304aae87 RSP: 00007fffd5f41898 RFLAGS: 00013202
RAX: 0000000000000010 RBX: ffffffff8100b172 RCX: 0000000000000084
RDX: 0000000000000001 RSI: 0000000000005605 RDI: 0000000000000005
RBP: 0000000000000008 R8: 0000000000000040 R9: 0000000000000001
R10: 696c6320584c4749 R11: 0000000000003246 R12: 00000000007d38a0
R13: 0000000000000000 R14: 00000000007d1188 R15: 0000000000000001
ORIG_RAX: 0000000000000010 CS: 0033 SS: 002b
Version-Release number of selected component (if applicable):
2.6.32-122.el6.x86_64
How reproducible:
Very rarely
Steps to Reproduce:
1. Boot computer up with closed lid
2. Undock (computer automatically