Bug 2260772 - i915 driver crashes in kernel 6.6.13
Summary: i915 driver crashes in kernel 6.6.13
Keywords:
Status: CLOSED EOL
Alias: None
Product: Fedora
Classification: Fedora
Component: kernel
Version: 39
Hardware: x86_64
OS: Linux
Priority: unspecified
Severity: high
Target Milestone: ---
Assignee: Kernel Maintainer List
QA Contact: Fedora Extras Quality Assurance
URL: https://pastebin.com/246SqcHu
Whiteboard:
Depends On:
Blocks:
Reported: 2024-01-28 17:18 UTC by Nikolas Nyby
Modified: 2024-11-27 22:48 UTC

Fixed In Version:
Clone Of:
Environment:
Last Closed: 2024-11-27 22:48:44 UTC
Type: ---
Embargoed:



Description Nikolas Nyby 2024-01-28 17:18:08 UTC
1. Please describe the problem:

This kernel fails to boot on my Dell XPS 8940 desktop with Intel graphics.

2. What is the Version-Release number of the kernel:

6.6.13-200.fc39.x86_64

3. Did it work previously in Fedora? If so, what kernel version did the issue
   *first* appear?  Old kernels are available for download at
   https://koji.fedoraproject.org/koji/packageinfo?packageID=8 :

Yes, I am currently using kernel 6.5.12-300.fc39.x86_64 as this is the most recent kernel that works for me. The problem appeared with kernel 6.6.


4. Can you reproduce this issue? If so, please provide the steps to reproduce
   the issue below:

Yes. To reproduce, I attempt to boot any 6.6 kernel on this computer.

5. Does this problem occur with the latest Rawhide kernel? To install the
   Rawhide kernel, run ``sudo dnf install fedora-repos-rawhide`` followed by
   ``sudo dnf update --enablerepo=rawhide kernel``:

I haven't tried the rawhide kernel.

6. Are you running any modules that are not shipped directly with Fedora's kernel?:

No

7. Please attach the kernel logs. You can get the complete kernel log
   for a boot with ``journalctl --no-hostname -k > dmesg.txt``. If the
   issue occurred on a previous boot, use the journalctl ``-b`` flag.

See pastebin link for complete log: https://pastebin.com/246SqcHu

Jan 27 16:39:04  kernel: BUG: kernel NULL pointer dereference, address: 000000000000007c
Jan 27 16:39:04  kernel: #PF: supervisor read access in kernel mode
Jan 27 16:39:04  kernel: #PF: error_code(0x0000) - not-present page
Jan 27 16:39:04  kernel: PGD 0 P4D 0
Jan 27 16:39:04  kernel: Oops: 0000 [#1] PREEMPT SMP NOPTI
Jan 27 16:39:04  kernel: CPU: 6 PID: 488 Comm: (udev-worker) Not tainted 6.6.13-200.fc39.x86_64 #1
Jan 27 16:39:04  kernel: Hardware name: Dell Inc. XPS 8940/0K3CM7, BIOS 2.16.0 11/02/2023
Jan 27 16:39:04  kernel: RIP: 0010:icl_aux_pw_to_phy.isra.0+0x35/0x40 [i915]
Jan 27 16:39:04  kernel: Code: 76 08 0f b6 d2 48 8d 04 52 f6 41 11 01 48 89 fb 48 8b 16 48 8d 04 c2 0f b6 70 14 8d 46 fa 0f 45 f0 e8 ef eb ff ff 48 89 df 5b <8b> 70 7c e9 83 9e fe ff 0f 1f 00 90 90 90 90 90 90 90 90 90 90 90
Jan 27 16:39:04  kernel: RSP: 0018:ffffc900020b3ab0 EFLAGS: 00010246
Jan 27 16:39:04  kernel: RAX: 0000000000000000 RBX: ffff88810d600000 RCX: ffffffffc06dc958
Jan 27 16:39:04  kernel: RDX: 0000000000000000 RSI: 0000000000000003 RDI: ffff88810d600000
Jan 27 16:39:04  kernel: RBP: ffff888101df0b60 R08: ffffffffc074c8a8 R09: 0000000000000000
Jan 27 16:39:04  kernel: R10: 0000000000000001 R11: 0000000000000100 R12: ffff88810d600b30
Jan 27 16:39:04  kernel: R13: 0000000000000001 R14: ffff8881023e60c0 R15: ffff8881023e6000
Jan 27 16:39:04  kernel: FS: 00007f0f19125980(0000) GS:ffff888451580000(0000) knlGS:0000000000000000
Jan 27 16:39:04  kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jan 27 16:39:04  kernel: CR2: 000000000000007c CR3: 000000010d7be002 CR4: 0000000000770ee0
Jan 27 16:39:04  kernel: PKRU: 55555554
Jan 27 16:39:04  kernel: Call Trace:
Jan 27 16:39:04  kernel: <TASK>
Jan 27 16:39:04  kernel: ? __die+0x23/0x70
Jan 27 16:39:04  kernel: ? page_fault_oops+0x171/0x4e0
Jan 27 16:39:04  kernel: ? __intel_display_power_get_domain.part.0+0x52/0x70 [i915]
Jan 27 16:39:04  kernel: ? ktime_get+0x3c/0xa0
Jan 27 16:39:04  kernel: ? exc_page_fault+0x7f/0x180
Jan 27 16:39:04  kernel: ? asm_exc_page_fault+0x26/0x30
Jan 27 16:39:04  kernel: ? icl_aux_pw_to_phy.isra.0+0x35/0x40 [i915]
Jan 27 16:39:04  kernel: icl_aux_power_well_disable+0x39/0x1d0 [i915]
Jan 27 16:39:04  kernel: intel_power_domains_sanitize_state+0x91/0xb0 [i915]
Jan 27 16:39:04  kernel: intel_display_driver_probe_nogem+0x163/0x210 [i915]
Jan 27 16:39:04  kernel: i915_driver_probe+0x6d2/0xb90 [i915]
Jan 27 16:39:04  kernel: local_pci_probe+0x42/0xa0
Jan 27 16:39:04  kernel: pci_device_probe+0xc7/0x240
Jan 27 16:39:04  kernel: really_probe+0x19b/0x3e0
Jan 27 16:39:04  kernel: ? __pfx___driver_attach+0x10/0x10
Jan 27 16:39:04  kernel: __driver_probe_device+0x78/0x160
Jan 27 16:39:04  kernel: driver_probe_device+0x1f/0x90
Jan 27 16:39:04  kernel: __driver_attach+0xd2/0x1c0
Jan 27 16:39:04  kernel: bus_for_each_dev+0x85/0xd0
Jan 27 16:39:04  kernel: bus_add_driver+0x116/0x220
Jan 27 16:39:04  kernel: driver_register+0x59/0x100
Jan 27 16:39:04  kernel: i915_init+0x22/0xc0 [i915]
Jan 27 16:39:04  kernel: ? __pfx_i915_init+0x10/0x10 [i915]
Jan 27 16:39:04  kernel: do_one_initcall+0x5a/0x320
Jan 27 16:39:04  kernel: do_init_module+0x60/0x240
Jan 27 16:39:04  kernel: __do_sys_init_module+0x17f/0x1b0
Jan 27 16:39:04  kernel: ? __seccomp_filter+0x32c/0x510
Jan 27 16:39:04  kernel: do_syscall_64+0x5d/0x90
Jan 27 16:39:04  kernel: ? syscall_exit_to_user_mode+0x2b/0x40
Jan 27 16:39:04  kernel: ? do_syscall_64+0x6c/0x90
Jan 27 16:39:04  kernel: ? syscall_exit_to_user_mode+0x2b/0x40
Jan 27 16:39:04  kernel: ? do_syscall_64+0x6c/0x90
Jan 27 16:39:04  kernel: ? exc_page_fault+0x7f/0x180
Jan 27 16:39:04  kernel: entry_SYSCALL_64_after_hwframe+0x6e/0xd8
Jan 27 16:39:04  kernel: RIP: 0033:0x7f0f19b1083e
Jan 27 16:39:04  kernel: Code: 48 8b 0d f5 15 0c 00 f7 d8 64 89 01 48 83 c8 ff c3 66 2e 0f 1f 84 00 00 00 00 00 90 f3 0f 1e fa 49 89 ca b8 af 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d c2 15 0c 00 f7 d8 64 89 01 48
Jan 27 16:39:04  kernel: RSP: 002b:00007ffd730e3048 EFLAGS: 00000246 ORIG_RAX: 00000000000000af
Jan 27 16:39:04  kernel: RAX: ffffffffffffffda RBX: 000055dada1f8a20 RCX: 00007f0f19b1083e
Jan 27 16:39:04  kernel: RDX: 00007f0f19c1607d RSI: 000000000089e4b6 RDI: 000055dadaab5610
Jan 27 16:39:04  kernel: RBP: 00007ffd730e3100 R08: 000055dada1a8010 R09: 0000000000000007
Jan 27 16:39:04  kernel: R10: 0000000000000006 R11: 0000000000000246 R12: 00007f0f19c1607d
Jan 27 16:39:04  kernel: R13: 0000000000020000 R14: 000055dada1e0300 R15: 000055dada1db510
Jan 27 16:39:04  kernel: </TASK>
Jan 27 16:39:04  kernel: Modules linked in: i915(+) crct10dif_pclmul crc32_pclmul crc32c_intel i2c_algo_bit drm_buddy polyval_clmulni polyval_generic ttm drm_display_helper ghash_clmulni_intel sha512_ssse3 uas sha256_ssse3 r8169 usb_storage sha1_ssse3 cec video wmi pinctrl_cannonlake scsi_dh_rdac scsi_dh_emc scsi_dh_alua dm_multipath fuse
Jan 27 16:39:04  kernel: CR2: 000000000000007c
Jan 27 16:39:04  kernel: ---[ end trace 0000000000000000 ]---
Jan 27 16:39:04  kernel: RIP: 0010:icl_aux_pw_to_phy.isra.0+0x35/0x40 [i915]
Jan 27 16:39:04  kernel: Code: 76 08 0f b6 d2 48 8d 04 52 f6 41 11 01 48 89 fb 48 8b 16 48 8d 04 c2 0f b6 70 14 8d 46 fa 0f 45 f0 e8 ef eb ff ff 48 89 df 5b <8b> 70 7c e9 83 9e fe ff 0f 1f 00 90 90 90 90 90 90 90 90 90 90 90
Jan 27 16:39:04  kernel: RSP: 0018:ffffc900020b3ab0 EFLAGS: 00010246
Jan 27 16:39:04  kernel: RAX: 0000000000000000 RBX: ffff88810d600000 RCX: ffffffffc06dc958
Jan 27 16:39:04  kernel: RDX: 0000000000000000 RSI: 0000000000000003 RDI: ffff88810d600000
Jan 27 16:39:04  kernel: RBP: ffff888101df0b60 R08: ffffffffc074c8a8 R09: 0000000000000000
Jan 27 16:39:04  kernel: R10: 0000000000000001 R11: 0000000000000100 R12: ffff88810d600b30
Jan 27 16:39:04  kernel: R13: 0000000000000001 R14: ffff8881023e60c0 R15: ffff8881023e6000
Jan 27 16:39:04  kernel: FS: 00007f0f19125980(0000) GS:ffff888451580000(0000) knlGS:0000000000000000
Jan 27 16:39:04  kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jan 27 16:39:04  kernel: CR2: 000000000000007c CR3: 000000010d7be002 CR4: 0000000000770ee0
Jan 27 16:39:04  kernel: PKRU: 55555554

Reproducible: Always
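
The oops above can be cross-checked by hand. The following is a minimal Python sketch, not a general disassembler: it decodes only the one instruction form seen here, and the helper names (`fault_address`, `faulting_bytes`, `register`, `decode_mov_disp8`) are made up for illustration. It takes the byte marked `<8b>` in the `Code:` line, decodes `8b 70 7c` as a 32-bit load from a base register plus an 8-bit displacement, and confirms that RAX + 0x7c equals the reported fault address. That is consistent with a field at offset 0x7c being read through a NULL pointer held in RAX.

```python
import re

# Lines copied from the oops in this report (timestamps dropped; only the
# tail of the Code: line is kept, starting a few bytes before the <..> mark).
OOPS = """\
BUG: kernel NULL pointer dereference, address: 000000000000007c
RIP: 0010:icl_aux_pw_to_phy.isra.0+0x35/0x40 [i915]
Code: 48 89 df 5b <8b> 70 7c e9 83 9e fe ff
RAX: 0000000000000000 RBX: ffff88810d600000 RCX: ffffffffc06dc958
"""

def fault_address(text):
    """Return the faulting address from the 'BUG:' line."""
    return int(re.search(r"address: ([0-9a-f]+)", text).group(1), 16)

def faulting_bytes(text):
    """Return the code bytes starting at the one the kernel wrapped in <..>."""
    tokens = re.search(r"Code: (.*)", text).group(1).split()
    start = next(i for i, t in enumerate(tokens) if t.startswith("<"))
    return [int(t.strip("<>"), 16) for t in tokens[start:]]

def register(text, name):
    """Return the value of a 64-bit register from the register dump."""
    return int(re.search(rf"\b{name}: ([0-9a-f]{{16}})", text).group(1), 16)

def decode_mov_disp8(code):
    """Decode only 'MOV r32, [base + disp8]' (opcode 8b, ModRM mod=01, no SIB)."""
    opcode, modrm, disp8 = code[0], code[1], code[2]
    assert opcode == 0x8B, "sketch handles MOV r32, r/m32 only"
    mod, reg, rm = modrm >> 6, (modrm >> 3) & 7, modrm & 7
    assert mod == 0b01 and rm != 0b100, "sketch handles base + disp8 only"
    dest = ["eax", "ecx", "edx", "ebx", "esp", "ebp", "esi", "edi"][reg]
    base = ["rax", "rcx", "rdx", "rbx", "rsp", "rbp", "rsi", "rdi"][rm]
    disp = disp8 - 256 if disp8 >= 128 else disp8  # sign-extend disp8
    return dest, base, disp

dest, base, disp = decode_mov_disp8(faulting_bytes(OOPS))
print(f"mov {dest}, [{base}+{disp:#x}]")  # prints: mov esi, [rax+0x7c]

effective = register(OOPS, base.upper()) + disp
print(effective == fault_address(OOPS))   # prints: True
```

The bytes just before the marked one (`48 89 df 5b`) are left undecoded; the sketch only shows that the faulting load and the register dump agree with the address in CR2.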

Comment 1 Nikolas Nyby 2024-01-28 17:36:24 UTC
Also, here is my video card info from lspci, in case that's helpful:

00:02.0 VGA compatible controller: Intel Corporation RocketLake-S GT1 [UHD Graphics 730] (rev 04) (prog-if 00 [VGA controller])
	DeviceName: Onboard - Video
	Subsystem: Dell Device 09c5
	Flags: bus master, fast devsel, latency 0, IRQ 125
	Memory at b0000000 (64-bit, non-prefetchable) [size=16M]
	Memory at a0000000 (64-bit, prefetchable) [size=256M]
	I/O ports at 4000 [size=64]
	Expansion ROM at 000c0000 [virtual] [disabled] [size=128K]
	Capabilities: [40] Vendor Specific Information: Len=0c <?>
	Capabilities: [70] Express Root Complex Integrated Endpoint, MSI 00
	Capabilities: [ac] MSI: Enable+ Count=1/1 Maskable+ 64bit-
	Capabilities: [d0] Power Management version 2
	Capabilities: [100] Process Address Space ID (PASID)
	Capabilities: [200] Address Translation Service (ATS)
	Capabilities: [300] Page Request Interface (PRI)
	Kernel driver in use: i915
	Kernel modules: i915

Comment 2 Nikolas Nyby 2024-01-31 03:49:44 UTC
The same bug also occurs for me with vanilla kernel 6.7.2, so it may be an upstream bug in the i915 driver.

Jan 30 22:41:04  kernel: BUG: kernel NULL pointer dereference, address: 000000000000007c
Jan 30 22:41:04  kernel: fbcon: Taking over console
Jan 30 22:41:04  kernel: #PF: supervisor read access in kernel mode
Jan 30 22:41:04  kernel: #PF: error_code(0x0000) - not-present page
Jan 30 22:41:04  kernel: PGD 0 P4D 0 
Jan 30 22:41:04  kernel: Oops: 0000 [#1] PREEMPT SMP NOPTI
Jan 30 22:41:04  kernel: CPU: 8 PID: 463 Comm: (udev-worker) Not tainted 6.7.2-250.vanilla.fc39.x86_64 #1
Jan 30 22:41:04  kernel: Hardware name: Dell Inc. XPS 8940/0K3CM7, BIOS 2.16.0 11/02/2023
Jan 30 22:41:04  kernel: RIP: 0010:icl_aux_pw_to_phy.isra.0+0x35/0x40 [i915]
Jan 30 22:41:04  kernel: Code: 76 08 0f b6 d2 48 8d 04 52 f6 41 11 01 48 89 fb 48 8b 16 48 8d 04 c2 0f b6 70 14 8d 46 fa 0f 45 f0 e8 7f ed ff ff 48 89 df 5b <8b> 70 7c e9 e3 93 fe ff 0f 1f 00 90 90 90 90 90 90 90 90 90 90 90
Jan 30 22:41:04  kernel: RSP: 0018:ffffa96381a37a40 EFLAGS: 00010246
Jan 30 22:41:04  kernel: RAX: 0000000000000000 RBX: ffff989bc1940000 RCX: ffffffffc0800e98
Jan 30 22:41:04  kernel: RDX: 0000000000000000 RSI: 0000000000000003 RDI: ffff989bc1940000
Jan 30 22:41:04  kernel: RBP: ffff989bc279c760 R08: ffffffffc086ffb1 R09: 0000000000000000
Jan 30 22:41:04  kernel: R10: 0000000000000001 R11: 0000000000000100 R12: ffff989bc1940b08
Jan 30 22:41:04  kernel: R13: 0000000000000001 R14: ffff989bc269d0c0 R15: ffff989bc269d000
Jan 30 22:41:04  kernel: FS:  00007fd3316a9980(0000) GS:ffff989f11400000(0000) knlGS:0000000000000000
Jan 30 22:41:04  kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jan 30 22:41:04  kernel: CR2: 000000000000007c CR3: 0000000101a0c001 CR4: 0000000000770ef0
Jan 30 22:41:04  kernel: PKRU: 55555554
Jan 30 22:41:04  kernel: Call Trace:
Jan 30 22:41:04  kernel:  <TASK>
Jan 30 22:41:04  kernel:  ? __die+0x23/0x70
Jan 30 22:41:04  kernel:  ? page_fault_oops+0x171/0x4e0
Jan 30 22:41:04  kernel:  ? __intel_display_power_get_domain.part.0+0x52/0x70 [i915]
Jan 30 22:41:04  kernel:  ? ktime_get+0x3c/0xa0
Jan 30 22:41:04  kernel:  ? exc_page_fault+0x7f/0x180
Jan 30 22:41:04  kernel:  ? asm_exc_page_fault+0x26/0x30
Jan 30 22:41:04  kernel:  ? icl_aux_pw_to_phy.isra.0+0x35/0x40 [i915]
Jan 30 22:41:04  kernel:  icl_aux_power_well_disable+0x39/0x1d0 [i915]
Jan 30 22:41:04  kernel:  intel_power_domains_sanitize_state+0x91/0xb0 [i915]
Jan 30 22:41:04  kernel:  intel_display_driver_probe_nogem+0x163/0x210 [i915]
Jan 30 22:41:04  kernel:  i915_driver_probe+0x707/0xbe0 [i915]
Jan 30 22:41:04  kernel:  local_pci_probe+0x42/0xa0
Jan 30 22:41:04  kernel:  pci_device_probe+0xc7/0x240
Jan 30 22:41:04  kernel:  really_probe+0x19b/0x3e0
Jan 30 22:41:04  kernel:  ? __pfx___driver_attach+0x10/0x10
Jan 30 22:41:04  kernel:  __driver_probe_device+0x78/0x160
Jan 30 22:41:04  kernel:  driver_probe_device+0x1f/0x90
Jan 30 22:41:04  kernel:  __driver_attach+0xd2/0x1c0
Jan 30 22:41:04  kernel:  bus_for_each_dev+0x85/0xd0
Jan 30 22:41:04  kernel:  bus_add_driver+0x116/0x220
Jan 30 22:41:04  kernel:  driver_register+0x59/0x100
Jan 30 22:41:04  kernel:  i915_init+0x22/0xc0 [i915]
Jan 30 22:41:04  kernel:  ? __pfx_i915_init+0x10/0x10 [i915]
Jan 30 22:41:04  kernel:  do_one_initcall+0x58/0x320
Jan 30 22:41:04  kernel:  do_init_module+0x60/0x240
Jan 30 22:41:04  kernel:  __do_sys_init_module+0x17f/0x1b0
Jan 30 22:41:04  kernel:  ? __seccomp_filter+0x32c/0x4f0
Jan 30 22:41:04  kernel:  do_syscall_64+0x61/0xe0
Jan 30 22:41:04  kernel:  ? __count_memcg_events+0x42/0x90
Jan 30 22:41:04  kernel:  ? count_memcg_events.constprop.0+0x1a/0x30
Jan 30 22:41:04  kernel:  ? handle_mm_fault+0xa2/0x360
Jan 30 22:41:04  kernel:  ? do_user_addr_fault+0x304/0x670
Jan 30 22:41:04  kernel:  ? exc_page_fault+0x7f/0x180
Jan 30 22:41:04  kernel:  entry_SYSCALL_64_after_hwframe+0x6e/0x76
Jan 30 22:41:04  kernel: RIP: 0033:0x7fd33209483e
Jan 30 22:41:04  kernel: Code: 48 8b 0d f5 15 0c 00 f7 d8 64 89 01 48 83 c8 ff c3 66 2e 0f 1f 84 00 00 00 00 00 90 f3 0f 1e fa 49 89 ca b8 af 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d c2 15 0c 00 f7 d8 64 89 01 48
Jan 30 22:41:04  kernel: RSP: 002b:00007ffc63f64988 EFLAGS: 00000246 ORIG_RAX: 00000000000000af
Jan 30 22:41:04  kernel: RAX: ffffffffffffffda RBX: 00005587334eb1c0 RCX: 00007fd33209483e
Jan 30 22:41:04  kernel: RDX: 00007fd33219a07d RSI: 00000000008adf1e RDI: 0000558733ddb220
Jan 30 22:41:04  kernel: RBP: 00007ffc63f64a40 R08: 00005587334c1010 R09: 0000000000000007
Jan 30 22:41:04  kernel: R10: 0000000000000001 R11: 0000000000000246 R12: 00007fd33219a07d
Jan 30 22:41:04  kernel: R13: 0000000000020000 R14: 00005587334e8ad0 R15: 00005587334ec400
Jan 30 22:41:04  kernel:  </TASK>
Jan 30 22:41:04  kernel: Modules linked in: i915(+) crct10dif_pclmul crc32_pclmul crc32c_intel polyval_clmulni i2c_algo_bit polyval_generic drm_buddy ttm ghash_clmulni_intel sha512_ssse3 drm_display_helper r8169 sha256_ssse3 sha1_ssse3 realtek cec video wmi pinctrl_cannonlake scsi_dh_rdac scsi_dh_emc scsi_dh_alua dm_multipath fuse
Jan 30 22:41:04  kernel: CR2: 000000000000007c
Jan 30 22:41:04  kernel: ---[ end trace 0000000000000000 ]---
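
To check that the 6.6.13 and 6.7.2 oopses follow the same code path, the two call traces can be compared with the offsets stripped. This is a rough Python sketch, assuming the frames are pasted in without timestamps; the `symbols` helper is hypothetical. Frames prefixed with `?` are skipped, since the kernel marks them as addresses found on the stack that are not confirmed to be part of the call chain.

```python
import re

# The first reliable frames from each trace in this report.
TRACE_6_6_13 = """\
icl_aux_power_well_disable+0x39/0x1d0 [i915]
intel_power_domains_sanitize_state+0x91/0xb0 [i915]
intel_display_driver_probe_nogem+0x163/0x210 [i915]
i915_driver_probe+0x6d2/0xb90 [i915]
local_pci_probe+0x42/0xa0
pci_device_probe+0xc7/0x240
really_probe+0x19b/0x3e0
? __pfx___driver_attach+0x10/0x10
__driver_probe_device+0x78/0x160
"""

TRACE_6_7_2 = """\
 icl_aux_power_well_disable+0x39/0x1d0 [i915]
 intel_power_domains_sanitize_state+0x91/0xb0 [i915]
 intel_display_driver_probe_nogem+0x163/0x210 [i915]
 i915_driver_probe+0x707/0xbe0 [i915]
 local_pci_probe+0x42/0xa0
 pci_device_probe+0xc7/0x240
 really_probe+0x19b/0x3e0
 ? __pfx___driver_attach+0x10/0x10
 __driver_probe_device+0x78/0x160
"""

FRAME = re.compile(r"\s*(\?\s+)?([\w.]+)\+0x[0-9a-f]+/0x[0-9a-f]+")

def symbols(trace):
    """Return function names of the reliable frames, without offsets."""
    names = []
    for line in trace.splitlines():
        m = FRAME.match(line)
        if m and not m.group(1):  # drop '?' frames
            names.append(m.group(2))
    return names

# Offsets differ between builds (e.g. i915_driver_probe), names should not.
print(symbols(TRACE_6_6_13) == symbols(TRACE_6_7_2))
```

For these excerpts the symbol chains match, which supports the observation that both kernels fail on the same path through `intel_power_domains_sanitize_state`.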

Comment 3 Nikolas Nyby 2024-01-31 03:57:49 UTC
Created i915 bug here: https://gitlab.freedesktop.org/drm/intel/-/issues/10184

Comment 4 Aoife Moloney 2024-11-13 11:56:14 UTC
This message is a reminder that Fedora Linux 39 is nearing its end of life.
Fedora will stop maintaining and issuing updates for Fedora Linux 39 on 2024-11-26.
It is Fedora's policy to close all bug reports from releases that are no longer
maintained. At that time this bug will be closed as EOL if it remains open with a
'version' of '39'.

Package Maintainer: If you wish for this bug to remain open because you
plan to fix it in a currently maintained version, change the 'version' 
to a later Fedora Linux version. Note that the version field may be hidden.
Click the "Show advanced fields" button if you do not see it.

Thank you for reporting this issue and we are sorry that we were not 
able to fix it before Fedora Linux 39 is end of life. If you would still like 
to see this bug fixed and are able to reproduce it against a later version 
of Fedora Linux, you are encouraged to change the 'version' to a later version
prior to this bug being closed.

Comment 5 Aoife Moloney 2024-11-27 22:48:44 UTC
Fedora Linux 39 entered end-of-life (EOL) status on 2024-11-26.

Fedora Linux 39 is no longer maintained, which means that it
will not receive any further security or bug fix updates. As a result we
are closing this bug.

If you can reproduce this bug against a currently maintained version of Fedora Linux
please feel free to reopen this bug against that version. Note that the version
field may be hidden. Click the "Show advanced fields" button if you do not see
the version field.

If you are unable to reopen this bug, please file a new report against an
active release.

Thank you for reporting this bug and we are sorry it could not be fixed.

