Bug 2372819 - Booting laptop on battery causes "fatal error during GPU init" on amdgpu and sets incorrect refresh rate for internal display
Keywords:
Status: NEW
Alias: None
Product: Fedora
Classification: Fedora
Component: kernel
Version: 42
Hardware: x86_64
OS: Linux
Priority: unspecified
Severity: high
Target Milestone: ---
Assignee: Kernel Maintainer List
QA Contact: Fedora Extras Quality Assurance
Reported: 2025-06-13 22:31 UTC by Andre Costa
Modified: 2025-06-23 21:03 UTC (History)


Attachments
boot messages with charger unplugged (111.23 KB, text/plain)
2025-06-13 22:32 UTC, Andre Costa
boot messages with charger plugged (112.84 KB, text/plain)
2025-06-13 22:33 UTC, Andre Costa
kernel log for kernel 6.16.0rc2 (168.38 KB, text/plain)
2025-06-23 21:03 UTC, Andre Costa

Description Andre Costa 2025-06-13 22:31:12 UTC
1. Please describe the problem:

When I boot my laptop on battery, the 144Hz internal display is not properly recognized, and the refresh rate is fixed at 60Hz. This does not happen when booting with the charger plugged in.

2. What is the Version-Release number of the kernel:
6.14.9-300

3. Did it work previously in Fedora? If so, what kernel version did the issue
   *first* appear?  Old kernels are available for download at
   https://koji.fedoraproject.org/koji/packageinfo?packageID=8 :

I can't say for sure, but I believe so. I cannot pinpoint the exact version, though.

4. Can you reproduce this issue? If so, please provide the steps to reproduce
   the issue below:
Yes. Booting on battery produces these messages in the logs:

[    8.311152] amdgpu 0000:05:00.0: enabling device (0006 -> 0007)
[    8.311219] [drm] initializing kernel modesetting (RENOIR 0x1002:0x1636 0x1025:0x1455 0xC6).
[    8.311536] [drm] register mmio base: 0xD1500000
[    8.311539] [drm] register mmio size: 524288
[    8.311550] workqueue: Failed to create a rescuer kthread for wq "amdgpu-reset-dev": -EINTR
[    8.311557] [drm:amdgpu_reset_create_reset_domain [amdgpu]] *ERROR* Failed to allocate wq for amdgpu_reset_domain!
[    8.312266] amdgpu 0000:05:00.0: amdgpu: Fatal error during GPU init
[    8.312272] amdgpu 0000:05:00.0: amdgpu: amdgpu: finishing device.
[    8.312351] amdgpu 0000:05:00.0: probe with driver amdgpu failed with error -12

As a result, the internal display is not properly recognized and the refresh rate is fixed at 60Hz.

Booting with the charger plugged in leads to proper display recognition:

[    5.709804] amdgpu 0000:05:00.0: enabling device (0006 -> 0007)
[    5.709848] [drm] initializing kernel modesetting (RENOIR 0x1002:0x1636 0x1025:0x1455 0xC6).
[    5.709913] [drm] register mmio base: 0xD1500000
[    5.709915] [drm] register mmio size: 524288
[    5.712905] amdgpu 0000:05:00.0: amdgpu: detected ip block number 0 <soc15_common>
[    5.712910] amdgpu 0000:05:00.0: amdgpu: detected ip block number 1 <gmc_v9_0>
[    5.712912] amdgpu 0000:05:00.0: amdgpu: detected ip block number 2 <vega10_ih>
[    5.712915] amdgpu 0000:05:00.0: amdgpu: detected ip block number 3 <psp>
...
[    6.576357] amdgpu: Topology: Add dGPU node [0x1636:0x1002]
[    6.576369] kfd kfd: amdgpu: added device 1002:1636
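
The difference between the two boots can also be checked mechanically against a saved kernel log. The following is a minimal sketch, assuming Python 3 is available; the function name `find_amdgpu_init_failures` and the pattern list are illustrative, built only from the battery-boot messages quoted above, and are not an exhaustive list of amdgpu errors.

```python
import re

# Failure signatures copied from the battery-boot log quoted in this report.
# Assumption: this short list is illustrative only, not a complete set of
# messages that amdgpu can emit when initialization fails.
FAILURE_PATTERNS = [
    re.compile(r'Failed to create a rescuer kthread for wq "amdgpu-reset-dev"'),
    re.compile(r"Failed to allocate wq for amdgpu_reset_domain"),
    re.compile(r"amdgpu: Fatal error during GPU init"),
    re.compile(r"probe with driver amdgpu failed with error"),
]


def find_amdgpu_init_failures(log_text):
    """Return the log lines matching any failure signature, in log order."""
    hits = []
    for line in log_text.splitlines():
        if any(pattern.search(line) for pattern in FAILURE_PATTERNS):
            hits.append(line.strip())
    return hits
```

Passing the text of a battery-boot log to `find_amdgpu_init_failures` would be expected to return the four error lines quoted above, while a log from a boot with the charger plugged in should yield an empty list if the driver initialized normally.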

5. Does this problem occur with the latest Rawhide kernel? To install the
   Rawhide kernel, run ``sudo dnf install fedora-repos-rawhide`` followed by
   ``sudo dnf update --enablerepo=rawhide kernel``:

Haven't tested, but this has been happening for quite some time (many kernel versions) already.

6. Are you running any modules that are not shipped directly with Fedora's kernel?:
Yes, Nvidia proprietary driver 575.57.08

7. Please attach the kernel logs. You can get the complete kernel log
   for a boot with ``journalctl --no-hostname -k > dmesg.txt``. If the
   issue occurred on a previous boot, use the journalctl ``-b`` flag.

Reproducible: Always
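
Because the failure depends on the power source, it may help to record the AC adapter state alongside each kernel log. The following is a minimal sketch, assuming the standard Linux sysfs power-supply class under `/sys/class/power_supply`, where a mains adapter reports `type` as `Mains` and exposes an `online` attribute. The function name `ac_online` and its `root` parameter are illustrative. It reports the state at the time it runs, not at the moment of boot.

```python
from pathlib import Path


def ac_online(root="/sys/class/power_supply"):
    """Return True if any mains supply is online, False if mains supplies
    exist but none is online, or None if no mains supply could be read."""
    base = Path(root)
    if not base.is_dir():
        return None
    found = None
    for supply in sorted(base.iterdir()):
        try:
            # Skip batteries and other non-mains supplies.
            if (supply / "type").read_text().strip() != "Mains":
                continue
            online = (supply / "online").read_text().strip() == "1"
        except OSError:
            continue  # entry without readable type/online attributes
        if online:
            return True
        found = False
    return found
```

The adapter's directory name varies between machines, which is why the sketch filters on the `type` attribute instead of a fixed name.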

Comment 1 Andre Costa 2025-06-13 22:32:47 UTC
Created attachment 2093916
boot messages with charger unplugged

Comment 2 Andre Costa 2025-06-13 22:33:17 UTC
Created attachment 2093917
boot messages with charger plugged

Comment 3 Andre Costa 2025-06-14 14:10:48 UTC
It is important to note that the problem also manifests with nouveau (i.e., without the Nvidia proprietary driver). Some additional info and context can be found here: https://forums.developer.nvidia.com/t/570-release-feedback-discussion/321956/614?u=andre.ocosta

Comment 4 Andre Costa 2025-06-23 21:02:14 UTC
I just checked with the latest kernel from rawhide (6.16.0-rc2), and the problem also manifests there, and in a worse way: the internal display fails to initialize at all (rather than only running at the wrong refresh rate). I'll attach the corresponding dmesg messages.

Comment 5 Andre Costa 2025-06-23 21:03:05 UTC
Created attachment 2094897
kernel log for kernel 6.16.0rc2
