Suddenly the shell crashed. I had to shut it down. I try to get logs but they are cut off somehow (journalctl -b -1 -r) ul 08 16:15:45 fedora kernel: watchdog: BUG: soft lockup - CPU#3 stuck for 101s! [CPU 4/KVM:5216] Jul 08 16:15:41 fedora kernel: </TASK> Jul 08 16:15:41 fedora kernel: R13: 00000000c0206440 R14: 0000000000000016 R15: 000056150d677920 Jul 08 16:15:41 fedora kernel: R10: 000056150d580010 R11: 0000000000000246 R12: 00007ffc3ec1dff0 Jul 08 16:15:41 fedora kernel: RBP: 00007ffc3ec1dfa0 R08: 0000000000000007 R09: 0000000000000009 Jul 08 16:15:41 fedora kernel: RDX: 00007ffc3ec1dff0 RSI: 00000000c0206440 RDI: 0000000000000016 Jul 08 16:15:41 fedora kernel: RAX: ffffffffffffffda RBX: 000056150f0df900 RCX: 00007f17cf728edd Jul 08 16:15:41 fedora kernel: RSP: 002b:00007ffc3ec1df50 EFLAGS: 00000246 ORIG_RAX: 0000000000000010 Jul 08 16:15:41 fedora kernel: Code: 04 25 28 00 00 00 48 89 45 c8 31 c0 48 8d 45 10 c7 45 b0 10 00 00 00 48 89 45 b8 48 8d 45 d0 48 89 45 c0 b8 10 00 00 00 0f 05 <89> c2 3d 00 f0 ff ff 7> Jul 08 16:15:41 fedora kernel: RIP: 0033:0x7f17cf728edd Jul 08 16:15:41 fedora kernel: entry_SYSCALL_64_after_hwframe+0x72/0xdc Jul 08 16:15:41 fedora kernel: ? exc_page_fault+0x7c/0x180 Jul 08 16:15:41 fedora kernel: ? do_user_addr_fault+0x237/0x600 Jul 08 16:15:41 fedora kernel: ? do_syscall_64+0x6c/0x90 Jul 08 16:15:41 fedora kernel: ? syscall_exit_to_user_mode+0x1b/0x40 Jul 08 16:15:41 fedora kernel: do_syscall_64+0x60/0x90 Jul 08 16:15:41 fedora kernel: __x64_sys_ioctl+0x94/0xd0 Jul 08 16:15:41 fedora kernel: amdgpu_drm_ioctl+0x4e/0x90 [amdgpu] Jul 08 16:15:41 fedora kernel: ? __pfx_amdgpu_gem_create_ioctl+0x10/0x10 [amdgpu] Jul 08 16:15:41 fedora kernel: drm_ioctl+0x26d/0x4b0 Jul 08 16:15:41 fedora kernel: drm_ioctl_kernel+0xcd/0x170 Jul 08 16:15:41 fedora kernel: ? __pfx_amdgpu_gem_create_ioctl+0x10/0x10 [amdgpu] Jul 08 16:15:41 fedora kernel: ? __pfx_amdgpu_bo_user_destroy+0x10/0x10 [amdgpu] Jul 08 16:15:41 fedora kernel: amdgpu_gem_create_ioctl+0x14c/0x3c0 [amdgpu] Jul 08 16:15:41 fedora kernel: amdgpu_bo_create_user+0x40/0x70 [amdgpu] Jul 08 16:15:41 fedora kernel: ? __pfx_amdgpu_bo_user_destroy+0x10/0x10 [amdgpu] Jul 08 16:15:41 fedora kernel: amdgpu_bo_create+0x1d4/0x4b0 [amdgpu] Jul 08 16:15:41 fedora kernel: ttm_bo_init_reserved+0x14e/0x1c0 [ttm] Jul 08 16:15:41 fedora kernel: ttm_bo_validate+0xf0/0x160 [ttm] Jul 08 16:15:41 fedora kernel: ttm_bo_handle_move_mem+0x15f/0x170 [ttm] Jul 08 16:15:41 fedora kernel: ttm_tt_populate+0xa1/0x140 [ttm] Jul 08 16:15:41 fedora kernel: amdgpu_ttm_tt_populate+0x39/0x90 [amdgpu] Jul 08 16:15:41 fedora kernel: ttm_pool_alloc+0x307/0x600 [ttm] Jul 08 16:15:41 fedora kernel: __alloc_pages+0x224/0x250 Jul 08 16:15:41 fedora kernel: ? prepare_alloc_pages.constprop.0+0x199/0x1b0 Jul 08 16:15:41 fedora kernel: __alloc_pages_slowpath.constprop.0+0x35a/0xe10 Jul 08 16:15:41 fedora kernel: try_to_free_pages+0xf0/0x210 Jul 08 16:15:41 fedora kernel: do_try_to_free_pages+0x118/0x5c0 Reproducible: Didn't try Specified App: amd-gpu-firmware-20230625-151.fc38.noarch xorg-x11-drv-amdgpu-23.0.0-1.fc38.x86_64 --- Software --- OS: Fedora Linux 38.20230707.0 (Kinoite) KDE Plasma: 5.27.6 KDE Frameworks: 5.107.0 Qt: 5.15.10 Kernel: 6.3.11-200.fc38.x86_64 Compositor: wayland --- Hardware --- CPU: AMD Ryzen 5 PRO 3500U w/ Radeon Vega Mobile Gfx RAM: 13.5 GB GPU: AMD Radeon Vega 8 Graphics Video memory: 2048MB
Moving to kernel, since the backtrace points at the amdgpu code in the kernel.
this was before, indicating the start of the suspend Jan 31 22:03:39 PC kernel: wlp1s0: associated Jan 31 22:03:39 PC kernel: wlp1s0: RX AssocResp from 40:75:c3:19:df:20 (capab=0x1011 status=0 aid=29) Jan 31 22:03:39 PC kernel: wlp1s0: associate with 40:75:c3:19:df:20 (try 1/3) Jan 31 22:03:39 PC kernel: wlp1s0: authenticated Jan 31 22:03:39 PC kernel: wlp1s0: send auth to 40:75:c3:19:df:20 (try 1/3) Jan 31 22:03:39 PC kernel: wlp1s0: authenticate with 40:75:c3:19:df:20 (local address=e4:5e:37:c9:a1:80) Jan 31 22:03:37 PC kernel: psmouse serio1: synaptics: queried min coordinates: x [1266..], y [1162..] Jan 31 22:03:36 PC kernel: psmouse serio1: synaptics: queried max coordinates: x [..5678], y [..4694] Jan 31 22:03:36 PC kernel: PM: suspend exit Jan 31 22:03:36 PC kernel: random: crng reseeded on system resumption Jan 31 22:03:36 PC kernel: Restarting tasks ... done. Jan 31 22:03:36 PC kernel: OOM killer enabled. Jan 31 22:03:36 PC kernel: PM: resume devices took 0.245 seconds Jan 31 22:03:36 PC kernel: PM: Some devices failed to suspend, or early wake event detected Jan 31 22:03:36 PC kernel: nvme 0000:02:00.0: PM: failed to suspend async: error -16 Jan 31 22:03:36 PC kernel: nvme 0000:02:00.0: PM: dpm_run_callback(): pci_pm_suspend+0x0/0x170 returns -16 Jan 31 22:03:36 PC kernel: nvme 0000:02:00.0: PM: pci_pm_suspend(): nvme_suspend+0x0/0x170 [nvme] returns -16 Jan 31 22:03:36 PC kernel: sd 0:0:0:0: [sda] Synchronizing SCSI cache Jan 31 22:03:36 PC kernel: printk: Suspending console(s) (use no_console_suspend to debug) Jan 31 22:03:36 PC kernel: Freezing remaining freezable tasks completed (elapsed 0.075 seconds) Jan 31 22:03:36 PC kernel: Freezing remaining freezable tasks Jan 31 22:03:36 PC kernel: OOM killer disabled. Jan 31 22:03:36 PC kernel: Freezing user space processes completed (elapsed 0.037 seconds) Jan 31 22:03:36 PC kernel: Freezing user space processes Jan 31 22:03:36 PC kernel: Filesystems sync: 0.418 seconds Jan 31 22:03:35 PC kernel: [drm:amdgpu_job_run [amdgpu]] *ERROR* Error scheduling IBs (-22) Jan 31 22:03:35 PC kernel: amdgpu 0000:04:00.0: amdgpu: couldn't schedule ib on ring <gfx_low> Jan 31 22:03:35 PC kernel: [drm] Skip scheduling IBs! Jan 31 22:03:35 PC kernel: [drm] Skip scheduling IBs! Jan 31 22:03:35 PC kernel: [drm] Skip scheduling IBs! Jan 31 22:03:35 PC kernel: [drm] Skip scheduling IBs! Jan 31 22:03:35 PC kernel: [drm] Skip scheduling IBs! Jan 31 22:03:35 PC kernel: [drm] Skip scheduling IBs! Jan 31 22:03:35 PC kernel: [drm] Skip scheduling IBs! Jan 31 22:03:35 PC kernel: [drm] Skip scheduling IBs! Jan 31 22:03:35 PC kernel: [drm] Skip scheduling IBs! Jan 31 22:03:35 PC kernel: [drm:amdgpu_job_run [amdgpu]] *ERROR* Error scheduling IBs (-22) Jan 31 22:03:35 PC kernel: amdgpu 0000:04:00.0: amdgpu: couldn't schedule ib on ring <gfx_low> Jan 31 22:03:35 PC kernel: [drm] Skip scheduling IBs! Jan 31 22:03:35 PC kernel: [drm:amdgpu_job_run [amdgpu]] *ERROR* Error scheduling IBs (-22) Jan 31 22:03:35 PC kernel: amdgpu 0000:04:00.0: amdgpu: couldn't schedule ib on ring <gfx_low> Jan 31 22:03:35 PC kernel: PM: suspend entry (s2idle) Jan 31 22:03:35 PC kernel: PM: suspend exit Jan 31 22:03:35 PC kernel: [drm:amdgpu_job_run [amdgpu]] *ERROR* Error scheduling IBs (-22) Jan 31 22:03:35 PC kernel: amdgpu 0000:04:00.0: amdgpu: couldn't schedule ib on ring <gfx_low> Jan 31 22:03:35 PC kernel: random: crng reseeded on system resumption Jan 31 22:03:35 PC kernel: Restarting tasks ... done. Jan 31 22:03:35 PC kernel: OOM killer enabled. Jan 31 22:03:35 PC kernel: PM: resume devices took 2.950 seconds Jan 31 22:03:35 PC kernel: [drm:process_one_work] *ERROR* ib ring test failed (-110). Jan 31 22:03:35 PC kernel: amdgpu 0000:04:00.0: [drm:amdgpu_ib_ring_tests [amdgpu]] *ERROR* IB test failed on gfx_high (-110). Jan 31 22:03:35 PC kernel: amdgpu 0000:04:00.0: [drm:amdgpu_ib_ring_tests [amdgpu]] *ERROR* IB test failed on gfx_low (-110). Jan 31 22:03:35 PC kernel: psmouse serio1: synaptics: queried min coordinates: x [1266..], y [1162..] Jan 31 22:03:35 PC kernel: psmouse serio1: synaptics: queried max coordinates: x [..5678], y [..4694] Jan 31 22:03:35 PC kernel: amdgpu 0000:04:00.0: amdgpu: ring jpeg_dec uses VM inv eng 6 on hub 8 Jan 31 22:03:35 PC kernel: amdgpu 0000:04:00.0: amdgpu: ring vcn_enc1 uses VM inv eng 5 on hub 8 Jan 31 22:03:35 PC kernel: amdgpu 0000:04:00.0: amdgpu: ring vcn_enc0 uses VM inv eng 4 on hub 8 Jan 31 22:03:35 PC kernel: amdgpu 0000:04:00.0: amdgpu: ring vcn_dec uses VM inv eng 1 on hub 8 Jan 31 22:03:35 PC kernel: amdgpu 0000:04:00.0: amdgpu: ring sdma0 uses VM inv eng 0 on hub 8 Jan 31 22:03:35 PC kernel: amdgpu 0000:04:00.0: amdgpu: ring kiq_0.2.1.0 uses VM inv eng 13 on hub 0 Jan 31 22:03:35 PC kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 12 on hub 0 Jan 31 22:03:35 PC kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 11 on hub 0 Jan 31 22:03:35 PC kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 10 on hub 0 Jan 31 22:03:35 PC kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 9 on hub 0 Jan 31 22:03:35 PC kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 8 on hub 0 Jan 31 22:03:35 PC kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 7 on hub 0 Jan 31 22:03:35 PC kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 6 on hub 0 Jan 31 22:03:35 PC kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 5 on hub 0 Jan 31 22:03:35 PC kernel: amdgpu 0000:04:00.0: amdgpu: ring gfx_high uses VM inv eng 4 on hub 0 Jan 31 22:03:35 PC kernel: amdgpu 0000:04:00.0: amdgpu: ring gfx_low uses VM inv eng 1 on hub 0 Jan 31 22:03:35 PC kernel: amdgpu 0000:04:00.0: amdgpu: ring gfx uses VM inv eng 0 on hub 0 Jan 31 22:03:35 PC kernel: [drm] VCN decode and encode initialized successfully(under SPG Mode). Jan 31 22:03:35 PC kernel: [drm] kiq ring mec 2 pipe 1 q 0 Jan 31 22:03:35 PC kernel: amdgpu: restore the fine grain parameters Jan 31 22:03:35 PC kernel: amdgpu 0000:04:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available Jan 31 22:03:35 PC kernel: amdgpu 0000:04:00.0: amdgpu: RAP: optional rap ta ucode is not available Jan 31 22:03:35 PC kernel: amdgpu 0000:04:00.0: amdgpu: RAS: optional ras ta ucode is not available Jan 31 22:03:35 PC kernel: [drm] reserve 0x400000 from 0xf47fc00000 for PSP TMR Jan 31 22:03:35 PC kernel: [drm] PSP is resuming... Jan 31 22:03:35 PC kernel: [drm] PTB located at 0x000000F400A00000 Jan 31 22:03:35 PC kernel: [drm] PCIE GART of 1024M enabled. Jan 31 22:03:35 PC kernel: PM: Some devices failed to suspend, or early wake event detected Jan 31 22:03:35 PC kernel: [drm] psp gfx command UNLOAD_TA(0x2) failed and response status is (0x117) Jan 31 22:03:35 PC kernel: nvme 0000:02:00.0: PM: failed to suspend async: error -16 Jan 31 22:03:35 PC kernel: nvme 0000:02:00.0: PM: dpm_run_callback(): pci_pm_suspend+0x0/0x170 returns -16 Jan 31 22:03:35 PC kernel: nvme 0000:02:00.0: PM: pci_pm_suspend(): nvme_suspend+0x0/0x170 [nvme] returns -16 Jan 31 22:03:35 PC kernel: sd 0:0:0:0: [sda] Synchronizing SCSI cache Jan 31 22:03:35 PC kernel: printk: Suspending console(s) (use no_console_suspend to debug) Jan 31 22:03:35 PC kernel: Freezing remaining freezable tasks completed (elapsed 0.002 seconds) Jan 31 22:03:35 PC kernel: Freezing remaining freezable tasks Jan 31 22:03:35 PC kernel: OOM killer disabled. Jan 31 22:03:35 PC kernel: Freezing user space processes completed (elapsed 0.004 seconds) Jan 31 22:03:35 PC kernel: Freezing user space processes Jan 31 22:03:31 PC kernel: Filesystems sync: 0.144 seconds Jan 31 22:03:31 PC kernel: PM: suspend entry (deep) Jan 31 22:03:31 PC kernel: wlp1s0: deauthenticating from 40:75:c3:19:df:20 by local choice (Reason: 3=DEAUTH_LEAVING) Jan 31 21:53:34 PC kernel: wlp1s0: associated Jan 31 21:53:34 PC kernel: wlp1s0: RX AssocResp from 40:75:c3:19:df:20 (capab=0x1011 status=0 aid=13) Jan 31 21:53:34 PC kernel: wlp1s0: associate with 40:75:c3:19:df:20 (try 1/3) Jan 31 21:53:34 PC kernel: wlp1s0: authenticated Jan 31 21:53:34 PC kernel: wlp1s0: send auth to 40:75:c3:19:df:20 (try 1/3) Jan 31 21:53:34 PC kernel: wlp1s0: authenticate with 40:75:c3:19:df:20 (local address=e4:5e:37:c9:a1:80) Jan 31 21:53:31 PC kernel: psmouse serio1: synaptics: queried min coordinates: x [1266..], y [1162..] Jan 31 21:53:31 PC kernel: psmouse serio1: synaptics: queried max coordinates: x [..5678], y [..4694] Jan 31 21:53:30 PC kernel: PM: suspend exit Jan 31 21:53:30 PC kernel: random: crng reseeded on system resumption Jan 31 21:53:30 PC kernel: Restarting tasks ... done. Jan 31 21:53:30 PC kernel: OOM killer enabled. Jan 31 21:53:30 PC kernel: PM: resume devices took 0.246 seconds Jan 31 21:53:30 PC kernel: PM: Some devices failed to suspend, or early wake event detected Jan 31 21:53:30 PC kernel: nvme 0000:02:00.0: PM: failed to suspend async: error -16 Jan 31 21:53:30 PC kernel: nvme 0000:02:00.0: PM: dpm_run_callback(): pci_pm_suspend+0x0/0x170 returns -16 Jan 31 21:53:30 PC kernel: nvme 0000:02:00.0: PM: pci_pm_suspend(): nvme_suspend+0x0/0x170 [nvme] returns -16 Jan 31 21:53:30 PC kernel: sd 0:0:0:0: [sda] Synchronizing SCSI cache Jan 31 21:53:30 PC kernel: printk: Suspending console(s) (use no_console_suspend to debug) Jan 31 21:53:30 PC kernel: Freezing remaining freezable tasks completed (elapsed 0.159 seconds) Jan 31 21:53:30 PC kernel: Freezing remaining freezable tasks Jan 31 21:53:30 PC kernel: OOM killer disabled. Jan 31 21:53:30 PC kernel: Freezing user space processes completed (elapsed 0.004 seconds) Jan 31 21:53:30 PC kernel: Freezing user space processes Jan 31 21:53:30 PC kernel: Filesystems sync: 0.142 seconds Jan 31 21:53:30 PC kernel: PM: suspend entry (s2idle) Jan 31 21:53:30 PC kernel: PM: suspend exit Jan 31 21:53:30 PC kernel: random: crng reseeded on system resumption Jan 31 21:53:29 PC kernel: Restarting tasks ... done. Jan 31 21:53:29 PC kernel: OOM killer enabled. Jan 31 21:53:29 PC kernel: PM: resume devices took 0.000 seconds Jan 31 21:53:29 PC kernel: PM: Some devices failed to suspend, or early wake event detected Jan 31 21:53:29 PC kernel: amdgpu 0000:04:00.0: PM: not prepared for power transition: code -12 Jan 31 21:53:29 PC kernel: amdgpu 0000:04:00.0: PM: device_prepare(): pci_pm_prepare+0x0/0x70 returns -12
I could reproduce this. My laptop went to sleep (not hibernate which is still not a thing on Fedora) and I could not turn it on again at all. My laptop was very how and the fan was running, even the power button didnt react at all. Feb 01 17:18:57 PC kernel: [drm:amdgpu_job_run [amdgpu]] *ERROR* Error scheduling IBs (-22) Feb 01 17:18:57 PC kernel: amdgpu 0000:04:00.0: amdgpu: couldn't schedule ib on ring <gfx_low> Feb 01 17:18:57 PC kernel: [drm] Skip scheduling IBs! Feb 01 17:18:57 PC kernel: [drm:amdgpu_job_run [amdgpu]] *ERROR* Error scheduling IBs (-22) Feb 01 17:18:57 PC kernel: amdgpu 0000:04:00.0: amdgpu: couldn't schedule ib on ring <gfx_low> Feb 01 17:18:56 PC kernel: [drm:amdgpu_job_run [amdgpu]] *ERROR* Error scheduling IBs (-22) Feb 01 17:18:56 PC kernel: amdgpu 0000:04:00.0: amdgpu: couldn't schedule ib on ring <gfx_low> Feb 01 17:18:55 PC kernel: [drm] Skip scheduling IBs! Feb 01 17:18:55 PC kernel: [drm:amdgpu_job_run [amdgpu]] *ERROR* Error scheduling IBs (-22) Feb 01 17:18:55 PC kernel: amdgpu 0000:04:00.0: amdgpu: couldn't schedule ib on ring <gfx_low> Feb 01 17:18:47 PC kernel: [drm:amdgpu_job_run [amdgpu]] *ERROR* Error scheduling IBs (-22) Feb 01 17:18:47 PC kernel: amdgpu 0000:04:00.0: amdgpu: couldn't schedule ib on ring <gfx_low> Feb 01 17:18:47 PC kernel: [drm] Skip scheduling IBs! Feb 01 17:18:47 PC kernel: [drm:amdgpu_job_run [amdgpu]] *ERROR* Error scheduling IBs (-22) Feb 01 17:18:47 PC kernel: amdgpu 0000:04:00.0: amdgpu: couldn't schedule ib on ring <gfx_low> Feb 01 17:18:45 PC kernel: [drm:amdgpu_job_run [amdgpu]] *ERROR* Error scheduling IBs (-22) Feb 01 17:18:45 PC kernel: amdgpu 0000:04:00.0: amdgpu: couldn't schedule ib on ring <gfx_low> Feb 01 17:18:45 PC kernel: [drm] Skip scheduling IBs! Feb 01 17:18:45 PC kernel: [drm:amdgpu_job_run [amdgpu]] *ERROR* Error scheduling IBs (-22) Feb 01 17:18:45 PC kernel: amdgpu 0000:04:00.0: amdgpu: couldn't schedule ib on ring <gfx_low> Feb 01 17:18:35 PC kernel: [drm:amdgpu_job_run [amdgpu]] *ERROR* Error scheduling IBs (-22) Feb 01 17:18:35 PC kernel: amdgpu 0000:04:00.0: amdgpu: couldn't schedule ib on ring <gfx_low> Feb 01 17:18:34 PC kernel: [drm:amdgpu_job_run [amdgpu]] *ERROR* Error scheduling IBs (-22) Feb 01 17:18:34 PC kernel: amdgpu 0000:04:00.0: amdgpu: couldn't schedule ib on ring <gfx_low> Feb 01 17:18:34 PC kernel: [drm:amdgpu_job_run [amdgpu]] *ERROR* Error scheduling IBs (-22) Feb 01 17:18:34 PC kernel: amdgpu 0000:04:00.0: amdgpu: couldn't schedule ib on ring <gfx_low> Feb 01 17:18:34 PC kernel: [drm:amdgpu_job_run [amdgpu]] *ERROR* Error scheduling IBs (-22) Feb 01 17:18:34 PC kernel: amdgpu 0000:04:00.0: amdgpu: couldn't schedule ib on ring <gfx_low> Feb 01 17:18:33 PC kernel: [drm:amdgpu_job_run [amdgpu]] *ERROR* Error scheduling IBs (-22) Feb 01 17:18:33 PC kernel: amdgpu 0000:04:00.0: amdgpu: couldn't schedule ib on ring <gfx_low> Feb 01 17:18:33 PC kernel: [drm] Skip scheduling IBs!
Fedora Linux 38 entered end-of-life (EOL) status on 2024-05-21. Fedora Linux 38 is no longer maintained, which means that it will not receive any further security or bug fix updates. As a result we are closing this bug. If you can reproduce this bug against a currently maintained version of Fedora Linux please feel free to reopen this bug against that version. Note that the version field may be hidden. Click the "Show advanced fields" button if you do not see the version field. If you are unable to reopen this bug, please file a new report against an active release. Thank you for reporting this bug and we are sorry it could not be fixed.