Bug 2299438 - 24.1.4-2.fc40 update makes AMD GPU hang all the time
Summary: 24.1.4-2.fc40 update makes AMD GPU hang all the time
Status: CLOSED ERRATA
Alias: None
Product: Fedora
Classification: Fedora
Component: mesa
Version: 40
Hardware: x86_64
OS: Linux
Priority: unspecified
Severity: high
Target Milestone: ---
Assignee: Adam Jackson
QA Contact: Fedora Extras Quality Assurance
Reported: 2024-07-23 06:18 UTC by Ferry Huberts
Modified: 2025-01-31 03:08 UTC

Fixed In Version: mesa-24.3.4-3.fc41
Last Closed: 2025-01-31 03:08:24 UTC

Description Ferry Huberts 2024-07-23 06:18:32 UTC
The mesa 24.1.4-2.fc40 update makes my AMD GPU hang constantly, roughly every 10 seconds, and after a few hangs it can no longer recover.

ThinkPad T14s Gen 4 AMD

Architecture:             x86_64
  CPU op-mode(s):         32-bit, 64-bit
  Address sizes:          48 bits physical, 48 bits virtual
  Byte Order:             Little Endian
CPU(s):                   16
  On-line CPU(s) list:    0-15
Vendor ID:                AuthenticAMD
  Model name:             AMD Ryzen 7 PRO 7840U w/ Radeon 780M Graphics
    CPU family:           25
    Model:                116
    Thread(s) per core:   2
    Core(s) per socket:   8
    Socket(s):            1
    Stepping:             1
    CPU(s) scaling MHz:   24%
    CPU max MHz:          5132.0000
    CPU min MHz:          400.0000
    BogoMIPS:             6587.59
    Flags:                fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm constant_tsc rep_good amd_lbr_v2 nopl xtopology nonstop_tsc cpuid extd_apicid aperfmperf rapl pni pclmulqdq monitor ssse3 fma cx16 sse4_1 sse4_2 x2apic movbe popcnt aes xsave avx f16c rdrand lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw ibs skinit wdt tce topoext perfctr_core perfctr_nb bpext perfctr_llc mwaitx cpb cat_l3 cdp_l3 hw_pstate ssbd mba perfmon_v2 ibrs ibpb stibp ibrs_enhanced vmmcall fsgsbase bmi1 avx2 smep bmi2 erms invpcid cqm rdt_a avx512f avx512dq rdseed adx smap avx512ifma clflushopt clwb avx512cd sha_ni avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local user_shstk avx512_bf16 clzero irperf xsaveerptr rdpru wbnoinvd cppc arat npt lbrv svm_lock nrip_save tsc_scale vmcb_clean flushbyasid decodeassists pausefilter pfthreshold v_vmsave_vmload vgif x2avic v_spec_ctrl vnmi avx512vbmi umip pku ospke avx512_vbmi2 gfni vaes vpclmulqdq avx512_vnni avx512_bitalg avx512_vpopcntdq rdpid overflow_recov succor smca fsrm flush_l1d amd_lbr_pmc_freeze
Virtualization features:  
  Virtualization:         AMD-V
Caches (sum of all):      
  L1d:                    256 KiB (8 instances)
  L1i:                    256 KiB (8 instances)
  L2:                     8 MiB (8 instances)
  L3:                     16 MiB (1 instance)
NUMA:                     
  NUMA node(s):           1
  NUMA node0 CPU(s):      0-15
Vulnerabilities:          
  Gather data sampling:   Not affected
  Itlb multihit:          Not affected
  L1tf:                   Not affected
  Mds:                    Not affected
  Meltdown:               Not affected
  Mmio stale data:        Not affected
  Reg file data sampling: Not affected
  Retbleed:               Not affected
  Spec rstack overflow:   Mitigation; Safe RET
  Spec store bypass:      Mitigation; Speculative Store Bypass disabled via prctl
  Spectre v1:             Mitigation; usercopy/swapgs barriers and __user pointer sanitization
  Spectre v2:             Mitigation; Enhanced / Automatic IBRS; IBPB conditional; STIBP always-on; RSB filling; PBRSB-eIBRS Not affected; BHI Not affected
  Srbds:                  Not affected
  Tsx async abort:        Not affected


Reproducible: Always

Steps to Reproduce:
1. Install the mesa 24.1.4-2.fc40 update on the ThinkPad described above.

Actual Results:
Very frequent GPU hangs; the GPU becomes unrecoverable after a few of them.

Expected Results:  
no hangs

Downgrading to mesa 24.0.5-1.fc40 gets rid of the hangs.

Comment 1 Ferry Huberts 2024-07-23 06:24:35 UTC
I think this is one of the traces from the log, but for some reason only a few of the hangs show up in the log.


Jul 22 19:09:38 arrakeen kernel:  ret_from_fork_asm+0x1a/0x30
Jul 22 19:09:38 arrakeen kernel:  </TASK>
Jul 22 19:09:38 arrakeen kernel: ---[ end trace 0000000000000000 ]---
Jul 22 19:09:38 arrakeen kernel: ------------[ cut here ]------------
Jul 22 19:09:39 arrakeen kernel: WARNING: CPU: 8 PID: 1085 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:630 amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:39 arrakeen kernel: Modules linked in: uinput michael_mic snd_seq_dummy snd_hrtimer rfcomm qrtr_mhi nf_conntrack_netbios_ns nf_conntrack_broadcast nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4>
Jul 22 19:09:39 arrakeen kernel:  videobuf2_memops videobuf2_v4l2 btbcm snd_rpl_pci_acp6x snd_hda_core videobuf2_common kvm snd_acp_pci snd_hwdep btmtk videodev snd_acp_legacy_common libarc4 snd_seq snd_pci_acp6x bluetooth snd_pci_acp5x >
Jul 22 19:09:39 arrakeen kernel: CPU: 8 PID: 1085 Comm: kworker/u64:10 Tainted: G        W          6.9.10-200.fc40.x86_64 #1
Jul 22 19:09:39 arrakeen kernel: Hardware name: LENOVO 21F8CTO1WW/21F8CTO1WW, BIOS R2EET38W (1.19 ) 03/15/2024
Jul 22 19:09:39 arrakeen kernel: Workqueue: amdgpu-reset-dev drm_sched_job_timedout [gpu_sched]
Jul 22 19:09:39 arrakeen kernel: RIP: 0010:amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:39 arrakeen kernel: Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 fa 22 b4 c2 e9 1a fd ff ff <0f> 0b b8 ea ff ff ff e9 e9 22 b4 c2 b8 ea ff ff ff e9 df 22 b4 c2
Jul 22 19:09:39 arrakeen kernel: RSP: 0018:ffffaf548089fca8 EFLAGS: 00010246
Jul 22 19:09:39 arrakeen kernel: RAX: ffff90a281ffbd20 RBX: ffff90a29c680000 RCX: 0000000000000000
Jul 22 19:09:39 arrakeen kernel: RDX: 0000000000000000 RSI: ffff90a29c680c58 RDI: ffff90a29c680000
Jul 22 19:09:39 arrakeen kernel: RBP: ffff90a29c6c48e0 R08: 0000000000000001 R09: 0000000000000000
Jul 22 19:09:39 arrakeen kernel: R10: 0000000000000100 R11: 0000000000000000 R12: 0000000000000001
Jul 22 19:09:39 arrakeen kernel: R13: ffff90a29c680000 R14: 0000000000000001 R15: ffffffffc139f350
Jul 22 19:09:39 arrakeen kernel: FS:  0000000000000000(0000) GS:ffff90a9bee00000(0000) knlGS:0000000000000000
Jul 22 19:09:39 arrakeen kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jul 22 19:09:39 arrakeen kernel: CR2: 00007f5b89ba2808 CR3: 0000000491428000 CR4: 0000000000f50ef0
Jul 22 19:09:39 arrakeen kernel: PKRU: 55555554
Jul 22 19:09:39 arrakeen kernel: Call Trace:
Jul 22 19:09:39 arrakeen kernel:  <TASK>
Jul 22 19:09:39 arrakeen kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  ? __warn.cold+0x8e/0xe8
Jul 22 19:09:39 arrakeen kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  ? report_bug+0xff/0x140
Jul 22 19:09:39 arrakeen kernel:  ? handle_bug+0x3c/0x80
Jul 22 19:09:39 arrakeen kernel:  ? exc_invalid_op+0x17/0x70
Jul 22 19:09:39 arrakeen kernel:  ? asm_exc_invalid_op+0x1a/0x20
Jul 22 19:09:39 arrakeen kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  gmc_v11_0_hw_fini+0x24/0x90 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  gmc_v11_0_suspend+0xe/0x20 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  amdgpu_device_ip_suspend_phase2+0x141/0x5d0 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  ? amdgpu_device_ip_suspend_phase1+0x9a/0x180 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  amdgpu_device_ip_suspend+0x40/0x70 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  amdgpu_device_pre_asic_reset+0xcd/0x420 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  amdgpu_device_gpu_recover.cold+0x475/0xb44 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  amdgpu_job_timedout+0x18e/0x1d0 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  drm_sched_job_timedout+0x73/0x100 [gpu_sched]
Jul 22 19:09:39 arrakeen kernel:  process_one_work+0x17b/0x340
Jul 22 19:09:39 arrakeen kernel:  worker_thread+0x278/0x3b0
Jul 22 19:09:39 arrakeen kernel:  ? __pfx_worker_thread+0x10/0x10
Jul 22 19:09:39 arrakeen kernel:  kthread+0xcf/0x100
Jul 22 19:09:39 arrakeen kernel:  ? __pfx_kthread+0x10/0x10
Jul 22 19:09:39 arrakeen kernel:  ret_from_fork+0x31/0x50
Jul 22 19:09:39 arrakeen kernel:  ? __pfx_kthread+0x10/0x10
Jul 22 19:09:39 arrakeen kernel:  ret_from_fork_asm+0x1a/0x30
Jul 22 19:09:39 arrakeen kernel:  </TASK>
Jul 22 19:09:39 arrakeen kernel: ---[ end trace 0000000000000000 ]---
Jul 22 19:09:39 arrakeen kernel: amdgpu 0000:c3:00.0: amdgpu: MODE2 reset
Jul 22 19:09:39 arrakeen kernel: amdgpu 0000:c3:00.0: amdgpu: GPU reset succeeded, trying to resume
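
To check how many hangs the journal actually recorded, the ring timeout messages can be counted per ring. The following is only a minimal sketch in Python, assuming the journal has been exported as plain text and that each hang appears as an amdgpu_job_timedout "ring ... timeout" line in the format seen in this report. The names TIMEOUT_RE and count_ring_timeouts are hypothetical and not part of any existing tool.

```python
import re
from collections import Counter

# Assumed message format, taken from the timeout lines in this report:
# "... *ERROR* ring vcn_unified_0 timeout, signaled seq=189, emitted seq=191"
TIMEOUT_RE = re.compile(
    r"ring (?P<ring>\S+) timeout, "
    r"signaled seq=(?P<signaled>\d+), emitted seq=(?P<emitted>\d+)"
)


def count_ring_timeouts(lines):
    """Return a Counter mapping ring name to the number of timeout messages."""
    counts = Counter()
    for line in lines:
        match = TIMEOUT_RE.search(line)
        if match:
            counts[match.group("ring")] += 1
    return counts


# Sample input: three lines copied from the journal output in this report.
sample = [
    "Jul 22 19:09:26 arrakeen kernel: [drm:amdgpu_job_timedout [amdgpu]] "
    "*ERROR* ring vcn_unified_0 timeout, signaled seq=189, emitted seq=191",
    "Jul 22 19:09:26 arrakeen kernel: amdgpu 0000:c3:00.0: amdgpu: GPU reset begin!",
    "Jul 22 19:09:38 arrakeen kernel: [drm:amdgpu_job_timedout [amdgpu]] "
    "*ERROR* ring vcn_unified_0 timeout, signaled seq=191, emitted seq=191",
]

counts = count_ring_timeouts(sample)
for ring, hits in counts.items():
    print(f"{ring}: {hits} timeout(s)")
```

For a real log, the list of sample lines would be replaced by the lines of the exported journal file. If the count is lower than the number of hangs seen on screen, that would support the impression that not every hang reaches the log.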

Comment 2 Ferry Huberts 2024-07-23 06:27:04 UTC
This is the more complete dump:

Jul 22 19:09:26 arrakeen systemd[1]: systemd-hostnamed.service: Deactivated successfully.
Jul 22 19:09:26 arrakeen audit[1]: SERVICE_STOP pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=systemd-hostnamed comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=succes>
Jul 22 19:09:26 arrakeen audit: BPF prog-id=67 op=UNLOAD
Jul 22 19:09:26 arrakeen audit: BPF prog-id=66 op=UNLOAD
Jul 22 19:09:26 arrakeen audit: BPF prog-id=65 op=UNLOAD
Jul 22 19:09:26 arrakeen kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring vcn_unified_0 timeout, signaled seq=189, emitted seq=191
Jul 22 19:09:26 arrakeen kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process RDD Process pid 4134 thread firefox:cs0 pid 4315
Jul 22 19:09:26 arrakeen kernel: amdgpu 0000:c3:00.0: amdgpu: GPU reset begin!
Jul 22 19:09:27 arrakeen kernel: [drm] Register(0) [regUVD_POWER_STATUS] failed to reach value 0x00000001 != 0x00000002n
Jul 22 19:09:27 arrakeen kernel: [drm] Register(0) [regUVD_RB_RPTR] failed to reach value 0x00000000 != 0x00000380n
Jul 22 19:09:27 arrakeen kernel: [drm] Register(0) [regUVD_POWER_STATUS] failed to reach value 0x00000001 != 0x00000002n
Jul 22 19:09:27 arrakeen kernel: amdgpu 0000:c3:00.0: amdgpu: MODE2 reset
Jul 22 19:09:27 arrakeen kernel: amdgpu 0000:c3:00.0: amdgpu: GPU reset succeeded, trying to resume
Jul 22 19:09:27 arrakeen kernel: [drm] PCIE GART of 512M enabled (table at 0x0000008000900000).
Jul 22 19:09:27 arrakeen kernel: amdgpu 0000:c3:00.0: amdgpu: SMU is resuming...
Jul 22 19:09:27 arrakeen kernel: amdgpu 0000:c3:00.0: amdgpu: SMU is resumed successfully!
Jul 22 19:09:27 arrakeen kernel: [drm] DMUB hardware initialized: version=0x08003D00
Jul 22 19:09:28 arrakeen kernel: [drm] kiq ring mec 3 pipe 1 q 0
Jul 22 19:09:28 arrakeen kernel: amdgpu 0000:c3:00.0: [drm:amdgpu_ring_test_helper [amdgpu]] *ERROR* ring vcn_unified_0 test failed (-110)
Jul 22 19:09:28 arrakeen kernel: [drm:amdgpu_device_ip_resume_phase2 [amdgpu]] *ERROR* resume of IP block <vcn_v4_0> failed -110
Jul 22 19:09:28 arrakeen kernel: amdgpu 0000:c3:00.0: amdgpu: GPU reset(1) failed
Jul 22 19:09:28 arrakeen kernel: amdgpu 0000:c3:00.0: amdgpu: GPU reset end with ret = -110
Jul 22 19:09:28 arrakeen kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* GPU Recovery Failed: -110
Jul 22 19:09:28 arrakeen kernel: [drm] Fence fallback timer expired on ring sdma0
Jul 22 19:09:28 arrakeen kernel: [drm] Fence fallback timer expired on ring gfx_0.0.0
Jul 22 19:09:29 arrakeen kernel: [drm] Fence fallback timer expired on ring gfx_0.0.0
Jul 22 19:09:29 arrakeen kernel: [drm] Fence fallback timer expired on ring sdma0
Jul 22 19:09:29 arrakeen kernel: [drm] Register(0) [regUVD_POWER_STATUS] failed to reach value 0x00000001 != 0x00000002n
Jul 22 19:09:29 arrakeen kernel: [drm] Register(0) [regUVD_RB_RPTR] failed to reach value 0x00000040 != 0x00000000n
Jul 22 19:09:29 arrakeen kernel: [drm] Register(0) [regUVD_POWER_STATUS] failed to reach value 0x00000001 != 0x00000002n
Jul 22 19:09:29 arrakeen kernel: [drm] Fence fallback timer expired on ring sdma0
Jul 22 19:09:29 arrakeen kernel: [drm] Fence fallback timer expired on ring gfx_0.0.0
Jul 22 19:09:29 arrakeen systemd[1]: systemd-timedated.service: Deactivated successfully.
Jul 22 19:09:29 arrakeen audit[1]: SERVICE_STOP pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=systemd-timedated comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=succes>
Jul 22 19:09:29 arrakeen audit: BPF prog-id=73 op=UNLOAD
Jul 22 19:09:30 arrakeen kernel: [drm] Fence fallback timer expired on ring gfx_0.0.0
Jul 22 19:09:30 arrakeen kernel: [drm] Fence fallback timer expired on ring sdma0
Jul 22 19:09:30 arrakeen kernel: [drm] Fence fallback timer expired on ring sdma0
Jul 22 19:09:31 arrakeen audit: BPF prog-id=75 op=UNLOAD
Jul 22 19:09:31 arrakeen audit: BPF prog-id=74 op=UNLOAD
Jul 22 19:09:31 arrakeen kernel: [drm] Fence fallback timer expired on ring sdma0
Jul 22 19:09:31 arrakeen kernel: [drm] Fence fallback timer expired on ring sdma0
Jul 22 19:09:32 arrakeen kernel: [drm] Fence fallback timer expired on ring gfx_0.0.0
Jul 22 19:09:32 arrakeen kernel: [drm] Fence fallback timer expired on ring sdma0
Jul 22 19:09:32 arrakeen kernel: [drm] Fence fallback timer expired on ring gfx_0.0.0
Jul 22 19:09:33 arrakeen kernel: [drm] Fence fallback timer expired on ring gfx_0.0.0
Jul 22 19:09:33 arrakeen kernel: [drm] Fence fallback timer expired on ring sdma0
Jul 22 19:09:33 arrakeen systemd[1]: systemd-localed.service: Deactivated successfully.
Jul 22 19:09:33 arrakeen audit[1]: SERVICE_STOP pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=systemd-localed comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
Jul 22 19:09:33 arrakeen kernel: [drm] Fence fallback timer expired on ring sdma0
Jul 22 19:09:33 arrakeen kernel: [drm] Fence fallback timer expired on ring gfx_0.0.0
Jul 22 19:09:34 arrakeen kernel: [drm] Fence fallback timer expired on ring sdma0
Jul 22 19:09:34 arrakeen kernel: [drm] Fence fallback timer expired on ring gfx_0.0.0
Jul 22 19:09:34 arrakeen kernel: [drm] Fence fallback timer expired on ring gfx_0.0.0
Jul 22 19:09:34 arrakeen kernel: [drm] Fence fallback timer expired on ring sdma0
Jul 22 19:09:34 arrakeen audit: BPF prog-id=70 op=UNLOAD
Jul 22 19:09:34 arrakeen audit: BPF prog-id=69 op=UNLOAD
Jul 22 19:09:34 arrakeen audit: BPF prog-id=68 op=UNLOAD
Jul 22 19:09:35 arrakeen kernel: [drm] Fence fallback timer expired on ring gfx_0.0.0
Jul 22 19:09:35 arrakeen gnome-shell[2774]: meta_wayland_buffer_process_damage: assertion 'buffer->resource' failed
Jul 22 19:09:35 arrakeen kernel: [drm] Fence fallback timer expired on ring sdma0
Jul 22 19:09:35 arrakeen kernel: [drm] Fence fallback timer expired on ring gfx_0.0.0
Jul 22 19:09:36 arrakeen kernel: [drm] Fence fallback timer expired on ring sdma0
Jul 22 19:09:36 arrakeen kernel: [drm] Fence fallback timer expired on ring gfx_0.0.0
Jul 22 19:09:36 arrakeen kernel: [drm] Fence fallback timer expired on ring gfx_0.0.0
Jul 22 19:09:37 arrakeen kernel: [drm] Fence fallback timer expired on ring gfx_0.0.0
Jul 22 19:09:37 arrakeen kernel: [drm] Fence fallback timer expired on ring sdma0
Jul 22 19:09:37 arrakeen gnome-shell[2774]: meta_wayland_buffer_process_damage: assertion 'buffer->resource' failed
Jul 22 19:09:37 arrakeen kernel: [drm] Fence fallback timer expired on ring gfx_0.0.0
Jul 22 19:09:38 arrakeen kernel: [drm] Fence fallback timer expired on ring sdma0
Jul 22 19:09:38 arrakeen kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring vcn_unified_0 timeout, signaled seq=191, emitted seq=191
Jul 22 19:09:38 arrakeen kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process RDD Process pid 4134 thread firefox:cs0 pid 4315
Jul 22 19:09:38 arrakeen kernel: amdgpu 0000:c3:00.0: amdgpu: GPU reset begin!
Jul 22 19:09:38 arrakeen kernel: ------------[ cut here ]------------
Jul 22 19:09:38 arrakeen kernel: WARNING: CPU: 7 PID: 1085 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:630 amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel: Modules linked in: uinput michael_mic snd_seq_dummy snd_hrtimer rfcomm qrtr_mhi nf_conntrack_netbios_ns nf_conntrack_broadcast nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4>
Jul 22 19:09:38 arrakeen kernel:  videobuf2_memops videobuf2_v4l2 btbcm snd_rpl_pci_acp6x snd_hda_core videobuf2_common kvm snd_acp_pci snd_hwdep btmtk videodev snd_acp_legacy_common libarc4 snd_seq snd_pci_acp6x bluetooth snd_pci_acp5x >
Jul 22 19:09:38 arrakeen kernel: CPU: 7 PID: 1085 Comm: kworker/u64:10 Tainted: G        W          6.9.10-200.fc40.x86_64 #1
Jul 22 19:09:38 arrakeen kernel: Hardware name: LENOVO 21F8CTO1WW/21F8CTO1WW, BIOS R2EET38W (1.19 ) 03/15/2024
Jul 22 19:09:38 arrakeen kernel: Workqueue: amdgpu-reset-dev drm_sched_job_timedout [gpu_sched]
Jul 22 19:09:38 arrakeen kernel: RIP: 0010:amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel: Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 fa 22 b4 c2 e9 1a fd ff ff <0f> 0b b8 ea ff ff ff e9 e9 22 b4 c2 b8 ea ff ff ff e9 df 22 b4 c2
Jul 22 19:09:38 arrakeen kernel: RSP: 0018:ffffaf548089fca8 EFLAGS: 00010246
Jul 22 19:09:38 arrakeen kernel: RAX: ffff90a281ffb650 RBX: ffff90a29c680000 RCX: 0000000000000000
Jul 22 19:09:38 arrakeen kernel: RDX: 0000000000000000 RSI: ffff90a29c6a54b8 RDI: ffff90a29c680000
Jul 22 19:09:38 arrakeen kernel: RBP: ffff90a29c6c4930 R08: 0017ffffc0000000 R09: 0000000000000000
Jul 22 19:09:38 arrakeen kernel: R10: ffffaf548089fc58 R11: 0000000000000000 R12: 0000000000000006
Jul 22 19:09:38 arrakeen kernel: R13: ffff90a29c680000 R14: 0000000000000006 R15: ffffffffc13ce690
Jul 22 19:09:38 arrakeen kernel: FS:  0000000000000000(0000) GS:ffff90a9bed80000(0000) knlGS:0000000000000000
Jul 22 19:09:38 arrakeen kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jul 22 19:09:38 arrakeen kernel: CR2: 00007ff432333018 CR3: 0000000491428000 CR4: 0000000000f50ef0
Jul 22 19:09:38 arrakeen kernel: PKRU: 55555554
Jul 22 19:09:38 arrakeen kernel: Call Trace:
Jul 22 19:09:38 arrakeen kernel:  <TASK>
Jul 22 19:09:38 arrakeen kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  ? __warn.cold+0x8e/0xe8
Jul 22 19:09:38 arrakeen kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  ? report_bug+0xff/0x140
Jul 22 19:09:38 arrakeen kernel:  ? handle_bug+0x3c/0x80
Jul 22 19:09:38 arrakeen kernel:  ? exc_invalid_op+0x17/0x70
Jul 22 19:09:38 arrakeen kernel:  ? asm_exc_invalid_op+0x1a/0x20
Jul 22 19:09:38 arrakeen kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  gfx_v11_0_hw_fini+0x1b/0xf0 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  gfx_v11_0_suspend+0xe/0x20 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  amdgpu_device_ip_suspend_phase2+0x141/0x5d0 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  ? amdgpu_device_ip_suspend_phase1+0x9a/0x180 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  amdgpu_device_ip_suspend+0x40/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  amdgpu_device_pre_asic_reset+0xcd/0x420 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  amdgpu_device_gpu_recover.cold+0x475/0xb44 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  amdgpu_job_timedout+0x18e/0x1d0 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  drm_sched_job_timedout+0x73/0x100 [gpu_sched]
Jul 22 19:09:38 arrakeen kernel:  process_one_work+0x17b/0x340
Jul 22 19:09:38 arrakeen kernel:  worker_thread+0x278/0x3b0
Jul 22 19:09:38 arrakeen kernel:  ? __pfx_worker_thread+0x10/0x10
Jul 22 19:09:38 arrakeen kernel:  kthread+0xcf/0x100
Jul 22 19:09:38 arrakeen kernel:  ? __pfx_kthread+0x10/0x10
Jul 22 19:09:38 arrakeen kernel:  ret_from_fork+0x31/0x50
Jul 22 19:09:38 arrakeen kernel:  ? __pfx_kthread+0x10/0x10
Jul 22 19:09:38 arrakeen kernel:  ret_from_fork_asm+0x1a/0x30
Jul 22 19:09:38 arrakeen kernel:  </TASK>
Jul 22 19:09:38 arrakeen kernel: ---[ end trace 0000000000000000 ]---
Jul 22 19:09:38 arrakeen kernel: ------------[ cut here ]------------
Jul 22 19:09:38 arrakeen kernel: WARNING: CPU: 7 PID: 1085 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:630 amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel: Modules linked in: uinput michael_mic snd_seq_dummy snd_hrtimer rfcomm qrtr_mhi nf_conntrack_netbios_ns nf_conntrack_broadcast nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4>
Jul 22 19:09:38 arrakeen kernel:  videobuf2_memops videobuf2_v4l2 btbcm snd_rpl_pci_acp6x snd_hda_core videobuf2_common kvm snd_acp_pci snd_hwdep btmtk videodev snd_acp_legacy_common libarc4 snd_seq snd_pci_acp6x bluetooth snd_pci_acp5x >
Jul 22 19:09:38 arrakeen kernel: CPU: 7 PID: 1085 Comm: kworker/u64:10 Tainted: G        W          6.9.10-200.fc40.x86_64 #1
Jul 22 19:09:38 arrakeen kernel: Hardware name: LENOVO 21F8CTO1WW/21F8CTO1WW, BIOS R2EET38W (1.19 ) 03/15/2024
Jul 22 19:09:38 arrakeen kernel: Workqueue: amdgpu-reset-dev drm_sched_job_timedout [gpu_sched]
Jul 22 19:09:38 arrakeen kernel: RIP: 0010:amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel: Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 fa 22 b4 c2 e9 1a fd ff ff <0f> 0b b8 ea ff ff ff e9 e9 22 b4 c2 b8 ea ff ff ff e9 df 22 b4 c2
Jul 22 19:09:38 arrakeen kernel: RSP: 0018:ffffaf548089fca8 EFLAGS: 00010246
Jul 22 19:09:38 arrakeen kernel: RAX: ffff90a281ffbfc8 RBX: ffff90a29c680000 RCX: 0000000000000000
Jul 22 19:09:38 arrakeen kernel: RDX: 0000000000000000 RSI: ffff90a29c6a54d0 RDI: ffff90a29c680000
Jul 22 19:09:38 arrakeen kernel: RBP: ffff90a29c6c4930 R08: 0017ffffc0000000 R09: 0000000000000000
Jul 22 19:09:38 arrakeen kernel: R10: ffffaf548089fc58 R11: 0000000000000000 R12: 0000000000000006
Jul 22 19:09:38 arrakeen kernel: R13: ffff90a29c680000 R14: 0000000000000006 R15: ffffffffc13ce690
Jul 22 19:09:38 arrakeen kernel: FS:  0000000000000000(0000) GS:ffff90a9bed80000(0000) knlGS:0000000000000000
Jul 22 19:09:38 arrakeen kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jul 22 19:09:38 arrakeen kernel: CR2: 00007ff432333018 CR3: 0000000491428000 CR4: 0000000000f50ef0
Jul 22 19:09:38 arrakeen kernel: PKRU: 55555554
Jul 22 19:09:38 arrakeen kernel: Call Trace:
Jul 22 19:09:38 arrakeen kernel:  <TASK>
Jul 22 19:09:38 arrakeen kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  ? __warn.cold+0x8e/0xe8
Jul 22 19:09:38 arrakeen kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  ? report_bug+0xff/0x140
Jul 22 19:09:38 arrakeen kernel:  ? handle_bug+0x3c/0x80
Jul 22 19:09:38 arrakeen kernel:  ? exc_invalid_op+0x17/0x70
Jul 22 19:09:38 arrakeen kernel:  ? asm_exc_invalid_op+0x1a/0x20
Jul 22 19:09:38 arrakeen kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
Jul 22 19:09:38 arrakeen kernel:  gfx_v11_0_hw_fini+0x2c/0xf0 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  gfx_v11_0_suspend+0xe/0x20 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  amdgpu_device_ip_suspend_phase2+0x141/0x5d0 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  ? amdgpu_device_ip_suspend_phase1+0x9a/0x180 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  amdgpu_device_ip_suspend+0x40/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  amdgpu_device_pre_asic_reset+0xcd/0x420 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  amdgpu_device_gpu_recover.cold+0x475/0xb44 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  amdgpu_job_timedout+0x18e/0x1d0 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  drm_sched_job_timedout+0x73/0x100 [gpu_sched]
Jul 22 19:09:38 arrakeen kernel:  process_one_work+0x17b/0x340
Jul 22 19:09:38 arrakeen kernel:  worker_thread+0x278/0x3b0
Jul 22 19:09:38 arrakeen kernel:  ? __pfx_worker_thread+0x10/0x10
Jul 22 19:09:38 arrakeen kernel:  kthread+0xcf/0x100
Jul 22 19:09:38 arrakeen kernel:  ? __pfx_kthread+0x10/0x10
Jul 22 19:09:38 arrakeen kernel:  ret_from_fork+0x31/0x50
Jul 22 19:09:38 arrakeen kernel:  ? __pfx_kthread+0x10/0x10
Jul 22 19:09:38 arrakeen kernel:  ret_from_fork_asm+0x1a/0x30
Jul 22 19:09:38 arrakeen kernel:  </TASK>
Jul 22 19:09:38 arrakeen kernel: ---[ end trace 0000000000000000 ]---
Jul 22 19:09:38 arrakeen kernel: ------------[ cut here ]------------
Jul 22 19:09:39 arrakeen kernel: WARNING: CPU: 8 PID: 1085 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:630 amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:39 arrakeen kernel: Modules linked in: uinput michael_mic snd_seq_dummy snd_hrtimer rfcomm qrtr_mhi nf_conntrack_netbios_ns nf_conntrack_broadcast nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4>
Jul 22 19:09:39 arrakeen kernel:  videobuf2_memops videobuf2_v4l2 btbcm snd_rpl_pci_acp6x snd_hda_core videobuf2_common kvm snd_acp_pci snd_hwdep btmtk videodev snd_acp_legacy_common libarc4 snd_seq snd_pci_acp6x bluetooth snd_pci_acp5x >
Jul 22 19:09:39 arrakeen kernel: CPU: 8 PID: 1085 Comm: kworker/u64:10 Tainted: G        W          6.9.10-200.fc40.x86_64 #1
Jul 22 19:09:39 arrakeen kernel: Hardware name: LENOVO 21F8CTO1WW/21F8CTO1WW, BIOS R2EET38W (1.19 ) 03/15/2024
Jul 22 19:09:39 arrakeen kernel: Workqueue: amdgpu-reset-dev drm_sched_job_timedout [gpu_sched]
Jul 22 19:09:39 arrakeen kernel: RIP: 0010:amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:39 arrakeen kernel: Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 fa 22 b4 c2 e9 1a fd ff ff <0f> 0b b8 ea ff ff ff e9 e9 22 b4 c2 b8 ea ff ff ff e9 df 22 b4 c2
Jul 22 19:09:39 arrakeen kernel: RSP: 0018:ffffaf548089fca8 EFLAGS: 00010246
Jul 22 19:09:39 arrakeen kernel: RAX: ffff90a281ffbd20 RBX: ffff90a29c680000 RCX: 0000000000000000
Jul 22 19:09:39 arrakeen kernel: RDX: 0000000000000000 RSI: ffff90a29c680c58 RDI: ffff90a29c680000
Jul 22 19:09:39 arrakeen kernel: RBP: ffff90a29c6c48e0 R08: 0000000000000001 R09: 0000000000000000
Jul 22 19:09:39 arrakeen kernel: R10: 0000000000000100 R11: 0000000000000000 R12: 0000000000000001
Jul 22 19:09:39 arrakeen kernel: R13: ffff90a29c680000 R14: 0000000000000001 R15: ffffffffc139f350
Jul 22 19:09:39 arrakeen kernel: FS:  0000000000000000(0000) GS:ffff90a9bee00000(0000) knlGS:0000000000000000
Jul 22 19:09:39 arrakeen kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jul 22 19:09:39 arrakeen kernel: CR2: 00007f5b89ba2808 CR3: 0000000491428000 CR4: 0000000000f50ef0
Jul 22 19:09:39 arrakeen kernel: PKRU: 55555554
Jul 22 19:09:39 arrakeen kernel: Call Trace:
Jul 22 19:09:39 arrakeen kernel:  <TASK>
Jul 22 19:09:39 arrakeen kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  ? __warn.cold+0x8e/0xe8
Jul 22 19:09:39 arrakeen kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  ? report_bug+0xff/0x140
Jul 22 19:09:39 arrakeen kernel:  ? handle_bug+0x3c/0x80
Jul 22 19:09:39 arrakeen kernel:  ? exc_invalid_op+0x17/0x70
Jul 22 19:09:39 arrakeen kernel:  ? asm_exc_invalid_op+0x1a/0x20
Jul 22 19:09:39 arrakeen kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  gmc_v11_0_hw_fini+0x24/0x90 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  gmc_v11_0_suspend+0xe/0x20 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  amdgpu_device_ip_suspend_phase2+0x141/0x5d0 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  ? amdgpu_device_ip_suspend_phase1+0x9a/0x180 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  amdgpu_device_ip_suspend+0x40/0x70 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  amdgpu_device_pre_asic_reset+0xcd/0x420 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  amdgpu_device_gpu_recover.cold+0x475/0xb44 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  amdgpu_job_timedout+0x18e/0x1d0 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  drm_sched_job_timedout+0x73/0x100 [gpu_sched]
Jul 22 19:09:39 arrakeen kernel:  process_one_work+0x17b/0x340
Jul 22 19:09:39 arrakeen kernel:  worker_thread+0x278/0x3b0
Jul 22 19:09:39 arrakeen kernel:  ? __pfx_worker_thread+0x10/0x10
Jul 22 19:09:39 arrakeen kernel:  kthread+0xcf/0x100
Jul 22 19:09:39 arrakeen kernel:  ? __pfx_kthread+0x10/0x10
Jul 22 19:09:39 arrakeen kernel:  ret_from_fork+0x31/0x50
Jul 22 19:09:39 arrakeen kernel:  ? __pfx_kthread+0x10/0x10
Jul 22 19:09:39 arrakeen kernel:  ret_from_fork_asm+0x1a/0x30
Jul 22 19:09:39 arrakeen kernel:  </TASK>
Jul 22 19:09:39 arrakeen kernel: ---[ end trace 0000000000000000 ]---
Jul 22 19:09:39 arrakeen kernel: amdgpu 0000:c3:00.0: amdgpu: MODE2 reset
Jul 22 19:09:39 arrakeen kernel: amdgpu 0000:c3:00.0: amdgpu: GPU reset succeeded, trying to resume
Jul 22 19:09:40 arrakeen abrt-dump-journal-oops[1727]: abrt-dump-journal-oops: Found oopses: 3
Jul 22 19:09:40 arrakeen abrt-dump-journal-oops[1727]: abrt-dump-journal-oops: Creating problem directories
Jul 22 19:09:40 arrakeen abrt-server[5910]: Package 'kernel-core' isn't signed with proper key
Jul 22 19:09:40 arrakeen abrt-server[5910]: 'post-create' on '/var/spool/abrt/oops-2024-07-22-19:09:40-1727-0' exited with 1
Jul 22 19:09:40 arrakeen abrt-server[5910]: Deleting problem directory '/var/spool/abrt/oops-2024-07-22-19:09:40-1727-0'
Jul 22 19:09:41 arrakeen abrt-server[5913]: Package 'kernel-core' isn't signed with proper key
Jul 22 19:09:41 arrakeen abrt-server[5913]: 'post-create' on '/var/spool/abrt/oops-2024-07-22-19:09:40-1727-1' exited with 1
Jul 22 19:09:41 arrakeen abrt-server[5913]: Deleting problem directory '/var/spool/abrt/oops-2024-07-22-19:09:40-1727-1'
Jul 22 19:09:42 arrakeen abrt-server[5916]: Package 'kernel-core' isn't signed with proper key
Jul 22 19:09:42 arrakeen abrt-server[5916]: 'post-create' on '/var/spool/abrt/oops-2024-07-22-19:09:40-1727-2' exited with 1
Jul 22 19:09:42 arrakeen abrt-server[5916]: Deleting problem directory '/var/spool/abrt/oops-2024-07-22-19:09:40-1727-2'
Jul 22 19:09:43 arrakeen abrt-dump-journal-oops[1727]: Reported 3 kernel oopses to Abrt

Comment 3 Ferry Huberts 2024-07-23 06:28:25 UTC
And another one.
Hope that is enough.

Jul 22 19:09:38 arrakeen kernel: [drm] Fence fallback timer expired on ring sdma0
Jul 22 19:09:38 arrakeen kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring vcn_unified_0 timeout, signaled seq=191, emitted seq=191
Jul 22 19:09:38 arrakeen kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process RDD Process pid 4134 thread firefox:cs0 pid 4315
Jul 22 19:09:38 arrakeen kernel: amdgpu 0000:c3:00.0: amdgpu: GPU reset begin!
Jul 22 19:09:38 arrakeen kernel: ------------[ cut here ]------------
Jul 22 19:09:38 arrakeen kernel: WARNING: CPU: 7 PID: 1085 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:630 amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel: Modules linked in: uinput michael_mic snd_seq_dummy snd_hrtimer rfcomm qrtr_mhi nf_conntrack_netbios_ns nf_conntrack_broadcast nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4>
Jul 22 19:09:38 arrakeen kernel:  videobuf2_memops videobuf2_v4l2 btbcm snd_rpl_pci_acp6x snd_hda_core videobuf2_common kvm snd_acp_pci snd_hwdep btmtk videodev snd_acp_legacy_common libarc4 snd_seq snd_pci_acp6x bluetooth snd_pci_acp5x >
Jul 22 19:09:38 arrakeen kernel: CPU: 7 PID: 1085 Comm: kworker/u64:10 Tainted: G        W          6.9.10-200.fc40.x86_64 #1
Jul 22 19:09:38 arrakeen kernel: Hardware name: LENOVO 21F8CTO1WW/21F8CTO1WW, BIOS R2EET38W (1.19 ) 03/15/2024
Jul 22 19:09:38 arrakeen kernel: Workqueue: amdgpu-reset-dev drm_sched_job_timedout [gpu_sched]
Jul 22 19:09:38 arrakeen kernel: RIP: 0010:amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel: Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 fa 22 b4 c2 e9 1a fd ff ff <0f> 0b b8 ea ff ff ff e9 e9 22 b4 c2 b8 ea ff ff ff e9 df 22 b4 c2
Jul 22 19:09:38 arrakeen kernel: RSP: 0018:ffffaf548089fca8 EFLAGS: 00010246
Jul 22 19:09:38 arrakeen kernel: RAX: ffff90a281ffb650 RBX: ffff90a29c680000 RCX: 0000000000000000
Jul 22 19:09:38 arrakeen kernel: RDX: 0000000000000000 RSI: ffff90a29c6a54b8 RDI: ffff90a29c680000
Jul 22 19:09:38 arrakeen kernel: RBP: ffff90a29c6c4930 R08: 0017ffffc0000000 R09: 0000000000000000
Jul 22 19:09:38 arrakeen kernel: R10: ffffaf548089fc58 R11: 0000000000000000 R12: 0000000000000006
Jul 22 19:09:38 arrakeen kernel: R13: ffff90a29c680000 R14: 0000000000000006 R15: ffffffffc13ce690
Jul 22 19:09:38 arrakeen kernel: FS:  0000000000000000(0000) GS:ffff90a9bed80000(0000) knlGS:0000000000000000
Jul 22 19:09:38 arrakeen kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jul 22 19:09:38 arrakeen kernel: CR2: 00007ff432333018 CR3: 0000000491428000 CR4: 0000000000f50ef0
Jul 22 19:09:38 arrakeen kernel: PKRU: 55555554
Jul 22 19:09:38 arrakeen kernel: Call Trace:
Jul 22 19:09:38 arrakeen kernel:  <TASK>
Jul 22 19:09:38 arrakeen kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  ? __warn.cold+0x8e/0xe8
Jul 22 19:09:38 arrakeen kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  ? report_bug+0xff/0x140
Jul 22 19:09:38 arrakeen kernel:  ? handle_bug+0x3c/0x80
Jul 22 19:09:38 arrakeen kernel:  ? exc_invalid_op+0x17/0x70
Jul 22 19:09:38 arrakeen kernel:  ? asm_exc_invalid_op+0x1a/0x20
Jul 22 19:09:38 arrakeen kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  gfx_v11_0_hw_fini+0x1b/0xf0 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  gfx_v11_0_suspend+0xe/0x20 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  amdgpu_device_ip_suspend_phase2+0x141/0x5d0 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  ? amdgpu_device_ip_suspend_phase1+0x9a/0x180 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  amdgpu_device_ip_suspend+0x40/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  amdgpu_device_pre_asic_reset+0xcd/0x420 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  amdgpu_device_gpu_recover.cold+0x475/0xb44 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  amdgpu_job_timedout+0x18e/0x1d0 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  drm_sched_job_timedout+0x73/0x100 [gpu_sched]
Jul 22 19:09:38 arrakeen kernel:  process_one_work+0x17b/0x340
Jul 22 19:09:38 arrakeen kernel:  worker_thread+0x278/0x3b0
Jul 22 19:09:38 arrakeen kernel:  ? __pfx_worker_thread+0x10/0x10
Jul 22 19:09:38 arrakeen kernel:  kthread+0xcf/0x100
Jul 22 19:09:38 arrakeen kernel:  ? __pfx_kthread+0x10/0x10
Jul 22 19:09:38 arrakeen kernel:  ret_from_fork+0x31/0x50
Jul 22 19:09:38 arrakeen kernel:  ? __pfx_kthread+0x10/0x10
Jul 22 19:09:38 arrakeen kernel:  ret_from_fork_asm+0x1a/0x30
Jul 22 19:09:38 arrakeen kernel:  </TASK>
Jul 22 19:09:38 arrakeen kernel: ---[ end trace 0000000000000000 ]---
Jul 22 19:09:38 arrakeen kernel: ------------[ cut here ]------------
Jul 22 19:09:38 arrakeen kernel: WARNING: CPU: 7 PID: 1085 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:630 amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel: Modules linked in: uinput michael_mic snd_seq_dummy snd_hrtimer rfcomm qrtr_mhi nf_conntrack_netbios_ns nf_conntrack_broadcast nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4>
Jul 22 19:09:38 arrakeen kernel:  videobuf2_memops videobuf2_v4l2 btbcm snd_rpl_pci_acp6x snd_hda_core videobuf2_common kvm snd_acp_pci snd_hwdep btmtk videodev snd_acp_legacy_common libarc4 snd_seq snd_pci_acp6x bluetooth snd_pci_acp5x >
Jul 22 19:09:38 arrakeen kernel: CPU: 7 PID: 1085 Comm: kworker/u64:10 Tainted: G        W          6.9.10-200.fc40.x86_64 #1
Jul 22 19:09:38 arrakeen kernel: Hardware name: LENOVO 21F8CTO1WW/21F8CTO1WW, BIOS R2EET38W (1.19 ) 03/15/2024
Jul 22 19:09:38 arrakeen kernel: Workqueue: amdgpu-reset-dev drm_sched_job_timedout [gpu_sched]
Jul 22 19:09:38 arrakeen kernel: RIP: 0010:amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel: Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 fa 22 b4 c2 e9 1a fd ff ff <0f> 0b b8 ea ff ff ff e9 e9 22 b4 c2 b8 ea ff ff ff e9 df 22 b4 c2
Jul 22 19:09:38 arrakeen kernel: RSP: 0018:ffffaf548089fca8 EFLAGS: 00010246
Jul 22 19:09:38 arrakeen kernel: RAX: ffff90a281ffbfc8 RBX: ffff90a29c680000 RCX: 0000000000000000
Jul 22 19:09:38 arrakeen kernel: RDX: 0000000000000000 RSI: ffff90a29c6a54d0 RDI: ffff90a29c680000
Jul 22 19:09:38 arrakeen kernel: RBP: ffff90a29c6c4930 R08: 0017ffffc0000000 R09: 0000000000000000
Jul 22 19:09:38 arrakeen kernel: R10: ffffaf548089fc58 R11: 0000000000000000 R12: 0000000000000006
Jul 22 19:09:38 arrakeen kernel: R13: ffff90a29c680000 R14: 0000000000000006 R15: ffffffffc13ce690
Jul 22 19:09:38 arrakeen kernel: FS:  0000000000000000(0000) GS:ffff90a9bed80000(0000) knlGS:0000000000000000
Jul 22 19:09:38 arrakeen kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jul 22 19:09:38 arrakeen kernel: CR2: 00007ff432333018 CR3: 0000000491428000 CR4: 0000000000f50ef0
Jul 22 19:09:38 arrakeen kernel: PKRU: 55555554
Jul 22 19:09:38 arrakeen kernel: Call Trace:
Jul 22 19:09:38 arrakeen kernel:  <TASK>
Jul 22 19:09:38 arrakeen kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  ? __warn.cold+0x8e/0xe8
Jul 22 19:09:38 arrakeen kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  ? report_bug+0xff/0x140
Jul 22 19:09:38 arrakeen kernel:  ? handle_bug+0x3c/0x80
Jul 22 19:09:38 arrakeen kernel:  ? exc_invalid_op+0x17/0x70
Jul 22 19:09:38 arrakeen kernel:  ? asm_exc_invalid_op+0x1a/0x20
Jul 22 19:09:38 arrakeen kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
Jul 22 19:09:38 arrakeen kernel:  gfx_v11_0_hw_fini+0x2c/0xf0 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  gfx_v11_0_suspend+0xe/0x20 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  amdgpu_device_ip_suspend_phase2+0x141/0x5d0 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  ? amdgpu_device_ip_suspend_phase1+0x9a/0x180 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  amdgpu_device_ip_suspend+0x40/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  amdgpu_device_pre_asic_reset+0xcd/0x420 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  amdgpu_device_gpu_recover.cold+0x475/0xb44 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  amdgpu_job_timedout+0x18e/0x1d0 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  drm_sched_job_timedout+0x73/0x100 [gpu_sched]
Jul 22 19:09:38 arrakeen kernel:  process_one_work+0x17b/0x340
Jul 22 19:09:38 arrakeen kernel:  worker_thread+0x278/0x3b0
Jul 22 19:09:38 arrakeen kernel:  ? __pfx_worker_thread+0x10/0x10
Jul 22 19:09:38 arrakeen kernel:  kthread+0xcf/0x100
Jul 22 19:09:38 arrakeen kernel:  ? __pfx_kthread+0x10/0x10
Jul 22 19:09:38 arrakeen kernel:  ret_from_fork+0x31/0x50
Jul 22 19:09:38 arrakeen kernel:  ? __pfx_kthread+0x10/0x10
Jul 22 19:09:38 arrakeen kernel:  ret_from_fork_asm+0x1a/0x30
Jul 22 19:09:38 arrakeen kernel:  </TASK>
Jul 22 19:09:38 arrakeen kernel: ---[ end trace 0000000000000000 ]---
Jul 22 19:09:38 arrakeen kernel: ------------[ cut here ]------------
Jul 22 19:09:39 arrakeen kernel: WARNING: CPU: 8 PID: 1085 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:630 amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:39 arrakeen kernel: Modules linked in: uinput michael_mic snd_seq_dummy snd_hrtimer rfcomm qrtr_mhi nf_conntrack_netbios_ns nf_conntrack_broadcast nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4>
Jul 22 19:09:39 arrakeen kernel:  videobuf2_memops videobuf2_v4l2 btbcm snd_rpl_pci_acp6x snd_hda_core videobuf2_common kvm snd_acp_pci snd_hwdep btmtk videodev snd_acp_legacy_common libarc4 snd_seq snd_pci_acp6x bluetooth snd_pci_acp5x >
Jul 22 19:09:39 arrakeen kernel: CPU: 8 PID: 1085 Comm: kworker/u64:10 Tainted: G        W          6.9.10-200.fc40.x86_64 #1
Jul 22 19:09:39 arrakeen kernel: Hardware name: LENOVO 21F8CTO1WW/21F8CTO1WW, BIOS R2EET38W (1.19 ) 03/15/2024
Jul 22 19:09:39 arrakeen kernel: Workqueue: amdgpu-reset-dev drm_sched_job_timedout [gpu_sched]
Jul 22 19:09:39 arrakeen kernel: RIP: 0010:amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:39 arrakeen kernel: Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 fa 22 b4 c2 e9 1a fd ff ff <0f> 0b b8 ea ff ff ff e9 e9 22 b4 c2 b8 ea ff ff ff e9 df 22 b4 c2
Jul 22 19:09:39 arrakeen kernel: RSP: 0018:ffffaf548089fca8 EFLAGS: 00010246
Jul 22 19:09:39 arrakeen kernel: RAX: ffff90a281ffbd20 RBX: ffff90a29c680000 RCX: 0000000000000000
Jul 22 19:09:39 arrakeen kernel: RDX: 0000000000000000 RSI: ffff90a29c680c58 RDI: ffff90a29c680000
Jul 22 19:09:39 arrakeen kernel: RBP: ffff90a29c6c48e0 R08: 0000000000000001 R09: 0000000000000000
Jul 22 19:09:39 arrakeen kernel: R10: 0000000000000100 R11: 0000000000000000 R12: 0000000000000001
Jul 22 19:09:39 arrakeen kernel: R13: ffff90a29c680000 R14: 0000000000000001 R15: ffffffffc139f350
Jul 22 19:09:39 arrakeen kernel: FS:  0000000000000000(0000) GS:ffff90a9bee00000(0000) knlGS:0000000000000000
Jul 22 19:09:39 arrakeen kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jul 22 19:09:39 arrakeen kernel: CR2: 00007f5b89ba2808 CR3: 0000000491428000 CR4: 0000000000f50ef0
Jul 22 19:09:39 arrakeen kernel: PKRU: 55555554
Jul 22 19:09:39 arrakeen kernel: Call Trace:
Jul 22 19:09:39 arrakeen kernel:  <TASK>
Jul 22 19:09:39 arrakeen kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  ? __warn.cold+0x8e/0xe8
Jul 22 19:09:39 arrakeen kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  ? report_bug+0xff/0x140
Jul 22 19:09:39 arrakeen kernel:  ? handle_bug+0x3c/0x80
Jul 22 19:09:39 arrakeen kernel:  ? exc_invalid_op+0x17/0x70
Jul 22 19:09:39 arrakeen kernel:  ? asm_exc_invalid_op+0x1a/0x20
Jul 22 19:09:39 arrakeen kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  gmc_v11_0_hw_fini+0x24/0x90 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  gmc_v11_0_suspend+0xe/0x20 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  amdgpu_device_ip_suspend_phase2+0x141/0x5d0 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  ? amdgpu_device_ip_suspend_phase1+0x9a/0x180 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  amdgpu_device_ip_suspend+0x40/0x70 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  amdgpu_device_pre_asic_reset+0xcd/0x420 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  amdgpu_device_gpu_recover.cold+0x475/0xb44 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  amdgpu_job_timedout+0x18e/0x1d0 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  drm_sched_job_timedout+0x73/0x100 [gpu_sched]
Jul 22 19:09:39 arrakeen kernel:  process_one_work+0x17b/0x340
Jul 22 19:09:39 arrakeen kernel:  worker_thread+0x278/0x3b0
Jul 22 19:09:39 arrakeen kernel:  ? __pfx_worker_thread+0x10/0x10
Jul 22 19:09:39 arrakeen kernel:  kthread+0xcf/0x100
Jul 22 19:09:39 arrakeen kernel:  ? __pfx_kthread+0x10/0x10
Jul 22 19:09:39 arrakeen kernel:  ret_from_fork+0x31/0x50
Jul 22 19:09:39 arrakeen kernel:  ? __pfx_kthread+0x10/0x10
Jul 22 19:09:39 arrakeen kernel:  ret_from_fork_asm+0x1a/0x30
Jul 22 19:09:39 arrakeen kernel:  </TASK>
Jul 22 19:09:39 arrakeen kernel: ---[ end trace 0000000000000000 ]---
Jul 22 19:09:39 arrakeen kernel: amdgpu 0000:c3:00.0: amdgpu: MODE2 reset
Jul 22 19:09:39 arrakeen kernel: amdgpu 0000:c3:00.0: amdgpu: GPU reset succeeded, trying to resume
Jul 22 19:09:40 arrakeen abrt-dump-journal-oops[1727]: abrt-dump-journal-oops: Found oopses: 3
Jul 22 19:09:40 arrakeen abrt-dump-journal-oops[1727]: abrt-dump-journal-oops: Creating problem directories
Jul 22 19:09:40 arrakeen abrt-server[5910]: Package 'kernel-core' isn't signed with proper key
Jul 22 19:09:40 arrakeen abrt-server[5910]: 'post-create' on '/var/spool/abrt/oops-2024-07-22-19:09:40-1727-0' exited with 1
Jul 22 19:09:40 arrakeen abrt-server[5910]: Deleting problem directory '/var/spool/abrt/oops-2024-07-22-19:09:40-1727-0'
Jul 22 19:09:41 arrakeen abrt-server[5913]: Package 'kernel-core' isn't signed with proper key
Jul 22 19:09:41 arrakeen abrt-server[5913]: 'post-create' on '/var/spool/abrt/oops-2024-07-22-19:09:40-1727-1' exited with 1
Jul 22 19:09:41 arrakeen abrt-server[5913]: Deleting problem directory '/var/spool/abrt/oops-2024-07-22-19:09:40-1727-1'
Jul 22 19:09:42 arrakeen abrt-server[5916]: Package 'kernel-core' isn't signed with proper key
Jul 22 19:09:42 arrakeen abrt-server[5916]: 'post-create' on '/var/spool/abrt/oops-2024-07-22-19:09:40-1727-2' exited with 1
Jul 22 19:09:42 arrakeen abrt-server[5916]: Deleting problem directory '/var/spool/abrt/oops-2024-07-22-19:09:40-1727-2'
Jul 22 19:09:43 arrakeen abrt-dump-journal-oops[1727]: Reported 3 kernel oopses to Abrt
Jul 22 19:09:48 arrakeen geoclue[2223]: Service not used for 60 seconds. Shutting down..
Jul 22 19:09:48 arrakeen systemd[1]: geoclue.service: Deactivated successfully.
Jul 22 19:09:48 arrakeen audit[1]: SERVICE_STOP pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=geoclue comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
Jul 22 19:09:49 arrakeen realmd[2266]: quitting realmd service after timeout
Jul 22 19:09:49 arrakeen realmd[2266]: stopping service
Jul 22 19:09:49 arrakeen systemd[1]: realmd.service: Deactivated successfully.
Jul 22 19:09:49 arrakeen audit[1]: SERVICE_STOP pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=realmd comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
Jul 22 19:09:52 arrakeen kernel: amdgpu 0000:c3:00.0: amdgpu: failed to write reg 1a774 wait reg 1a786
Jul 22 19:09:52 arrakeen kernel: [drm] PCIE GART of 512M enabled (table at 0x0000008000900000).
Jul 22 19:09:52 arrakeen kernel: amdgpu 0000:c3:00.0: amdgpu: SMU is resuming...
Jul 22 19:09:52 arrakeen kernel: amdgpu 0000:c3:00.0: amdgpu: SMU is resumed successfully!
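For triage, the useful part of the log above is the pair of `amdgpu_job_timedout` lines that precede the reset: they name the ring that timed out (`vcn_unified_0`) and the process that owned the job (`RDD Process`, thread `firefox:cs0`). A minimal sketch for pulling those fields out of saved journal text follows. The function name `summarize_timeouts` and the regular expressions are illustrative assumptions matched only against the line format shown in this report, not an official parser for amdgpu messages.

```python
import re

# Matches the ring-timeout line in the format seen in this report (assumed stable).
TIMEOUT_RE = re.compile(
    r"\*ERROR\* ring (?P<ring>\S+) timeout, "
    r"signaled seq=(?P<signaled>\d+), emitted seq=(?P<emitted>\d+)"
)
# Matches the follow-up line that names the process owning the timed-out job.
PROCESS_RE = re.compile(
    r"\*ERROR\* Process information: process (?P<process>.+?) pid (?P<pid>\d+) "
    r"thread (?P<thread>.+?) pid (?P<tid>\d+)"
)

# Two lines copied from the log in this comment.
SAMPLE = [
    "Jul 22 19:09:38 arrakeen kernel: [drm:amdgpu_job_timedout [amdgpu]] "
    "*ERROR* ring vcn_unified_0 timeout, signaled seq=191, emitted seq=191",
    "Jul 22 19:09:38 arrakeen kernel: [drm:amdgpu_job_timedout [amdgpu]] "
    "*ERROR* Process information: process RDD Process pid 4134 "
    "thread firefox:cs0 pid 4315",
]


def summarize_timeouts(lines):
    """Return one dict per ring timeout; attach process info from the next matching line."""
    events = []
    for line in lines:
        m = TIMEOUT_RE.search(line)
        if m:
            events.append({
                "ring": m["ring"],
                "signaled": int(m["signaled"]),
                "emitted": int(m["emitted"]),
                "process": None,
                "thread": None,
            })
            continue
        m = PROCESS_RE.search(line)
        # Only fill in process info for a timeout that does not have it yet.
        if m and events and events[-1]["process"] is None:
            events[-1]["process"] = m["process"]
            events[-1]["thread"] = m["thread"]
    return events


if __name__ == "__main__":
    for event in summarize_timeouts(SAMPLE):
        print(event)
```

Feeding the whole journal through this function gives one entry per hang, which makes it easier to see whether every timeout is on the same ring and comes from the same process.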

Comment 4 José Expósito 2024-07-24 08:52:51 UTC
Could this be a duplicate of https://bugzilla.redhat.com/show_bug.cgi?id=2299031 ?

The solution proposed in the upstream bug report:
https://gitlab.freedesktop.org/mesa/mesa/-/issues/11533

is already applied in Fedora, in mesa-24.1.4-3.fc40:
https://bodhi.fedoraproject.org/updates/FEDORA-2024-face82e699
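
To check whether a machine is still on the affected 24.1.4-2.fc40 build or already at mesa-24.1.4-3.fc40 or later, the version-release strings can be compared. The sketch below is a deliberately simplified comparison, not RPM's own version-comparison algorithm: it assumes a purely numeric dotted version and looks only at the leading number of the release. The helper names `parse_evr` and `has_fix` are hypothetical. The input string is assumed to come from something like `rpm -q --qf '%{VERSION}-%{RELEASE}\n' mesa-dri-drivers`.

```python
import re

# Build named in the comment above as already carrying the proposed fix.
FIXED_BUILD = "24.1.4-3.fc40"


def parse_evr(evr):
    """Split 'VERSION-RELEASE' into a comparable key.

    Simplification (assumption): the version is dotted integers and only the
    leading number of the release matters. This is not rpmvercmp.
    """
    version, _, release = evr.partition("-")
    version_key = tuple(int(part) for part in version.split("."))
    match = re.match(r"\d+", release)
    release_key = int(match.group()) if match else 0
    return version_key, release_key


def has_fix(installed, fixed=FIXED_BUILD):
    """Return True if the installed build is at or past the fixed build."""
    return parse_evr(installed) >= parse_evr(fixed)


if __name__ == "__main__":
    for build in ("24.1.4-2.fc40", "24.1.4-3.fc40", "24.3.4-3.fc41"):
        print(build, has_fix(build))
```

A result of `True` only says the package is at or past that build; whether the hangs actually stop still has to be confirmed on the affected hardware.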

Comment 5 Ferry Huberts 2024-07-24 09:39:33 UTC
Might be.

However, on my ThinkPad Z16 Gen 1 I see corrupted videos, but no hangs.

Comment 6 Fedora Update System 2025-01-29 15:35:22 UTC
FEDORA-2025-a24369766c (mesa-24.3.4-3.fc41) has been submitted as an update to Fedora 41.
https://bodhi.fedoraproject.org/updates/FEDORA-2025-a24369766c

Comment 7 Fedora Update System 2025-01-31 03:08:24 UTC
FEDORA-2025-a24369766c (mesa-24.3.4-3.fc41) has been pushed to the Fedora 41 stable repository.
If the problem still persists, please make note of it in this bug report.

