Bug 2299438

Summary: 24.1.4-2.fc40 update makes AMD GPU hang all the time
Product: [Fedora] Fedora
Reporter: Ferry Huberts <mailings>
Component: mesa
Assignee: Adam Jackson <ajax>
Status: CLOSED ERRATA
QA Contact: Fedora Extras Quality Assurance <extras-qa>
Severity: high
Docs Contact:
Priority: unspecified
Version: 40
CC: ajax, bskeggs, igor.raits, jexposit, j, lyude, rhughes, rstrode, suraj.ghimire7, tstellar, walter.pete
Target Milestone: ---
Keywords: Desktop
Target Release: ---
Hardware: x86_64
OS: Linux
Whiteboard:
Fixed In Version: mesa-24.3.4-3.fc41
Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of:
Environment:
Last Closed: 2025-01-31 03:08:24 UTC
Type: ---
Regression: ---
Mount Type: ---
Documentation: ---
CRM:
Verified Versions:
Category: ---
oVirt Team: ---
RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: ---
Target Upstream Version:
Embargoed:

Description Ferry Huberts 2024-07-23 06:18:32 UTC
The 24.1.4-2.fc40 mesa update makes my AMD GPU hang constantly, roughly every 10 seconds, and after a few hangs it can no longer recover.

Thinkpad T14s Gen 4 AMD

Architecture:             x86_64
  CPU op-mode(s):         32-bit, 64-bit
  Address sizes:          48 bits physical, 48 bits virtual
  Byte Order:             Little Endian
CPU(s):                   16
  On-line CPU(s) list:    0-15
Vendor ID:                AuthenticAMD
  Model name:             AMD Ryzen 7 PRO 7840U w/ Radeon 780M Graphics
    CPU family:           25
    Model:                116
    Thread(s) per core:   2
    Core(s) per socket:   8
    Socket(s):            1
    Stepping:             1
    CPU(s) scaling MHz:   24%
    CPU max MHz:          5132.0000
    CPU min MHz:          400.0000
    BogoMIPS:             6587.59
    Flags:                fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm constant_tsc rep_good amd_lbr_v2 nopl xtopology nonstop_tsc cpuid extd_apicid aperfmperf rapl pni pclmulqdq monitor ssse3 fma cx16 sse4_1 sse4_2 x2apic movbe popcnt aes xsave avx f16c rdrand lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw ibs skinit wdt tce topoext perfctr_core perfctr_nb bpext perfctr_llc mwaitx cpb cat_l3 cdp_l3 hw_pstate ssbd mba perfmon_v2 ibrs ibpb stibp ibrs_enhanced vmmcall fsgsbase bmi1 avx2 smep bmi2 erms invpcid cqm rdt_a avx512f avx512dq rdseed adx smap avx512ifma clflushopt clwb avx512cd sha_ni avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local user_shstk avx512_bf16 clzero irperf xsaveerptr rdpru wbnoinvd cppc arat npt lbrv svm_lock nrip_save tsc_scale vmcb_clean flushbyasid decodeassists pausefilter pfthreshold v_vmsave_vmload vgif x2avic v_spec_ctrl vnmi avx512vbmi umip pku ospke avx512_vbmi2 gfni vaes vpclmulqdq avx512_vnni avx512_bitalg avx512_vpopcntdq rdpid overflow_recov succor smca fsrm flush_l1d amd_lbr_pmc_freeze
Virtualization features:  
  Virtualization:         AMD-V
Caches (sum of all):      
  L1d:                    256 KiB (8 instances)
  L1i:                    256 KiB (8 instances)
  L2:                     8 MiB (8 instances)
  L3:                     16 MiB (1 instance)
NUMA:                     
  NUMA node(s):           1
  NUMA node0 CPU(s):      0-15
Vulnerabilities:          
  Gather data sampling:   Not affected
  Itlb multihit:          Not affected
  L1tf:                   Not affected
  Mds:                    Not affected
  Meltdown:               Not affected
  Mmio stale data:        Not affected
  Reg file data sampling: Not affected
  Retbleed:               Not affected
  Spec rstack overflow:   Mitigation; Safe RET
  Spec store bypass:      Mitigation; Speculative Store Bypass disabled via prctl
  Spectre v1:             Mitigation; usercopy/swapgs barriers and __user pointer sanitization
  Spectre v2:             Mitigation; Enhanced / Automatic IBRS; IBPB conditional; STIBP always-on; RSB filling; PBRSB-eIBRS Not affected; BHI Not affected
  Srbds:                  Not affected
  Tsx async abort:        Not affected


Reproducible: Always

Steps to Reproduce:
1. Install the mesa 24.1.4-2.fc40 update on the ThinkPad (see description).

Actual Results:
Very frequent GPU hangs, unrecoverable after a few hangs.

Expected Results:  
no hangs

Downgrading to mesa 24.0.5-1.fc40 gets rid of the hangs.

Comment 1 Ferry Huberts 2024-07-23 06:24:35 UTC
I think this is one of the traces from the log, but somehow only a few of the hangs show up in the log.


Jul 22 19:09:38 arrakeen kernel:  ret_from_fork_asm+0x1a/0x30
Jul 22 19:09:38 arrakeen kernel:  </TASK>
Jul 22 19:09:38 arrakeen kernel: ---[ end trace 0000000000000000 ]---
Jul 22 19:09:38 arrakeen kernel: ------------[ cut here ]------------
Jul 22 19:09:39 arrakeen kernel: WARNING: CPU: 8 PID: 1085 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:630 amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:39 arrakeen kernel: Modules linked in: uinput michael_mic snd_seq_dummy snd_hrtimer rfcomm qrtr_mhi nf_conntrack_netbios_ns nf_conntrack_broadcast nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4>
Jul 22 19:09:39 arrakeen kernel:  videobuf2_memops videobuf2_v4l2 btbcm snd_rpl_pci_acp6x snd_hda_core videobuf2_common kvm snd_acp_pci snd_hwdep btmtk videodev snd_acp_legacy_common libarc4 snd_seq snd_pci_acp6x bluetooth snd_pci_acp5x >
Jul 22 19:09:39 arrakeen kernel: CPU: 8 PID: 1085 Comm: kworker/u64:10 Tainted: G        W          6.9.10-200.fc40.x86_64 #1
Jul 22 19:09:39 arrakeen kernel: Hardware name: LENOVO 21F8CTO1WW/21F8CTO1WW, BIOS R2EET38W (1.19 ) 03/15/2024
Jul 22 19:09:39 arrakeen kernel: Workqueue: amdgpu-reset-dev drm_sched_job_timedout [gpu_sched]
Jul 22 19:09:39 arrakeen kernel: RIP: 0010:amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:39 arrakeen kernel: Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 fa 22 b4 c2 e9 1a fd ff ff <0f> 0b b8 ea ff ff ff e9 e9 22 b4 c2 b8 ea ff ff ff e9 df 22 b4 c2
Jul 22 19:09:39 arrakeen kernel: RSP: 0018:ffffaf548089fca8 EFLAGS: 00010246
Jul 22 19:09:39 arrakeen kernel: RAX: ffff90a281ffbd20 RBX: ffff90a29c680000 RCX: 0000000000000000
Jul 22 19:09:39 arrakeen kernel: RDX: 0000000000000000 RSI: ffff90a29c680c58 RDI: ffff90a29c680000
Jul 22 19:09:39 arrakeen kernel: RBP: ffff90a29c6c48e0 R08: 0000000000000001 R09: 0000000000000000
Jul 22 19:09:39 arrakeen kernel: R10: 0000000000000100 R11: 0000000000000000 R12: 0000000000000001
Jul 22 19:09:39 arrakeen kernel: R13: ffff90a29c680000 R14: 0000000000000001 R15: ffffffffc139f350
Jul 22 19:09:39 arrakeen kernel: FS:  0000000000000000(0000) GS:ffff90a9bee00000(0000) knlGS:0000000000000000
Jul 22 19:09:39 arrakeen kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jul 22 19:09:39 arrakeen kernel: CR2: 00007f5b89ba2808 CR3: 0000000491428000 CR4: 0000000000f50ef0
Jul 22 19:09:39 arrakeen kernel: PKRU: 55555554
Jul 22 19:09:39 arrakeen kernel: Call Trace:
Jul 22 19:09:39 arrakeen kernel:  <TASK>
Jul 22 19:09:39 arrakeen kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  ? __warn.cold+0x8e/0xe8
Jul 22 19:09:39 arrakeen kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  ? report_bug+0xff/0x140
Jul 22 19:09:39 arrakeen kernel:  ? handle_bug+0x3c/0x80
Jul 22 19:09:39 arrakeen kernel:  ? exc_invalid_op+0x17/0x70
Jul 22 19:09:39 arrakeen kernel:  ? asm_exc_invalid_op+0x1a/0x20
Jul 22 19:09:39 arrakeen kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  gmc_v11_0_hw_fini+0x24/0x90 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  gmc_v11_0_suspend+0xe/0x20 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  amdgpu_device_ip_suspend_phase2+0x141/0x5d0 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  ? amdgpu_device_ip_suspend_phase1+0x9a/0x180 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  amdgpu_device_ip_suspend+0x40/0x70 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  amdgpu_device_pre_asic_reset+0xcd/0x420 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  amdgpu_device_gpu_recover.cold+0x475/0xb44 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  amdgpu_job_timedout+0x18e/0x1d0 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  drm_sched_job_timedout+0x73/0x100 [gpu_sched]
Jul 22 19:09:39 arrakeen kernel:  process_one_work+0x17b/0x340
Jul 22 19:09:39 arrakeen kernel:  worker_thread+0x278/0x3b0
Jul 22 19:09:39 arrakeen kernel:  ? __pfx_worker_thread+0x10/0x10
Jul 22 19:09:39 arrakeen kernel:  kthread+0xcf/0x100
Jul 22 19:09:39 arrakeen kernel:  ? __pfx_kthread+0x10/0x10
Jul 22 19:09:39 arrakeen kernel:  ret_from_fork+0x31/0x50
Jul 22 19:09:39 arrakeen kernel:  ? __pfx_kthread+0x10/0x10
Jul 22 19:09:39 arrakeen kernel:  ret_from_fork_asm+0x1a/0x30
Jul 22 19:09:39 arrakeen kernel:  </TASK>
Jul 22 19:09:39 arrakeen kernel: ---[ end trace 0000000000000000 ]---
Jul 22 19:09:39 arrakeen kernel: amdgpu 0000:c3:00.0: amdgpu: MODE2 reset
Jul 22 19:09:39 arrakeen kernel: amdgpu 0000:c3:00.0: amdgpu: GPU reset succeeded, trying to resume
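
To check how many hangs actually reached the log, one option is to save the journal to a text file and count the amdgpu ring-timeout and recovery-failure messages. The following is a minimal Python sketch, not part of the original report: the function name `summarize_hangs` and the `SAMPLE` text are illustrative assumptions, and the regular expressions assume only the message format seen in this machine's journal.

```python
import re
from collections import Counter

# Patterns for the amdgpu messages as they appear in this journal excerpt.
# Other kernel versions may word these messages differently (assumption).
TIMEOUT_RE = re.compile(
    r"\*ERROR\* ring (?P<ring>\S+) timeout, "
    r"signaled seq=(?P<signaled>\d+), emitted seq=(?P<emitted>\d+)"
)
RECOVERY_FAILED_RE = re.compile(r"GPU Recovery Failed: (?P<code>-?\d+)")


def summarize_hangs(journal_text):
    """Return (timeouts per ring, list of failed-recovery error codes)."""
    timeouts = Counter()
    failures = []
    for line in journal_text.splitlines():
        match = TIMEOUT_RE.search(line)
        if match:
            timeouts[match.group("ring")] += 1
            continue
        match = RECOVERY_FAILED_RE.search(line)
        if match:
            failures.append(int(match.group("code")))
    return timeouts, failures


# Three lines copied from the journal of the affected machine.
SAMPLE = """\
Jul 22 19:09:26 arrakeen kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring vcn_unified_0 timeout, signaled seq=189, emitted seq=191
Jul 22 19:09:28 arrakeen kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* GPU Recovery Failed: -110
Jul 22 19:09:38 arrakeen kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring vcn_unified_0 timeout, signaled seq=191, emitted seq=191
"""

if __name__ == "__main__":
    timeouts, failures = summarize_hangs(SAMPLE)
    for ring, count in timeouts.items():
        print(f"{ring}: {count} timeout(s)")
    print(f"failed recoveries: {failures}")
```

Running the same function over a full journal export, rather than the three sample lines, would show whether the number of logged timeouts matches the number of hangs observed on screen.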

Comment 2 Ferry Huberts 2024-07-23 06:27:04 UTC
This is the more complete dump:

Jul 22 19:09:26 arrakeen systemd[1]: systemd-hostnamed.service: Deactivated successfully.
Jul 22 19:09:26 arrakeen audit[1]: SERVICE_STOP pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=systemd-hostnamed comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=succes>
Jul 22 19:09:26 arrakeen audit: BPF prog-id=67 op=UNLOAD
Jul 22 19:09:26 arrakeen audit: BPF prog-id=66 op=UNLOAD
Jul 22 19:09:26 arrakeen audit: BPF prog-id=65 op=UNLOAD
Jul 22 19:09:26 arrakeen kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring vcn_unified_0 timeout, signaled seq=189, emitted seq=191
Jul 22 19:09:26 arrakeen kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process RDD Process pid 4134 thread firefox:cs0 pid 4315
Jul 22 19:09:26 arrakeen kernel: amdgpu 0000:c3:00.0: amdgpu: GPU reset begin!
Jul 22 19:09:27 arrakeen kernel: [drm] Register(0) [regUVD_POWER_STATUS] failed to reach value 0x00000001 != 0x00000002n
Jul 22 19:09:27 arrakeen kernel: [drm] Register(0) [regUVD_RB_RPTR] failed to reach value 0x00000000 != 0x00000380n
Jul 22 19:09:27 arrakeen kernel: [drm] Register(0) [regUVD_POWER_STATUS] failed to reach value 0x00000001 != 0x00000002n
Jul 22 19:09:27 arrakeen kernel: amdgpu 0000:c3:00.0: amdgpu: MODE2 reset
Jul 22 19:09:27 arrakeen kernel: amdgpu 0000:c3:00.0: amdgpu: GPU reset succeeded, trying to resume
Jul 22 19:09:27 arrakeen kernel: [drm] PCIE GART of 512M enabled (table at 0x0000008000900000).
Jul 22 19:09:27 arrakeen kernel: amdgpu 0000:c3:00.0: amdgpu: SMU is resuming...
Jul 22 19:09:27 arrakeen kernel: amdgpu 0000:c3:00.0: amdgpu: SMU is resumed successfully!
Jul 22 19:09:27 arrakeen kernel: [drm] DMUB hardware initialized: version=0x08003D00
Jul 22 19:09:28 arrakeen kernel: [drm] kiq ring mec 3 pipe 1 q 0
Jul 22 19:09:28 arrakeen kernel: amdgpu 0000:c3:00.0: [drm:amdgpu_ring_test_helper [amdgpu]] *ERROR* ring vcn_unified_0 test failed (-110)
Jul 22 19:09:28 arrakeen kernel: [drm:amdgpu_device_ip_resume_phase2 [amdgpu]] *ERROR* resume of IP block <vcn_v4_0> failed -110
Jul 22 19:09:28 arrakeen kernel: amdgpu 0000:c3:00.0: amdgpu: GPU reset(1) failed
Jul 22 19:09:28 arrakeen kernel: amdgpu 0000:c3:00.0: amdgpu: GPU reset end with ret = -110
Jul 22 19:09:28 arrakeen kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* GPU Recovery Failed: -110
Jul 22 19:09:28 arrakeen kernel: [drm] Fence fallback timer expired on ring sdma0
Jul 22 19:09:28 arrakeen kernel: [drm] Fence fallback timer expired on ring gfx_0.0.0
Jul 22 19:09:29 arrakeen kernel: [drm] Fence fallback timer expired on ring gfx_0.0.0
Jul 22 19:09:29 arrakeen kernel: [drm] Fence fallback timer expired on ring sdma0
Jul 22 19:09:29 arrakeen kernel: [drm] Register(0) [regUVD_POWER_STATUS] failed to reach value 0x00000001 != 0x00000002n
Jul 22 19:09:29 arrakeen kernel: [drm] Register(0) [regUVD_RB_RPTR] failed to reach value 0x00000040 != 0x00000000n
Jul 22 19:09:29 arrakeen kernel: [drm] Register(0) [regUVD_POWER_STATUS] failed to reach value 0x00000001 != 0x00000002n
Jul 22 19:09:29 arrakeen kernel: [drm] Fence fallback timer expired on ring sdma0
Jul 22 19:09:29 arrakeen kernel: [drm] Fence fallback timer expired on ring gfx_0.0.0
Jul 22 19:09:29 arrakeen systemd[1]: systemd-timedated.service: Deactivated successfully.
Jul 22 19:09:29 arrakeen audit[1]: SERVICE_STOP pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=systemd-timedated comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=succes>
Jul 22 19:09:29 arrakeen audit: BPF prog-id=73 op=UNLOAD
Jul 22 19:09:30 arrakeen kernel: [drm] Fence fallback timer expired on ring gfx_0.0.0
Jul 22 19:09:30 arrakeen kernel: [drm] Fence fallback timer expired on ring sdma0
Jul 22 19:09:30 arrakeen kernel: [drm] Fence fallback timer expired on ring sdma0
Jul 22 19:09:31 arrakeen audit: BPF prog-id=75 op=UNLOAD
Jul 22 19:09:31 arrakeen audit: BPF prog-id=74 op=UNLOAD
Jul 22 19:09:31 arrakeen kernel: [drm] Fence fallback timer expired on ring sdma0
Jul 22 19:09:31 arrakeen kernel: [drm] Fence fallback timer expired on ring sdma0
Jul 22 19:09:32 arrakeen kernel: [drm] Fence fallback timer expired on ring gfx_0.0.0
Jul 22 19:09:32 arrakeen kernel: [drm] Fence fallback timer expired on ring sdma0
Jul 22 19:09:32 arrakeen kernel: [drm] Fence fallback timer expired on ring gfx_0.0.0
Jul 22 19:09:33 arrakeen kernel: [drm] Fence fallback timer expired on ring gfx_0.0.0
Jul 22 19:09:33 arrakeen kernel: [drm] Fence fallback timer expired on ring sdma0
Jul 22 19:09:33 arrakeen systemd[1]: systemd-localed.service: Deactivated successfully.
Jul 22 19:09:33 arrakeen audit[1]: SERVICE_STOP pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=systemd-localed comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
Jul 22 19:09:33 arrakeen kernel: [drm] Fence fallback timer expired on ring sdma0
Jul 22 19:09:33 arrakeen kernel: [drm] Fence fallback timer expired on ring gfx_0.0.0
Jul 22 19:09:34 arrakeen kernel: [drm] Fence fallback timer expired on ring sdma0
Jul 22 19:09:34 arrakeen kernel: [drm] Fence fallback timer expired on ring gfx_0.0.0
Jul 22 19:09:34 arrakeen kernel: [drm] Fence fallback timer expired on ring gfx_0.0.0
Jul 22 19:09:34 arrakeen kernel: [drm] Fence fallback timer expired on ring sdma0
Jul 22 19:09:34 arrakeen audit: BPF prog-id=70 op=UNLOAD
Jul 22 19:09:34 arrakeen audit: BPF prog-id=69 op=UNLOAD
Jul 22 19:09:34 arrakeen audit: BPF prog-id=68 op=UNLOAD
Jul 22 19:09:35 arrakeen kernel: [drm] Fence fallback timer expired on ring gfx_0.0.0
Jul 22 19:09:35 arrakeen gnome-shell[2774]: meta_wayland_buffer_process_damage: assertion 'buffer->resource' failed
Jul 22 19:09:35 arrakeen kernel: [drm] Fence fallback timer expired on ring sdma0
Jul 22 19:09:35 arrakeen kernel: [drm] Fence fallback timer expired on ring gfx_0.0.0
Jul 22 19:09:36 arrakeen kernel: [drm] Fence fallback timer expired on ring sdma0
Jul 22 19:09:36 arrakeen kernel: [drm] Fence fallback timer expired on ring gfx_0.0.0
Jul 22 19:09:36 arrakeen kernel: [drm] Fence fallback timer expired on ring gfx_0.0.0
Jul 22 19:09:37 arrakeen kernel: [drm] Fence fallback timer expired on ring gfx_0.0.0
Jul 22 19:09:37 arrakeen kernel: [drm] Fence fallback timer expired on ring sdma0
Jul 22 19:09:37 arrakeen gnome-shell[2774]: meta_wayland_buffer_process_damage: assertion 'buffer->resource' failed
Jul 22 19:09:37 arrakeen kernel: [drm] Fence fallback timer expired on ring gfx_0.0.0
Jul 22 19:09:38 arrakeen kernel: [drm] Fence fallback timer expired on ring sdma0
Jul 22 19:09:38 arrakeen kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring vcn_unified_0 timeout, signaled seq=191, emitted seq=191
Jul 22 19:09:38 arrakeen kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process RDD Process pid 4134 thread firefox:cs0 pid 4315
Jul 22 19:09:38 arrakeen kernel: amdgpu 0000:c3:00.0: amdgpu: GPU reset begin!
Jul 22 19:09:38 arrakeen kernel: ------------[ cut here ]------------
Jul 22 19:09:38 arrakeen kernel: WARNING: CPU: 7 PID: 1085 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:630 amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel: Modules linked in: uinput michael_mic snd_seq_dummy snd_hrtimer rfcomm qrtr_mhi nf_conntrack_netbios_ns nf_conntrack_broadcast nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4>
Jul 22 19:09:38 arrakeen kernel:  videobuf2_memops videobuf2_v4l2 btbcm snd_rpl_pci_acp6x snd_hda_core videobuf2_common kvm snd_acp_pci snd_hwdep btmtk videodev snd_acp_legacy_common libarc4 snd_seq snd_pci_acp6x bluetooth snd_pci_acp5x >
Jul 22 19:09:38 arrakeen kernel: CPU: 7 PID: 1085 Comm: kworker/u64:10 Tainted: G        W          6.9.10-200.fc40.x86_64 #1
Jul 22 19:09:38 arrakeen kernel: Hardware name: LENOVO 21F8CTO1WW/21F8CTO1WW, BIOS R2EET38W (1.19 ) 03/15/2024
Jul 22 19:09:38 arrakeen kernel: Workqueue: amdgpu-reset-dev drm_sched_job_timedout [gpu_sched]
Jul 22 19:09:38 arrakeen kernel: RIP: 0010:amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel: Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 fa 22 b4 c2 e9 1a fd ff ff <0f> 0b b8 ea ff ff ff e9 e9 22 b4 c2 b8 ea ff ff ff e9 df 22 b4 c2
Jul 22 19:09:38 arrakeen kernel: RSP: 0018:ffffaf548089fca8 EFLAGS: 00010246
Jul 22 19:09:38 arrakeen kernel: RAX: ffff90a281ffb650 RBX: ffff90a29c680000 RCX: 0000000000000000
Jul 22 19:09:38 arrakeen kernel: RDX: 0000000000000000 RSI: ffff90a29c6a54b8 RDI: ffff90a29c680000
Jul 22 19:09:38 arrakeen kernel: RBP: ffff90a29c6c4930 R08: 0017ffffc0000000 R09: 0000000000000000
Jul 22 19:09:38 arrakeen kernel: R10: ffffaf548089fc58 R11: 0000000000000000 R12: 0000000000000006
Jul 22 19:09:38 arrakeen kernel: R13: ffff90a29c680000 R14: 0000000000000006 R15: ffffffffc13ce690
Jul 22 19:09:38 arrakeen kernel: FS:  0000000000000000(0000) GS:ffff90a9bed80000(0000) knlGS:0000000000000000
Jul 22 19:09:38 arrakeen kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jul 22 19:09:38 arrakeen kernel: CR2: 00007ff432333018 CR3: 0000000491428000 CR4: 0000000000f50ef0
Jul 22 19:09:38 arrakeen kernel: PKRU: 55555554
Jul 22 19:09:38 arrakeen kernel: Call Trace:
Jul 22 19:09:38 arrakeen kernel:  <TASK>
Jul 22 19:09:38 arrakeen kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  ? __warn.cold+0x8e/0xe8
Jul 22 19:09:38 arrakeen kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  ? report_bug+0xff/0x140
Jul 22 19:09:38 arrakeen kernel:  ? handle_bug+0x3c/0x80
Jul 22 19:09:38 arrakeen kernel:  ? exc_invalid_op+0x17/0x70
Jul 22 19:09:38 arrakeen kernel:  ? asm_exc_invalid_op+0x1a/0x20
Jul 22 19:09:38 arrakeen kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  gfx_v11_0_hw_fini+0x1b/0xf0 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  gfx_v11_0_suspend+0xe/0x20 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  amdgpu_device_ip_suspend_phase2+0x141/0x5d0 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  ? amdgpu_device_ip_suspend_phase1+0x9a/0x180 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  amdgpu_device_ip_suspend+0x40/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  amdgpu_device_pre_asic_reset+0xcd/0x420 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  amdgpu_device_gpu_recover.cold+0x475/0xb44 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  amdgpu_job_timedout+0x18e/0x1d0 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  drm_sched_job_timedout+0x73/0x100 [gpu_sched]
Jul 22 19:09:38 arrakeen kernel:  process_one_work+0x17b/0x340
Jul 22 19:09:38 arrakeen kernel:  worker_thread+0x278/0x3b0
Jul 22 19:09:38 arrakeen kernel:  ? __pfx_worker_thread+0x10/0x10
Jul 22 19:09:38 arrakeen kernel:  kthread+0xcf/0x100
Jul 22 19:09:38 arrakeen kernel:  ? __pfx_kthread+0x10/0x10
Jul 22 19:09:38 arrakeen kernel:  ret_from_fork+0x31/0x50
Jul 22 19:09:38 arrakeen kernel:  ? __pfx_kthread+0x10/0x10
Jul 22 19:09:38 arrakeen kernel:  ret_from_fork_asm+0x1a/0x30
Jul 22 19:09:38 arrakeen kernel:  </TASK>
Jul 22 19:09:38 arrakeen kernel: ---[ end trace 0000000000000000 ]---
Jul 22 19:09:38 arrakeen kernel: ------------[ cut here ]------------
Jul 22 19:09:38 arrakeen kernel: WARNING: CPU: 7 PID: 1085 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:630 amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel: Modules linked in: uinput michael_mic snd_seq_dummy snd_hrtimer rfcomm qrtr_mhi nf_conntrack_netbios_ns nf_conntrack_broadcast nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4>
Jul 22 19:09:38 arrakeen kernel:  videobuf2_memops videobuf2_v4l2 btbcm snd_rpl_pci_acp6x snd_hda_core videobuf2_common kvm snd_acp_pci snd_hwdep btmtk videodev snd_acp_legacy_common libarc4 snd_seq snd_pci_acp6x bluetooth snd_pci_acp5x >
Jul 22 19:09:38 arrakeen kernel: CPU: 7 PID: 1085 Comm: kworker/u64:10 Tainted: G        W          6.9.10-200.fc40.x86_64 #1
Jul 22 19:09:38 arrakeen kernel: Hardware name: LENOVO 21F8CTO1WW/21F8CTO1WW, BIOS R2EET38W (1.19 ) 03/15/2024
Jul 22 19:09:38 arrakeen kernel: Workqueue: amdgpu-reset-dev drm_sched_job_timedout [gpu_sched]
Jul 22 19:09:38 arrakeen kernel: RIP: 0010:amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel: Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 fa 22 b4 c2 e9 1a fd ff ff <0f> 0b b8 ea ff ff ff e9 e9 22 b4 c2 b8 ea ff ff ff e9 df 22 b4 c2
Jul 22 19:09:38 arrakeen kernel: RSP: 0018:ffffaf548089fca8 EFLAGS: 00010246
Jul 22 19:09:38 arrakeen kernel: RAX: ffff90a281ffbfc8 RBX: ffff90a29c680000 RCX: 0000000000000000
Jul 22 19:09:38 arrakeen kernel: RDX: 0000000000000000 RSI: ffff90a29c6a54d0 RDI: ffff90a29c680000
Jul 22 19:09:38 arrakeen kernel: RBP: ffff90a29c6c4930 R08: 0017ffffc0000000 R09: 0000000000000000
Jul 22 19:09:38 arrakeen kernel: R10: ffffaf548089fc58 R11: 0000000000000000 R12: 0000000000000006
Jul 22 19:09:38 arrakeen kernel: R13: ffff90a29c680000 R14: 0000000000000006 R15: ffffffffc13ce690
Jul 22 19:09:38 arrakeen kernel: FS:  0000000000000000(0000) GS:ffff90a9bed80000(0000) knlGS:0000000000000000
Jul 22 19:09:38 arrakeen kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jul 22 19:09:38 arrakeen kernel: CR2: 00007ff432333018 CR3: 0000000491428000 CR4: 0000000000f50ef0
Jul 22 19:09:38 arrakeen kernel: PKRU: 55555554
Jul 22 19:09:38 arrakeen kernel: Call Trace:
Jul 22 19:09:38 arrakeen kernel:  <TASK>
Jul 22 19:09:38 arrakeen kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  ? __warn.cold+0x8e/0xe8
Jul 22 19:09:38 arrakeen kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  ? report_bug+0xff/0x140
Jul 22 19:09:38 arrakeen kernel:  ? handle_bug+0x3c/0x80
Jul 22 19:09:38 arrakeen kernel:  ? exc_invalid_op+0x17/0x70
Jul 22 19:09:38 arrakeen kernel:  ? asm_exc_invalid_op+0x1a/0x20
Jul 22 19:09:38 arrakeen kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
Jul 22 19:09:38 arrakeen kernel:  gfx_v11_0_hw_fini+0x2c/0xf0 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  gfx_v11_0_suspend+0xe/0x20 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  amdgpu_device_ip_suspend_phase2+0x141/0x5d0 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  ? amdgpu_device_ip_suspend_phase1+0x9a/0x180 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  amdgpu_device_ip_suspend+0x40/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  amdgpu_device_pre_asic_reset+0xcd/0x420 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  amdgpu_device_gpu_recover.cold+0x475/0xb44 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  amdgpu_job_timedout+0x18e/0x1d0 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  drm_sched_job_timedout+0x73/0x100 [gpu_sched]
Jul 22 19:09:38 arrakeen kernel:  process_one_work+0x17b/0x340
Jul 22 19:09:38 arrakeen kernel:  worker_thread+0x278/0x3b0
Jul 22 19:09:38 arrakeen kernel:  ? __pfx_worker_thread+0x10/0x10
Jul 22 19:09:38 arrakeen kernel:  kthread+0xcf/0x100
Jul 22 19:09:38 arrakeen kernel:  ? __pfx_kthread+0x10/0x10
Jul 22 19:09:38 arrakeen kernel:  ret_from_fork+0x31/0x50
Jul 22 19:09:38 arrakeen kernel:  ? __pfx_kthread+0x10/0x10
Jul 22 19:09:38 arrakeen kernel:  ret_from_fork_asm+0x1a/0x30
Jul 22 19:09:38 arrakeen kernel:  </TASK>
Jul 22 19:09:38 arrakeen kernel: ---[ end trace 0000000000000000 ]---
Jul 22 19:09:38 arrakeen kernel: ------------[ cut here ]------------
Jul 22 19:09:39 arrakeen kernel: WARNING: CPU: 8 PID: 1085 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:630 amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:39 arrakeen kernel: Modules linked in: uinput michael_mic snd_seq_dummy snd_hrtimer rfcomm qrtr_mhi nf_conntrack_netbios_ns nf_conntrack_broadcast nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4>
Jul 22 19:09:39 arrakeen kernel:  videobuf2_memops videobuf2_v4l2 btbcm snd_rpl_pci_acp6x snd_hda_core videobuf2_common kvm snd_acp_pci snd_hwdep btmtk videodev snd_acp_legacy_common libarc4 snd_seq snd_pci_acp6x bluetooth snd_pci_acp5x >
Jul 22 19:09:39 arrakeen kernel: CPU: 8 PID: 1085 Comm: kworker/u64:10 Tainted: G        W          6.9.10-200.fc40.x86_64 #1
Jul 22 19:09:39 arrakeen kernel: Hardware name: LENOVO 21F8CTO1WW/21F8CTO1WW, BIOS R2EET38W (1.19 ) 03/15/2024
Jul 22 19:09:39 arrakeen kernel: Workqueue: amdgpu-reset-dev drm_sched_job_timedout [gpu_sched]
Jul 22 19:09:39 arrakeen kernel: RIP: 0010:amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:39 arrakeen kernel: Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 fa 22 b4 c2 e9 1a fd ff ff <0f> 0b b8 ea ff ff ff e9 e9 22 b4 c2 b8 ea ff ff ff e9 df 22 b4 c2
Jul 22 19:09:39 arrakeen kernel: RSP: 0018:ffffaf548089fca8 EFLAGS: 00010246
Jul 22 19:09:39 arrakeen kernel: RAX: ffff90a281ffbd20 RBX: ffff90a29c680000 RCX: 0000000000000000
Jul 22 19:09:39 arrakeen kernel: RDX: 0000000000000000 RSI: ffff90a29c680c58 RDI: ffff90a29c680000
Jul 22 19:09:39 arrakeen kernel: RBP: ffff90a29c6c48e0 R08: 0000000000000001 R09: 0000000000000000
Jul 22 19:09:39 arrakeen kernel: R10: 0000000000000100 R11: 0000000000000000 R12: 0000000000000001
Jul 22 19:09:39 arrakeen kernel: R13: ffff90a29c680000 R14: 0000000000000001 R15: ffffffffc139f350
Jul 22 19:09:39 arrakeen kernel: FS:  0000000000000000(0000) GS:ffff90a9bee00000(0000) knlGS:0000000000000000
Jul 22 19:09:39 arrakeen kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jul 22 19:09:39 arrakeen kernel: CR2: 00007f5b89ba2808 CR3: 0000000491428000 CR4: 0000000000f50ef0
Jul 22 19:09:39 arrakeen kernel: PKRU: 55555554
Jul 22 19:09:39 arrakeen kernel: Call Trace:
Jul 22 19:09:39 arrakeen kernel:  <TASK>
Jul 22 19:09:39 arrakeen kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  ? __warn.cold+0x8e/0xe8
Jul 22 19:09:39 arrakeen kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  ? report_bug+0xff/0x140
Jul 22 19:09:39 arrakeen kernel:  ? handle_bug+0x3c/0x80
Jul 22 19:09:39 arrakeen kernel:  ? exc_invalid_op+0x17/0x70
Jul 22 19:09:39 arrakeen kernel:  ? asm_exc_invalid_op+0x1a/0x20
Jul 22 19:09:39 arrakeen kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  gmc_v11_0_hw_fini+0x24/0x90 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  gmc_v11_0_suspend+0xe/0x20 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  amdgpu_device_ip_suspend_phase2+0x141/0x5d0 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  ? amdgpu_device_ip_suspend_phase1+0x9a/0x180 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  amdgpu_device_ip_suspend+0x40/0x70 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  amdgpu_device_pre_asic_reset+0xcd/0x420 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  amdgpu_device_gpu_recover.cold+0x475/0xb44 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  amdgpu_job_timedout+0x18e/0x1d0 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  drm_sched_job_timedout+0x73/0x100 [gpu_sched]
Jul 22 19:09:39 arrakeen kernel:  process_one_work+0x17b/0x340
Jul 22 19:09:39 arrakeen kernel:  worker_thread+0x278/0x3b0
Jul 22 19:09:39 arrakeen kernel:  ? __pfx_worker_thread+0x10/0x10
Jul 22 19:09:39 arrakeen kernel:  kthread+0xcf/0x100
Jul 22 19:09:39 arrakeen kernel:  ? __pfx_kthread+0x10/0x10
Jul 22 19:09:39 arrakeen kernel:  ret_from_fork+0x31/0x50
Jul 22 19:09:39 arrakeen kernel:  ? __pfx_kthread+0x10/0x10
Jul 22 19:09:39 arrakeen kernel:  ret_from_fork_asm+0x1a/0x30
Jul 22 19:09:39 arrakeen kernel:  </TASK>
Jul 22 19:09:39 arrakeen kernel: ---[ end trace 0000000000000000 ]---
Jul 22 19:09:39 arrakeen kernel: amdgpu 0000:c3:00.0: amdgpu: MODE2 reset
Jul 22 19:09:39 arrakeen kernel: amdgpu 0000:c3:00.0: amdgpu: GPU reset succeeded, trying to resume
Jul 22 19:09:40 arrakeen abrt-dump-journal-oops[1727]: abrt-dump-journal-oops: Found oopses: 3
Jul 22 19:09:40 arrakeen abrt-dump-journal-oops[1727]: abrt-dump-journal-oops: Creating problem directories
Jul 22 19:09:40 arrakeen abrt-server[5910]: Package 'kernel-core' isn't signed with proper key
Jul 22 19:09:40 arrakeen abrt-server[5910]: 'post-create' on '/var/spool/abrt/oops-2024-07-22-19:09:40-1727-0' exited with 1
Jul 22 19:09:40 arrakeen abrt-server[5910]: Deleting problem directory '/var/spool/abrt/oops-2024-07-22-19:09:40-1727-0'
Jul 22 19:09:41 arrakeen abrt-server[5913]: Package 'kernel-core' isn't signed with proper key
Jul 22 19:09:41 arrakeen abrt-server[5913]: 'post-create' on '/var/spool/abrt/oops-2024-07-22-19:09:40-1727-1' exited with 1
Jul 22 19:09:41 arrakeen abrt-server[5913]: Deleting problem directory '/var/spool/abrt/oops-2024-07-22-19:09:40-1727-1'
Jul 22 19:09:42 arrakeen abrt-server[5916]: Package 'kernel-core' isn't signed with proper key
Jul 22 19:09:42 arrakeen abrt-server[5916]: 'post-create' on '/var/spool/abrt/oops-2024-07-22-19:09:40-1727-2' exited with 1
Jul 22 19:09:42 arrakeen abrt-server[5916]: Deleting problem directory '/var/spool/abrt/oops-2024-07-22-19:09:40-1727-2'
Jul 22 19:09:43 arrakeen abrt-dump-journal-oops[1727]: Reported 3 kernel oopses to Abrt

Comment 3 Ferry Huberts 2024-07-23 06:28:25 UTC
And another one.
Hope that is enough.

Jul 22 19:09:38 arrakeen kernel: [drm] Fence fallback timer expired on ring sdma0
Jul 22 19:09:38 arrakeen kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring vcn_unified_0 timeout, signaled seq=191, emitted seq=191
Jul 22 19:09:38 arrakeen kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process RDD Process pid 4134 thread firefox:cs0 pid 4315
Jul 22 19:09:38 arrakeen kernel: amdgpu 0000:c3:00.0: amdgpu: GPU reset begin!
Jul 22 19:09:38 arrakeen kernel: ------------[ cut here ]------------
Jul 22 19:09:38 arrakeen kernel: WARNING: CPU: 7 PID: 1085 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:630 amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel: Modules linked in: uinput michael_mic snd_seq_dummy snd_hrtimer rfcomm qrtr_mhi nf_conntrack_netbios_ns nf_conntrack_broadcast nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4>
Jul 22 19:09:38 arrakeen kernel:  videobuf2_memops videobuf2_v4l2 btbcm snd_rpl_pci_acp6x snd_hda_core videobuf2_common kvm snd_acp_pci snd_hwdep btmtk videodev snd_acp_legacy_common libarc4 snd_seq snd_pci_acp6x bluetooth snd_pci_acp5x >
Jul 22 19:09:38 arrakeen kernel: CPU: 7 PID: 1085 Comm: kworker/u64:10 Tainted: G        W          6.9.10-200.fc40.x86_64 #1
Jul 22 19:09:38 arrakeen kernel: Hardware name: LENOVO 21F8CTO1WW/21F8CTO1WW, BIOS R2EET38W (1.19 ) 03/15/2024
Jul 22 19:09:38 arrakeen kernel: Workqueue: amdgpu-reset-dev drm_sched_job_timedout [gpu_sched]
Jul 22 19:09:38 arrakeen kernel: RIP: 0010:amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel: Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 fa 22 b4 c2 e9 1a fd ff ff <0f> 0b b8 ea ff ff ff e9 e9 22 b4 c2 b8 ea ff ff ff e9 df 22 b4 c2
Jul 22 19:09:38 arrakeen kernel: RSP: 0018:ffffaf548089fca8 EFLAGS: 00010246
Jul 22 19:09:38 arrakeen kernel: RAX: ffff90a281ffb650 RBX: ffff90a29c680000 RCX: 0000000000000000
Jul 22 19:09:38 arrakeen kernel: RDX: 0000000000000000 RSI: ffff90a29c6a54b8 RDI: ffff90a29c680000
Jul 22 19:09:38 arrakeen kernel: RBP: ffff90a29c6c4930 R08: 0017ffffc0000000 R09: 0000000000000000
Jul 22 19:09:38 arrakeen kernel: R10: ffffaf548089fc58 R11: 0000000000000000 R12: 0000000000000006
Jul 22 19:09:38 arrakeen kernel: R13: ffff90a29c680000 R14: 0000000000000006 R15: ffffffffc13ce690
Jul 22 19:09:38 arrakeen kernel: FS:  0000000000000000(0000) GS:ffff90a9bed80000(0000) knlGS:0000000000000000
Jul 22 19:09:38 arrakeen kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jul 22 19:09:38 arrakeen kernel: CR2: 00007ff432333018 CR3: 0000000491428000 CR4: 0000000000f50ef0
Jul 22 19:09:38 arrakeen kernel: PKRU: 55555554
Jul 22 19:09:38 arrakeen kernel: Call Trace:
Jul 22 19:09:38 arrakeen kernel:  <TASK>
Jul 22 19:09:38 arrakeen kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  ? __warn.cold+0x8e/0xe8
Jul 22 19:09:38 arrakeen kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  ? report_bug+0xff/0x140
Jul 22 19:09:38 arrakeen kernel:  ? handle_bug+0x3c/0x80
Jul 22 19:09:38 arrakeen kernel:  ? exc_invalid_op+0x17/0x70
Jul 22 19:09:38 arrakeen kernel:  ? asm_exc_invalid_op+0x1a/0x20
Jul 22 19:09:38 arrakeen kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  gfx_v11_0_hw_fini+0x1b/0xf0 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  gfx_v11_0_suspend+0xe/0x20 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  amdgpu_device_ip_suspend_phase2+0x141/0x5d0 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  ? amdgpu_device_ip_suspend_phase1+0x9a/0x180 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  amdgpu_device_ip_suspend+0x40/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  amdgpu_device_pre_asic_reset+0xcd/0x420 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  amdgpu_device_gpu_recover.cold+0x475/0xb44 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  amdgpu_job_timedout+0x18e/0x1d0 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  drm_sched_job_timedout+0x73/0x100 [gpu_sched]
Jul 22 19:09:38 arrakeen kernel:  process_one_work+0x17b/0x340
Jul 22 19:09:38 arrakeen kernel:  worker_thread+0x278/0x3b0
Jul 22 19:09:38 arrakeen kernel:  ? __pfx_worker_thread+0x10/0x10
Jul 22 19:09:38 arrakeen kernel:  kthread+0xcf/0x100
Jul 22 19:09:38 arrakeen kernel:  ? __pfx_kthread+0x10/0x10
Jul 22 19:09:38 arrakeen kernel:  ret_from_fork+0x31/0x50
Jul 22 19:09:38 arrakeen kernel:  ? __pfx_kthread+0x10/0x10
Jul 22 19:09:38 arrakeen kernel:  ret_from_fork_asm+0x1a/0x30
Jul 22 19:09:38 arrakeen kernel:  </TASK>
Jul 22 19:09:38 arrakeen kernel: ---[ end trace 0000000000000000 ]---
Jul 22 19:09:38 arrakeen kernel: ------------[ cut here ]------------
Jul 22 19:09:38 arrakeen kernel: WARNING: CPU: 7 PID: 1085 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:630 amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel: Modules linked in: uinput michael_mic snd_seq_dummy snd_hrtimer rfcomm qrtr_mhi nf_conntrack_netbios_ns nf_conntrack_broadcast nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4>
Jul 22 19:09:38 arrakeen kernel:  videobuf2_memops videobuf2_v4l2 btbcm snd_rpl_pci_acp6x snd_hda_core videobuf2_common kvm snd_acp_pci snd_hwdep btmtk videodev snd_acp_legacy_common libarc4 snd_seq snd_pci_acp6x bluetooth snd_pci_acp5x >
Jul 22 19:09:38 arrakeen kernel: CPU: 7 PID: 1085 Comm: kworker/u64:10 Tainted: G        W          6.9.10-200.fc40.x86_64 #1
Jul 22 19:09:38 arrakeen kernel: Hardware name: LENOVO 21F8CTO1WW/21F8CTO1WW, BIOS R2EET38W (1.19 ) 03/15/2024
Jul 22 19:09:38 arrakeen kernel: Workqueue: amdgpu-reset-dev drm_sched_job_timedout [gpu_sched]
Jul 22 19:09:38 arrakeen kernel: RIP: 0010:amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel: Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 fa 22 b4 c2 e9 1a fd ff ff <0f> 0b b8 ea ff ff ff e9 e9 22 b4 c2 b8 ea ff ff ff e9 df 22 b4 c2
Jul 22 19:09:38 arrakeen kernel: RSP: 0018:ffffaf548089fca8 EFLAGS: 00010246
Jul 22 19:09:38 arrakeen kernel: RAX: ffff90a281ffbfc8 RBX: ffff90a29c680000 RCX: 0000000000000000
Jul 22 19:09:38 arrakeen kernel: RDX: 0000000000000000 RSI: ffff90a29c6a54d0 RDI: ffff90a29c680000
Jul 22 19:09:38 arrakeen kernel: RBP: ffff90a29c6c4930 R08: 0017ffffc0000000 R09: 0000000000000000
Jul 22 19:09:38 arrakeen kernel: R10: ffffaf548089fc58 R11: 0000000000000000 R12: 0000000000000006
Jul 22 19:09:38 arrakeen kernel: R13: ffff90a29c680000 R14: 0000000000000006 R15: ffffffffc13ce690
Jul 22 19:09:38 arrakeen kernel: FS:  0000000000000000(0000) GS:ffff90a9bed80000(0000) knlGS:0000000000000000
Jul 22 19:09:38 arrakeen kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jul 22 19:09:38 arrakeen kernel: CR2: 00007ff432333018 CR3: 0000000491428000 CR4: 0000000000f50ef0
Jul 22 19:09:38 arrakeen kernel: PKRU: 55555554
Jul 22 19:09:38 arrakeen kernel: Call Trace:
Jul 22 19:09:38 arrakeen kernel:  <TASK>
Jul 22 19:09:38 arrakeen kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  ? __warn.cold+0x8e/0xe8
Jul 22 19:09:38 arrakeen kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  ? report_bug+0xff/0x140
Jul 22 19:09:38 arrakeen kernel:  ? handle_bug+0x3c/0x80
Jul 22 19:09:38 arrakeen kernel:  ? exc_invalid_op+0x17/0x70
Jul 22 19:09:38 arrakeen kernel:  ? asm_exc_invalid_op+0x1a/0x20
Jul 22 19:09:38 arrakeen kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
Jul 22 19:09:38 arrakeen kernel:  gfx_v11_0_hw_fini+0x2c/0xf0 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  gfx_v11_0_suspend+0xe/0x20 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  amdgpu_device_ip_suspend_phase2+0x141/0x5d0 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  ? amdgpu_device_ip_suspend_phase1+0x9a/0x180 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  amdgpu_device_ip_suspend+0x40/0x70 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  amdgpu_device_pre_asic_reset+0xcd/0x420 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  amdgpu_device_gpu_recover.cold+0x475/0xb44 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  amdgpu_job_timedout+0x18e/0x1d0 [amdgpu]
Jul 22 19:09:38 arrakeen kernel:  drm_sched_job_timedout+0x73/0x100 [gpu_sched]
Jul 22 19:09:38 arrakeen kernel:  process_one_work+0x17b/0x340
Jul 22 19:09:38 arrakeen kernel:  worker_thread+0x278/0x3b0
Jul 22 19:09:38 arrakeen kernel:  ? __pfx_worker_thread+0x10/0x10
Jul 22 19:09:38 arrakeen kernel:  kthread+0xcf/0x100
Jul 22 19:09:38 arrakeen kernel:  ? __pfx_kthread+0x10/0x10
Jul 22 19:09:38 arrakeen kernel:  ret_from_fork+0x31/0x50
Jul 22 19:09:38 arrakeen kernel:  ? __pfx_kthread+0x10/0x10
Jul 22 19:09:38 arrakeen kernel:  ret_from_fork_asm+0x1a/0x30
Jul 22 19:09:38 arrakeen kernel:  </TASK>
Jul 22 19:09:38 arrakeen kernel: ---[ end trace 0000000000000000 ]---
Jul 22 19:09:38 arrakeen kernel: ------------[ cut here ]------------
Jul 22 19:09:39 arrakeen kernel: WARNING: CPU: 8 PID: 1085 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:630 amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:39 arrakeen kernel: Modules linked in: uinput michael_mic snd_seq_dummy snd_hrtimer rfcomm qrtr_mhi nf_conntrack_netbios_ns nf_conntrack_broadcast nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4>
Jul 22 19:09:39 arrakeen kernel:  videobuf2_memops videobuf2_v4l2 btbcm snd_rpl_pci_acp6x snd_hda_core videobuf2_common kvm snd_acp_pci snd_hwdep btmtk videodev snd_acp_legacy_common libarc4 snd_seq snd_pci_acp6x bluetooth snd_pci_acp5x >
Jul 22 19:09:39 arrakeen kernel: CPU: 8 PID: 1085 Comm: kworker/u64:10 Tainted: G        W          6.9.10-200.fc40.x86_64 #1
Jul 22 19:09:39 arrakeen kernel: Hardware name: LENOVO 21F8CTO1WW/21F8CTO1WW, BIOS R2EET38W (1.19 ) 03/15/2024
Jul 22 19:09:39 arrakeen kernel: Workqueue: amdgpu-reset-dev drm_sched_job_timedout [gpu_sched]
Jul 22 19:09:39 arrakeen kernel: RIP: 0010:amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:39 arrakeen kernel: Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 fa 22 b4 c2 e9 1a fd ff ff <0f> 0b b8 ea ff ff ff e9 e9 22 b4 c2 b8 ea ff ff ff e9 df 22 b4 c2
Jul 22 19:09:39 arrakeen kernel: RSP: 0018:ffffaf548089fca8 EFLAGS: 00010246
Jul 22 19:09:39 arrakeen kernel: RAX: ffff90a281ffbd20 RBX: ffff90a29c680000 RCX: 0000000000000000
Jul 22 19:09:39 arrakeen kernel: RDX: 0000000000000000 RSI: ffff90a29c680c58 RDI: ffff90a29c680000
Jul 22 19:09:39 arrakeen kernel: RBP: ffff90a29c6c48e0 R08: 0000000000000001 R09: 0000000000000000
Jul 22 19:09:39 arrakeen kernel: R10: 0000000000000100 R11: 0000000000000000 R12: 0000000000000001
Jul 22 19:09:39 arrakeen kernel: R13: ffff90a29c680000 R14: 0000000000000001 R15: ffffffffc139f350
Jul 22 19:09:39 arrakeen kernel: FS:  0000000000000000(0000) GS:ffff90a9bee00000(0000) knlGS:0000000000000000
Jul 22 19:09:39 arrakeen kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jul 22 19:09:39 arrakeen kernel: CR2: 00007f5b89ba2808 CR3: 0000000491428000 CR4: 0000000000f50ef0
Jul 22 19:09:39 arrakeen kernel: PKRU: 55555554
Jul 22 19:09:39 arrakeen kernel: Call Trace:
Jul 22 19:09:39 arrakeen kernel:  <TASK>
Jul 22 19:09:39 arrakeen kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  ? __warn.cold+0x8e/0xe8
Jul 22 19:09:39 arrakeen kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  ? report_bug+0xff/0x140
Jul 22 19:09:39 arrakeen kernel:  ? handle_bug+0x3c/0x80
Jul 22 19:09:39 arrakeen kernel:  ? exc_invalid_op+0x17/0x70
Jul 22 19:09:39 arrakeen kernel:  ? asm_exc_invalid_op+0x1a/0x20
Jul 22 19:09:39 arrakeen kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  gmc_v11_0_hw_fini+0x24/0x90 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  gmc_v11_0_suspend+0xe/0x20 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  amdgpu_device_ip_suspend_phase2+0x141/0x5d0 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  ? amdgpu_device_ip_suspend_phase1+0x9a/0x180 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  amdgpu_device_ip_suspend+0x40/0x70 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  amdgpu_device_pre_asic_reset+0xcd/0x420 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  amdgpu_device_gpu_recover.cold+0x475/0xb44 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  amdgpu_job_timedout+0x18e/0x1d0 [amdgpu]
Jul 22 19:09:39 arrakeen kernel:  drm_sched_job_timedout+0x73/0x100 [gpu_sched]
Jul 22 19:09:39 arrakeen kernel:  process_one_work+0x17b/0x340
Jul 22 19:09:39 arrakeen kernel:  worker_thread+0x278/0x3b0
Jul 22 19:09:39 arrakeen kernel:  ? __pfx_worker_thread+0x10/0x10
Jul 22 19:09:39 arrakeen kernel:  kthread+0xcf/0x100
Jul 22 19:09:39 arrakeen kernel:  ? __pfx_kthread+0x10/0x10
Jul 22 19:09:39 arrakeen kernel:  ret_from_fork+0x31/0x50
Jul 22 19:09:39 arrakeen kernel:  ? __pfx_kthread+0x10/0x10
Jul 22 19:09:39 arrakeen kernel:  ret_from_fork_asm+0x1a/0x30
Jul 22 19:09:39 arrakeen kernel:  </TASK>
Jul 22 19:09:39 arrakeen kernel: ---[ end trace 0000000000000000 ]---
Jul 22 19:09:39 arrakeen kernel: amdgpu 0000:c3:00.0: amdgpu: MODE2 reset
Jul 22 19:09:39 arrakeen kernel: amdgpu 0000:c3:00.0: amdgpu: GPU reset succeeded, trying to resume
Jul 22 19:09:40 arrakeen abrt-dump-journal-oops[1727]: abrt-dump-journal-oops: Found oopses: 3
Jul 22 19:09:40 arrakeen abrt-dump-journal-oops[1727]: abrt-dump-journal-oops: Creating problem directories
Jul 22 19:09:40 arrakeen abrt-server[5910]: Package 'kernel-core' isn't signed with proper key
Jul 22 19:09:40 arrakeen abrt-server[5910]: 'post-create' on '/var/spool/abrt/oops-2024-07-22-19:09:40-1727-0' exited with 1
Jul 22 19:09:40 arrakeen abrt-server[5910]: Deleting problem directory '/var/spool/abrt/oops-2024-07-22-19:09:40-1727-0'
Jul 22 19:09:41 arrakeen abrt-server[5913]: Package 'kernel-core' isn't signed with proper key
Jul 22 19:09:41 arrakeen abrt-server[5913]: 'post-create' on '/var/spool/abrt/oops-2024-07-22-19:09:40-1727-1' exited with 1
Jul 22 19:09:41 arrakeen abrt-server[5913]: Deleting problem directory '/var/spool/abrt/oops-2024-07-22-19:09:40-1727-1'
Jul 22 19:09:42 arrakeen abrt-server[5916]: Package 'kernel-core' isn't signed with proper key
Jul 22 19:09:42 arrakeen abrt-server[5916]: 'post-create' on '/var/spool/abrt/oops-2024-07-22-19:09:40-1727-2' exited with 1
Jul 22 19:09:42 arrakeen abrt-server[5916]: Deleting problem directory '/var/spool/abrt/oops-2024-07-22-19:09:40-1727-2'
Jul 22 19:09:43 arrakeen abrt-dump-journal-oops[1727]: Reported 3 kernel oopses to Abrt
Jul 22 19:09:48 arrakeen geoclue[2223]: Service not used for 60 seconds. Shutting down..
Jul 22 19:09:48 arrakeen systemd[1]: geoclue.service: Deactivated successfully.
Jul 22 19:09:48 arrakeen audit[1]: SERVICE_STOP pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=geoclue comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
Jul 22 19:09:49 arrakeen realmd[2266]: quitting realmd service after timeout
Jul 22 19:09:49 arrakeen realmd[2266]: stopping service
Jul 22 19:09:49 arrakeen systemd[1]: realmd.service: Deactivated successfully.
Jul 22 19:09:49 arrakeen audit[1]: SERVICE_STOP pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=realmd comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
Jul 22 19:09:52 arrakeen kernel: amdgpu 0000:c3:00.0: amdgpu: failed to write reg 1a774 wait reg 1a786
Jul 22 19:09:52 arrakeen kernel: [drm] PCIE GART of 512M enabled (table at 0x0000008000900000).
Jul 22 19:09:52 arrakeen kernel: amdgpu 0000:c3:00.0: amdgpu: SMU is resuming...
Jul 22 19:09:52 arrakeen kernel: amdgpu 0000:c3:00.0: amdgpu: SMU is resumed successfully!
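
For anyone triaging a similar journal dump, a rough sketch of how the key events in the log above could be pulled out: ring timeouts, GPU reset starts, and the source location of each kernel WARNING. The regular expressions are only assumed to match the line formats pasted in this report; the function name `summarize` and the sample list are hypothetical and not part of any amdgpu or abrt tool.

```python
import re

# Patterns are assumptions based only on the log lines quoted in this report.
TIMEOUT_RE = re.compile(
    r"\*ERROR\* ring (?P<ring>\S+) timeout, "
    r"signaled seq=(?P<signaled>\d+), emitted seq=(?P<emitted>\d+)"
)
WARNING_RE = re.compile(
    r"WARNING: CPU: \d+ PID: \d+ at (?P<where>\S+) (?P<func>\S+)"
)


def summarize(lines):
    """Collect ring timeouts, WARNING locations and reset counts from journal lines."""
    timeouts = []   # (ring name, signaled seq, emitted seq)
    warnings = {}   # "file:line" -> number of occurrences
    resets = 0      # number of "GPU reset begin!" messages
    for line in lines:
        match = TIMEOUT_RE.search(line)
        if match:
            timeouts.append(
                (match["ring"], int(match["signaled"]), int(match["emitted"]))
            )
            continue
        match = WARNING_RE.search(line)
        if match:
            where = match["where"]
            warnings[where] = warnings.get(where, 0) + 1
            continue
        if "GPU reset begin!" in line:
            resets += 1
    return {"timeouts": timeouts, "warnings": warnings, "resets": resets}


# Three lines copied from the journal output above.
sample = [
    "Jul 22 19:09:38 arrakeen kernel: [drm:amdgpu_job_timedout [amdgpu]] "
    "*ERROR* ring vcn_unified_0 timeout, signaled seq=191, emitted seq=191",
    "Jul 22 19:09:38 arrakeen kernel: amdgpu 0000:c3:00.0: amdgpu: GPU reset begin!",
    "Jul 22 19:09:38 arrakeen kernel: WARNING: CPU: 7 PID: 1085 at "
    "drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:630 amdgpu_irq_put+0x46/0x70 [amdgpu]",
]

print(summarize(sample))
```

Fed the full journal, this would group the repeated amdgpu_irq_put warnings under one source location, which makes it easier to see that each reset produces the same set of warnings.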

Comment 4 José Expósito 2024-07-24 08:52:51 UTC
Could this be a duplicate of https://bugzilla.redhat.com/show_bug.cgi?id=2299031 ?

The solution proposed in the upstream bug report:
https://gitlab.freedesktop.org/mesa/mesa/-/issues/11533

is already applied in Fedora, in mesa-24.1.4-3.fc40:
https://bodhi.fedoraproject.org/updates/FEDORA-2024-face82e699
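
A minimal sketch for checking whether an installed mesa build is at least the 24.1.4-3.fc40 build mentioned above. It assumes the version-release string has already been obtained (for example from the package manager) and uses a deliberately simplified comparison, not the full RPM version-ordering algorithm. The helper names `nvr_key` and `has_fix` are hypothetical.

```python
import re


def nvr_key(version_release):
    """Turn '24.1.4-3.fc40' into ((24, 1, 4), 3) for a simple comparison.

    Simplification: only the dotted numeric version and the leading number
    of the release are compared; the dist tag (fc40, fc41) is ignored.
    """
    version, _, release = version_release.partition("-")
    ver = tuple(int(part) for part in version.split("."))
    rel_match = re.match(r"\d+", release)
    rel = int(rel_match.group()) if rel_match else 0
    return ver, rel


def has_fix(installed, fixed="24.1.4-3.fc40"):
    """Return True if `installed` is the same as or newer than `fixed`."""
    return nvr_key(installed) >= nvr_key(fixed)


print(has_fix("24.1.4-2.fc40"))  # the build reported as hanging
print(has_fix("24.1.4-3.fc40"))  # the build carrying the proposed fix
```

This only tells you whether the package is new enough to contain the change. Whether that change resolves the hangs reported here depends on this bug really being a duplicate of 2299031.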

Comment 5 Ferry Huberts 2024-07-24 09:39:33 UTC
Might be.

However, on my ThinkPad Z16 Gen 1 I see corrupted videos but no hangs.

Comment 6 Fedora Update System 2025-01-29 15:35:22 UTC
FEDORA-2025-a24369766c (mesa-24.3.4-3.fc41) has been submitted as an update to Fedora 41.
https://bodhi.fedoraproject.org/updates/FEDORA-2025-a24369766c

Comment 7 Fedora Update System 2025-01-31 03:08:24 UTC
FEDORA-2025-a24369766c (mesa-24.3.4-3.fc41) has been pushed to the Fedora 41 stable repository.
If the problem still persists, please make a note of it in this bug report.