Bug 2212984 - [abrt] amdgpu_irq_put: WARNING: CPU: 2 PID: 6365 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:599 amdgpu_irq_put+0x46/0x70 [amdgpu] [amdgpu]
Summary: [abrt] amdgpu_irq_put: WARNING: CPU: 2 PID: 6365 at drivers/gpu/drm/amd/amdgp...
Keywords:
Status: NEW
Alias: None
Product: Fedora
Classification: Fedora
Component: kernel
Version: 38
Hardware: x86_64
OS: Unspecified
unspecified
unspecified
Target Milestone: ---
Assignee: Kernel Maintainer List
QA Contact: Fedora Extras Quality Assurance
URL: https://retrace.fedoraproject.org/faf...
Whiteboard: abrt_hash:e2f422a933be7cca3f3fd7a030c...
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2023-06-06 19:36 UTC by bendem
Modified: 2023-07-05 21:00 UTC (History)
17 users (show)

Fixed In Version:
Doc Type: ---
Doc Text:
Clone Of:
Environment:
Last Closed:
Type: ---
Embargoed:


Attachments (Terms of Use)
File: dmesg (140.64 KB, text/plain)
2023-06-06 19:36 UTC, bendem
no flags Details

Description bendem 2023-06-06 19:36:30 UTC
Description of problem:
GPU crash while playing a full screen 1080p video and playing minecraft simultaneously.

Additional info:
reporter:       libreport-2.17.10
WARNING: CPU: 2 PID: 6365 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:599 amdgpu_irq_put+0x46/0x70 [amdgpu]
Modules linked in: uinput rfcomm snd_seq_dummy snd_hrtimer nf_conntrack_netbios_ns nf_conntrack_broadcast nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4 nf_reject_ipv6 nft_reject nft_ct nft_chain_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 ip_set nf_tables nfnetlink qrtr bnep sunrpc binfmt_misc vfat fat iwlmvm snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn snd_sof_amd_rembrandt snd_sof_amd_renoir snd_sof_amd_acp snd_sof_pci snd_ctl_led snd_sof_xtensa_dsp snd_sof snd_hda_codec_realtek mac80211 snd_sof_utils snd_hda_codec_generic snd_hda_codec_hdmi intel_rapl_msr snd_soc_core intel_rapl_common edac_mce_amd snd_hda_intel uvcvideo snd_compress snd_intel_dspcfg ac97_bus snd_intel_sdw_acpi snd_pcm_dmaengine kvm_amd snd_hda_codec uvc libarc4 videobuf2_vmalloc videobuf2_memops videobuf2_v4l2 snd_hda_core snd_pci_ps videobuf2_common snd_rpl_pci_acp6x snd_hwdep kvm btusb iwlwifi snd_pci_acp6x snd_seq snd_pci_acp5x btrtl btbcm btintel snd_seq_device snd_pcm btmtk irqbypass
 snd_rn_pci_acp3x tps6598x videodev thinkpad_acpi snd_acp_config think_lmi snd_timer snd_soc_acpi ledtrig_audio cfg80211 bluetooth firmware_attributes_class rapl pcspkr wmi_bmof platform_profile i2c_piix4 k10temp mc snd_pci_acp3x ipmi_devintf snd rfkill ipmi_msghandler soundcore serial_multi_instantiate i2c_scmi acpi_cpufreq joydev loop zram dm_crypt r8152 mii uas usb_storage hid_logitech_hidpp hid_logitech_dj amdgpu i2c_algo_bit drm_ttm_helper ttm crct10dif_pclmul crc32_pclmul iommu_v2 crc32c_intel rtsx_pci_sdmmc polyval_clmulni drm_buddy polyval_generic mmc_core gpu_sched nvme ghash_clmulni_intel drm_display_helper sha512_ssse3 nvme_core sp5100_tco ucsi_acpi ccp typec_ucsi rtsx_pci cec r8169 typec video nvme_common wmi serio_raw ip6_tables ip_tables fuse
CPU: 2 PID: 6365 Comm: kworker/u32:25 Not tainted 6.3.5-200.fc38.x86_64 #1
Hardware name: LENOVO 20UHCTO1WW/20UHCTO1WW, BIOS R1CET73W(1.42 ) 12/09/2022
Workqueue: amdgpu-reset-dev drm_sched_job_timedout [gpu_sched]
RIP: 0010:amdgpu_irq_put+0x46/0x70 [amdgpu]
Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 3f 9b 5a e1 e9 5a fd ff ff <0f> 0b b8 ea ff ff ff e9 2e 9b 5a e1 b8 ea ff ff ff e9 24 9b 5a e1
RSP: 0018:ffffb283cc107c90 EFLAGS: 00010246
RAX: ffff8951d6def238 RBX: ffff8951d6c80000 RCX: 0000000000000000
RDX: 0000000000000000 RSI: ffff8951d6c8bef0 RDI: ffff8951d6c80000
RBP: ffff8951d6c80000 R08: 000000000003ae80 R09: 0000000000000006
R10: ffffdbac53ffc008 R11: 0000000000000000 R12: 0000000000001050
R13: ffff8951d6c989a8 R14: ffff89557ed0d200 R15: 0000000000000000
FS:  0000000000000000(0000) GS:ffff8958af880000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 00007f7089544d70 CR3: 0000000510022000 CR4: 0000000000350ee0
Call Trace:
 <TASK>
 ? amdgpu_irq_put+0x46/0x70 [amdgpu]
 ? __warn+0x81/0x130
 ? amdgpu_irq_put+0x46/0x70 [amdgpu]
 ? report_bug+0x171/0x1a0
 ? handle_bug+0x3c/0x80
 ? exc_invalid_op+0x17/0x70
 ? asm_exc_invalid_op+0x1a/0x20
 ? amdgpu_irq_put+0x46/0x70 [amdgpu]
 gfx_v9_0_hw_fini+0x35/0x700 [amdgpu]
 amdgpu_device_ip_suspend_phase2+0x101/0x1a0 [amdgpu]
 ? amdgpu_device_ip_suspend_phase1+0x6f/0xe0 [amdgpu]
 amdgpu_device_ip_suspend+0x36/0x70 [amdgpu]
 amdgpu_device_pre_asic_reset+0xd3/0x2b0 [amdgpu]
 amdgpu_device_gpu_recover+0x4c7/0xd60 [amdgpu]
 amdgpu_job_timedout+0x18d/0x240 [amdgpu]
 drm_sched_job_timedout+0x7a/0x110 [gpu_sched]
 process_one_work+0x1c7/0x3d0
 worker_thread+0x51/0x390
 ? __pfx_worker_thread+0x10/0x10
 kthread+0xde/0x110
 ? __pfx_kthread+0x10/0x10
 ret_from_fork+0x2c/0x50
 </TASK>

Comment 1 bendem 2023-06-06 19:36:36 UTC
Created attachment 1969357 [details]
File: dmesg

Comment 2 dgcampea 2023-06-26 16:42:01 UTC
Description of problem:
This occurred  after doing a screen-lock in gnome.

Version-Release number of selected component:
kernel-core-6.3.8-200.fc38

Additional info:
reporter:       libreport-2.17.10
kernel:         6.3.8-200.fc38.x86_64
crash_function: amdgpu_irq_put
reason:         WARNING: CPU: 11 PID: 95799 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:599 amdgpu_irq_put+0x46/0x70 [amdgpu] [amdgpu]
type:           Kerneloops
cmdline:        BOOT_IMAGE=(hd7,gpt2)/vmlinuz-6.3.8-200.fc38.x86_64 root=UUID=be8820e5-4fb9-41b5-b1c2-83d3fa719db3 ro rootflags=subvol=root rd.luks.uuid=luks-1b26ba47-3c8c-4c73-9429-42f7e16bd996 rhgb quiet amdgpu.ppfeaturemask=0xfffd7fff lockdown=integrity iommu=Force transparent_hugepage=always nowatchdog
package:        kernel-core-6.3.8-200.fc38
runlevel:       N 5
comment:        This occurred  after doing a screen-lock in gnome.

Truncated backtrace:
WARNING: CPU: 11 PID: 95799 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:599 amdgpu_irq_put+0x46/0x70 [amdgpu]
Modules linked in: uinput overlay rfcomm snd_seq_dummy snd_hrtimer xt_CHECKSUM xt_MASQUERADE xt_conntrack nf_nat_tftp nf_conntrack_tftp bridge stp llc nf_nat_sip nf_conntrack_sip xt_DSCP xt_hashlimit xt_multiport ipt_REJECT xt_set xt_nat xt_comment ip_set_hash_ip nf_conntrack_netbios_ns nf_conntrack_broadcast nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4 nf_reject_ipv6 nft_reject nf_log_syslog nft_log nft_ct nft_chain_nat ip6table_nat ip6table_mangle ip6table_raw ip6table_security iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 iptable_mangle iptable_raw iptable_security ip_set nf_tables nfnetlink ip6table_filter iptable_filter bnep sunrpc binfmt_misc vfat fat intel_rapl_msr intel_rapl_common amd64_edac edac_mce_amd snd_hda_codec_hdmi snd_hda_codec_realtek kvm_amd snd_hda_codec_generic kvm btusb snd_hda_intel irqbypass eeepc_wmi btrtl snd_intel_dspcfg gspca_ov534 btbcm gspca_main asus_wmi snd_intel_sdw_acpi btintel videobuf2_vmalloc snd_usb_audio ledtrig_audio
 videobuf2_memops snd_hda_codec snd_usbmidi_lib btmtk videobuf2_v4l2 snd_hda_core snd_rawmidi videobuf2_common sparse_keymap snd_hwdep platform_profile rapl bluetooth snd_seq snd_seq_device uas videodev pcspkr wmi_bmof snd_pcm rfkill k10temp i2c_piix4 usb_storage snd_timer mc snd soundcore joydev acpi_cpufreq sch_cake loop zram dm_crypt mlx4_en amdgpu i2c_algo_bit drm_ttm_helper ttm video crct10dif_pclmul iommu_v2 drm_buddy crc32_pclmul crc32c_intel gpu_sched polyval_clmulni polyval_generic drm_display_helper ghash_clmulni_intel sha512_ssse3 nvme cec mlx4_core ccp nvme_core r8169 nvme_common wmi tcp_bbr scsi_dh_rdac scsi_dh_emc scsi_dh_alua ip6_tables ip_tables dm_multipath i2c_dev fuse ecryptfs
CPU: 11 PID: 95799 Comm: kworker/u64:12 Not tainted 6.3.8-200.fc38.x86_64 #1
Hardware name: System manufacturer System Product Name/PRIME X570-P, BIOS 4602 02/23/2023
Workqueue: amdgpu-reset-dev drm_sched_job_timedout [gpu_sched]
RIP: 0010:amdgpu_irq_put+0x46/0x70 [amdgpu]
Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 bf aa 75 cc e9 5a fd ff ff <0f> 0b b8 ea ff ff ff e9 ae aa 75 cc b8 ea ff ff ff e9 a4 aa 75 cc
RSP: 0018:ffffc23300e87c90 EFLAGS: 00010246
RAX: ffff9fbf0546bb58 RBX: ffff9fbf0f120000 RCX: 0000000000000000
RDX: 0000000000000000 RSI: ffff9fbf0f12bf20 RDI: ffff9fbf0f120000
RBP: ffff9fbf0f120000 R08: 000000000003ae80 R09: 0000000000000006
R10: ffffed2415e50008 R11: 0000000000000000 R12: 0000000000001050
R13: ffff9fbf0f1389a8 R14: ffff9fc2b2b20800 R15: 0000000000000000
FS:  0000000000000000(0000) GS:ffff9fcdeecc0000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 00007faa68655018 CR3: 00000004873e2000 CR4: 0000000000350ee0
Call Trace:
 <TASK>
 ? amdgpu_irq_put+0x46/0x70 [amdgpu]
 ? __warn+0x81/0x130
 ? amdgpu_irq_put+0x46/0x70 [amdgpu]
 ? report_bug+0x171/0x1a0
 ? handle_bug+0x3c/0x80
 ? exc_invalid_op+0x17/0x70
 ? asm_exc_invalid_op+0x1a/0x20
 ? amdgpu_irq_put+0x46/0x70 [amdgpu]
 gfx_v9_0_hw_fini+0x17e/0x700 [amdgpu]
 amdgpu_device_ip_suspend_phase2+0x101/0x1a0 [amdgpu]
 ? amdgpu_device_ip_suspend_phase1+0x6f/0xe0 [amdgpu]
 amdgpu_device_ip_suspend+0x36/0x70 [amdgpu]
 amdgpu_device_pre_asic_reset+0xd3/0x2b0 [amdgpu]
 amdgpu_device_gpu_recover+0x4c7/0xd60 [amdgpu]
 amdgpu_job_timedout+0x18d/0x240 [amdgpu]
 drm_sched_job_timedout+0x7a/0x110 [gpu_sched]
 process_one_work+0x1c7/0x3d0
 worker_thread+0x51/0x390
 ? __pfx_worker_thread+0x10/0x10
 kthread+0xde/0x110
 ? __pfx_kthread+0x10/0x10
 ret_from_fork+0x2c/0x50
 </TASK>

Comment 3 Arne 2023-07-05 21:00:08 UTC
Description of problem:
Playing a game on Steam. Probably the integrated GPU overheated

Version-Release number of selected component:
kernel-core-6.3.8-200.fc38

Additional info:
reporter:       libreport-2.17.11
kernel:         6.3.8-200.fc38.x86_64
crash_function: amdgpu_irq_put
reason:         WARNING: CPU: 5 PID: 98819 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:599 amdgpu_irq_put+0x46/0x70 [amdgpu] [amdgpu]
type:           Kerneloops
cmdline:        BOOT_IMAGE=(hd0,gpt2)/vmlinuz-6.3.8-200.fc38.x86_64 root=UUID=49eb640a-3fb0-476f-9871-ceeaf9912908 ro rootflags=subvol=root rd.luks.uuid=luks-833935e8-e1c9-48ef-a3ac-da068f536210 rhgb quiet
package:        kernel-core-6.3.8-200.fc38
runlevel:       N 5
comment:        Playing a game on Steam. Probably the integrated GPU overheated

Truncated backtrace:
#1 [TASK] ? amdgpu_irq_put in amdgpu
#2 [TASK] ? __warn
#3 [TASK] ? amdgpu_irq_put in amdgpu
#4 [TASK] ? report_bug
#5 [TASK] ? handle_bug
#6 [TASK] ? exc_invalid_op
#7 [TASK] ? asm_exc_invalid_op
#8 [TASK] ? amdgpu_irq_put in amdgpu
#9 [TASK] gfx_v9_0_hw_fini in amdgpu
#10 [TASK] amdgpu_device_ip_suspend_phase2 in amdgpu
#11 [TASK] ? amdgpu_device_ip_suspend_phase1 in amdgpu
#12 [TASK] amdgpu_device_ip_suspend in amdgpu
#13 [TASK] amdgpu_device_pre_asic_reset in amdgpu
#14 [TASK] amdgpu_device_gpu_recover in amdgpu
#15 [TASK] amdgpu_job_timedout in amdgpu
#16 [TASK] drm_sched_job_timedout in gpu_sched
#17 [TASK] ? __pfx_worker_thread
#18 [TASK] ? __pfx_kthread
#19 [TASK] ret_from_fork


Note You need to log in before you can comment on or make changes to this bug.