Bug 1399983 - [abrt] NMI watchdog: BUG: soft lockup - CPU#2 stuck for 23s! [plymouthd:4897]
Summary: [abrt] NMI watchdog: BUG: soft lockup - CPU#2 stuck for 23s! [plymouthd:4897]
Keywords:
Status: CLOSED DUPLICATE of bug 1447677
Alias: None
Product: Fedora
Classification: Fedora
Component: xorg-x11-drv-nouveau
Version: 27
Hardware: x86_64
OS: Unspecified
unspecified
unspecified
Target Milestone: ---
Assignee: Karol Herbst
QA Contact: Fedora Extras Quality Assurance
URL: https://retrace.fedoraproject.org/faf...
Whiteboard: abrt_hash:9d9710d51af9513215f52f0da04...
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2016-11-30 07:52 UTC by Orsiris de Jong
Modified: 2018-06-12 11:35 UTC (History)
11 users (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2018-06-12 11:35:56 UTC
Type: ---
Embargoed:


Attachments (Terms of Use)
File: dmesg (72.99 KB, text/plain)
2016-11-30 07:52 UTC, Orsiris de Jong
no flags Details
syslog for entire boot/shutdown cycle (368.81 KB, text/plain)
2017-05-05 18:05 UTC, Need Real Name
no flags Details

Description Orsiris de Jong 2016-11-30 07:52:04 UTC
Description of problem:
If I happen to have nouveau driver loaded, I usually get PCI AER errors, or CPU softlockups. Does not happen under windows (hardware is ok).

Additional info:
reporter:       libreport-2.8.0
NMI watchdog: BUG: soft lockup - CPU#2 stuck for 23s! [plymouthd:4897]
Modules linked in: rfcomm fuse ccm xt_CHECKSUM ipt_MASQUERADE nf_nat_masquerade_ipv4 tun nf_conntrack_netbios_ns nf_conntrack_broadcast ip6t_rpfilter ip6t_REJECT nf_reject_ipv6 xt_conntrack ip_set nfnetlink ebtable_broute bridge ebtable_nat ip6table_security ip6table_raw ip6table_mangle ip6table_nat nf_conntrack_ipv6 nf_defrag_ipv6 nf_nat_ipv6 iptable_security iptable_raw iptable_mangle iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 nf_nat nf_conntrack ebtable_filter ebtables ip6table_filter ip6_tables cmac bnep vfat fat intel_rapl btrfs x86_pkg_temp_thermal coretemp kvm_intel arc4 kvm irqbypass crct10dif_pclmul crc32_pclmul xor ghash_clmulni_intel iwlmvm snd_hda_codec_hdmi snd_hda_codec_realtek mac80211 snd_hda_codec_generic iTCO_wdt iTCO_vendor_support msi_wmi sparse_keymap
 snd_hda_intel snd_hda_codec iwlwifi snd_hda_core snd_hwdep snd_seq cfg80211 snd_seq_device snd_pcm uvcvideo intel_cstate raid6_pq videobuf2_vmalloc hci_uart videobuf2_memops videobuf2_v4l2 videobuf2_core snd_timer btusb intel_uncore i2c_i801 btrtl intel_rapl_perf btbcm snd mei_me btqca btintel videodev mei i2c_smbus soundcore shpchp bluetooth rtsx_usb_ms memstick media joydev pinctrl_sunrisepoint rfkill pinctrl_intel intel_lpss_acpi intel_lpss tpm_tis tpm_tis_core acpi_als kfifo_buf acpi_pad industrialio tpm nfsd auth_rpcgss nfs_acl lockd grace sunrpc rtsx_usb_sdmmc mmc_core rtsx_usb nouveau i915 mxm_wmi ttm i2c_algo_bit 8021q drm_kms_helper garp stp crc32c_intel llc mrp serio_raw drm nvme nvme_core alx mdio wmi video i2c_hid fjes
CPU: 2 PID: 4897 Comm: plymouthd Not tainted 4.8.8-300.fc25.x86_64 #1
Hardware name: Micro-Star International Co., Ltd. GE62/MS-16J5, BIOS 1.22 11/02/2016
task: ffff8cebb44f3e00 task.stack: ffff8cebdbc9c000
RIP: 0010:[<ffffffffa0400890>]  [<ffffffffa0400890>] ioread32+0x30/0x40
RSP: 0018:ffff8cebdbc9f820  EFLAGS: 00000292
RAX: 00000000ffffffff RBX: ffff8cebe8028800 RCX: 0000000000000018
RDX: 00000ff38b8e8840 RSI: ffffbba80410a014 RDI: ffffbba804009400
RBP: ffff8cebdbc9f840 R08: ffffea0011504820 R09: 000000000037f000
R10: 000000000047f000 R11: ffff8cebdbc9f7d7 R12: 00000000ffffffff
R13: ffff8cebe539c008 R14: ffff8cebe6a07980 R15: ffffffffffffffff
FS:  00007fe5e066dc40(0000) GS:ffff8cebfec80000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 0000558e34d8d610 CR3: 0000000420b5a000 CR4: 00000000003406e0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Stack:
 ffffffffc04584db ffff8cebe8028800 0000000000000000 ffff8cebe539c008
 ffff8cebdbc9f850 ffffffffc045805f ffff8cebdbc9f888 ffffffffc0450a20
 ffff8cebe539c008 ffff8cebe8028800 000009b5e49d974b 000009b5e21ab94d
Call Trace:
 [<ffffffffc04584db>] ? nv04_timer_read+0x3b/0x70 [nouveau]
 [<ffffffffc045805f>] nvkm_timer_read+0xf/0x20 [nouveau]
 [<ffffffffc0450a20>] nvkm_pmu_init+0x50/0x450 [nouveau]
 [<ffffffffc04069b5>] nvkm_subdev_init+0x95/0x200 [nouveau]
 [<ffffffffc045c908>] nvkm_device_init+0x178/0x280 [nouveau]
 [<ffffffffc0460638>] nvkm_udevice_init+0x48/0x60 [nouveau]
 [<ffffffffc0405340>] nvkm_object_init+0x40/0x190 [nouveau]
 [<ffffffffc04053b4>] nvkm_object_init+0xb4/0x190 [nouveau]
 [<ffffffffc040255e>] nvkm_client_init+0xe/0x10 [nouveau]
 [<ffffffffc04a64ce>] nvkm_client_resume+0xe/0x10 [nouveau]
 [<ffffffffc0401797>] nvif_client_resume+0x17/0x20 [nouveau]
 [<ffffffffc04a3a9b>] nouveau_do_resume+0x4b/0x120 [nouveau]
 [<ffffffffc04a3e9f>] nouveau_pmops_runtime_resume+0x7f/0x120 [nouveau]
 [<ffffffffa0438f6b>] pci_pm_runtime_resume+0x7b/0xa0
 [<ffffffffa0538a77>] __rpm_callback+0x27/0x70
 [<ffffffffa0538ae4>] rpm_callback+0x24/0x80
 [<ffffffffa0438ef0>] ? pci_restore_standard_config+0x40/0x40
 [<ffffffffa0539477>] rpm_resume+0x4a7/0x6b0
 [<ffffffffa036c272>] ? cred_has_capability+0x72/0x120
 [<ffffffffa053a52e>] __pm_runtime_resume+0x4e/0x70
 [<ffffffffc04a43db>] nouveau_drm_open+0x3b/0x1a0 [nouveau]
 [<ffffffffa036c380>] ? selinux_capable+0x20/0x30
 [<ffffffffa0362918>] ? security_capable+0x48/0x60
 [<ffffffffc016dcc5>] drm_open+0x1f5/0x400 [drm]
 [<ffffffffc0173f50>] drm_stub_open+0xb0/0x110 [drm]
 [<ffffffffa0255590>] chrdev_open+0xb0/0x180
 [<ffffffffa024dac5>] do_dentry_open+0x205/0x2e0
 [<ffffffffa02554e0>] ? cdev_put+0x30/0x30
 [<ffffffffa024eeac>] vfs_open+0x4c/0x70
 [<ffffffffa025c64b>] ? may_open+0x9b/0x100
 [<ffffffffa025fb14>] path_openat+0x664/0x1380
 [<ffffffffa0261b31>] do_filp_open+0x91/0x100
 [<ffffffffa02724a4>] ? mntput+0x24/0x40
 [<ffffffffa026fdcf>] ? __alloc_fd+0x3f/0x170
 [<ffffffffa024f2c0>] do_sys_open+0x130/0x220
 [<ffffffffa024f3ce>] SyS_open+0x1e/0x20
 [<ffffffffa08027b2>] entry_SYSCALL_64_fastpath+0x1a/0xa4
Code: 03 00 77 25 48 81 ff 00 00 01 00 76 05 0f b7 d7 ed c3 55 48 c7 c6 e6 6d c7 a0 48 89 e5 e8 19 ff ff ff b8 ff ff ff ff 5d c3 8b 07 <c3> 0f 1f 44 00 00 66 2e 0f 1f 84 00 00 00 00 00 48 81 fe ff ff

Comment 1 Orsiris de Jong 2016-11-30 07:52:13 UTC
Created attachment 1226188 [details]
File: dmesg

Comment 2 Frank Büttner 2017-02-24 11:50:42 UTC
Before the NMI, I can see, that the kernel fails to change the power mode of the nvida gpu.
In my case it in an Lenovo T440p.
When I let the system run via power supply, than it don't happens.
Only running on battery.

Comment 3 Sam Roza 2017-04-13 01:23:56 UTC
Same issue here:

~~~
Apr 12 18:04:53 ypestis2 kernel: video LNXVIDEO:01: Cannot transition to power state D0 for parent in D3cold
Apr 12 18:04:53 ypestis2 kernel: nouveau 0000:01:00.0: Refused to change power state, currently in D3
Apr 12 18:04:53 ypestis2 kernel: video LNXVIDEO:01: Cannot transition to power state D0 for parent in D3cold
Apr 12 18:04:53 ypestis2 kernel: nouveau 0000:01:00.0: Refused to change power state, currently in D3
Apr 12 18:04:53 ypestis2 kernel: video LNXVIDEO:01: Cannot transition to power state D0 for parent in D3cold
Apr 12 18:04:53 ypestis2 kernel: nouveau 0000:01:00.0: Refused to change power state, currently in D3
Apr 12 18:04:53 ypestis2 kernel: nouveau 0000:01:00.0: DRM: resuming kernel object tree...
Apr 12 18:04:53 ypestis2 tracker-extract[3671]: unable to create file '/run/user/0/dconf/user': Permission denied.  dconf will not work properly.
Apr 12 18:05:21 ypestis2 kernel: NMI watchdog: BUG: soft lockup - CPU#6 stuck for 23s! [Xorg:3216]
Apr 12 18:05:21 ypestis2 kernel: Modules linked in: rfcomm fuse xt_CHECKSUM ipt_MASQUERADE nf_nat_masquerade_ipv4 tun nfsv3 rpcsec_gss_krb5 nfsv4 dns_resolver nfs fscache ip6t_rpfilter 
Apr 12 18:05:21 ypestis2 kernel:  intel_uncore videodev intel_rapl_perf btintel bluetooth joydev media cfg80211 snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_codec_generic snd_hda_in
Apr 12 18:05:21 ypestis2 kernel: CPU: 6 PID: 3216 Comm: Xorg Not tainted 4.10.8-200.fc25.x86_64 #1
Apr 12 18:05:21 ypestis2 kernel: Hardware name: LENOVO 20EGS0R600/20EGS0R600, BIOS GNET71WW (2.19 ) 02/05/2015
Apr 12 18:05:21 ypestis2 kernel: task: ffff8b8c13050000 task.stack: ffffadc6ca27c000
Apr 12 18:05:21 ypestis2 kernel: RIP: 0010:ioread32+0x30/0x40
Apr 12 18:05:21 ypestis2 kernel: RSP: 0018:ffffadc6ca27fae8 EFLAGS: 00000296 ORIG_RAX: ffffffffffffff10
Apr 12 18:05:21 ypestis2 kernel: RAX: 00000000ffffffff RBX: ffff8b8c55567800 RCX: 0000000000000018
Apr 12 18:05:21 ypestis2 kernel: RDX: 00000417600a8538 RSI: ffffadc6c910a014 RDI: ffffadc6c9009410
Apr 12 18:05:21 ypestis2 kernel: RBP: ffffadc6ca27fb08 R08: 0000000000000002 R09: ffffadc6ca27faf4
Apr 12 18:05:21 ypestis2 kernel: R10: 0000000000000000 R11: 0000000000000000 R12: 00000000ffffffff
Apr 12 18:05:21 ypestis2 kernel: R13: 00000000ffffffff R14: ffffffffffffffff R15: ffff8b8c52d60d80
Apr 12 18:05:21 ypestis2 kernel: FS:  00007fb4de80e600(0000) GS:ffff8b8c7e380000(0000) knlGS:0000000000000000
Apr 12 18:05:21 ypestis2 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Apr 12 18:05:21 ypestis2 kernel: CR2: 00007f5b0c5d0028 CR3: 0000000827fcd000 CR4: 00000000001406e0
Apr 12 18:05:21 ypestis2 kernel: Call Trace:
Apr 12 18:05:21 ypestis2 kernel:  ? nv04_timer_read+0x48/0x60 [nouveau]
Apr 12 18:05:21 ypestis2 kernel:  nvkm_timer_read+0xf/0x20 [nouveau]
Apr 12 18:05:21 ypestis2 kernel:  nvkm_pmu_reset+0x71/0x160 [nouveau]
Apr 12 18:05:21 ypestis2 kernel:  nvkm_pmu_preinit+0x12/0x20 [nouveau]
Apr 12 18:05:21 ypestis2 kernel:  nvkm_subdev_preinit+0x34/0x110 [nouveau]
Apr 12 18:05:21 ypestis2 kernel:  nvkm_device_init+0x62/0x280 [nouveau]
Apr 12 18:05:21 ypestis2 kernel:  nvkm_udevice_init+0x48/0x60 [nouveau]
Apr 12 18:05:21 ypestis2 kernel:  nvkm_object_init+0x40/0x190 [nouveau]
Apr 12 18:05:21 ypestis2 kernel:  nvkm_object_init+0xb4/0x190 [nouveau]
Apr 12 18:05:21 ypestis2 kernel:  nvkm_client_init+0xe/0x10 [nouveau]
Apr 12 18:05:21 ypestis2 kernel:  nvkm_client_resume+0xe/0x10 [nouveau]
Apr 12 18:05:21 ypestis2 kernel:  nvif_client_resume+0x17/0x20 [nouveau]
Apr 12 18:05:21 ypestis2 kernel:  nouveau_do_resume+0x4b/0x130 [nouveau]
Apr 12 18:05:21 ypestis2 kernel:  nouveau_pmops_runtime_resume+0x78/0x150 [nouveau]
Apr 12 18:05:21 ypestis2 kernel:  ? pci_restore_standard_config+0x40/0x40
Apr 12 18:05:21 ypestis2 kernel:  pci_pm_runtime_resume+0x7b/0xa0
Apr 12 18:05:21 ypestis2 kernel:  __rpm_callback+0xc2/0x200
Apr 12 18:05:21 ypestis2 kernel:  rpm_callback+0x24/0x80
Apr 12 18:05:21 ypestis2 kernel:  ? pci_restore_standard_config+0x40/0x40
Apr 12 18:05:21 ypestis2 kernel:  rpm_resume+0x4a4/0x6b0
Apr 12 18:05:21 ypestis2 kernel:  __pm_runtime_resume+0x4e/0x80
Apr 12 18:05:21 ypestis2 kernel:  nouveau_drm_ioctl+0x3d/0xc0 [nouveau]
Apr 12 18:05:21 ypestis2 kernel:  do_vfs_ioctl+0xa3/0x5f0
Apr 12 18:05:21 ypestis2 kernel:  SyS_ioctl+0x79/0x90
Apr 12 18:05:21 ypestis2 kernel:  entry_SYSCALL_64_fastpath+0x1a/0xa9
Apr 12 18:05:21 ypestis2 kernel: RIP: 0033:0x7fb4dc0b6787
Apr 12 18:05:21 ypestis2 kernel: RSP: 002b:00007ffc65867678 EFLAGS: 00003246 ORIG_RAX: 0000000000000010
Apr 12 18:05:21 ypestis2 kernel: RAX: ffffffffffffffda RBX: 0000000001fd1880 RCX: 00007fb4dc0b6787
Apr 12 18:05:21 ypestis2 kernel: RDX: 00007ffc658676b0 RSI: 00000000c05064a7 RDI: 000000000000000d
Apr 12 18:05:21 ypestis2 kernel: RBP: 00007ffc65867710 R08: 0000000001fd81e0 R09: 0000000000000000
Apr 12 18:05:21 ypestis2 kernel: R10: 0000000000000000 R11: 0000000000003246 R12: 00000000c05064a7
Apr 12 18:05:21 ypestis2 kernel: R13: 000000000000000e R14: 00000000c05064a7 R15: 0000000001fd1780
Apr 12 18:05:21 ypestis2 kernel: Code: 03 00 77 25 48 81 ff 00 00 01 00 76 05 0f b7 d7 ed c3 55 48 c7 c6 42 14 c9 b0 48 89 e5 e8 19 ff ff ff b8 ff ff ff ff 5d c3 8b 07 <c3> 0f 1f 44 00 
Apr 12 18:05:22 ypestis2 abrt-dump-journal-oops[2603]: abrt-dump-journal-oops: Found oopses: 1
Apr 12 18:05:22 ypestis2 abrt-dump-journal-oops[2603]: abrt-dump-journal-oops: Creating problem directories
Apr 12 18:05:23 ypestis2 abrt-dump-journal-oops[2603]: Reported 1 kernel oopses to Abrt
Apr 12 18:05:33 ypestis2 abrt-server[18476]: Can't find a meaningful backtrace for hashing in '.'
Apr 12 18:05:33 ypestis2 abrt-server[18476]: Option 'DropNotReportableOopses' is not configured
Apr 12 18:05:33 ypestis2 abrt-server[18476]: Preserving oops '.' because DropNotReportableOopses is 'no'
~~~

Lenovo W541

If you need info out of the abrt dir, please let me know what.

This is semi-reproduceable: Both times I've experienced this lockup, it occurred in the following steps:

- Watch video source
- Press windows key to raise system menu
- Type 'sound' to bring up sound applet
- Video freezes, but audio continues to play
- Mouse continuously repositionable, but nothing else is active/clickable/responsive
- hardware requires forcible shutdown via powerswitch

Comment 4 Need Real Name 2017-05-05 17:54:19 UTC
kernel update:
4.10.11-200.fc25.x86_64 to 4.10.12-200.fc25.x86_64

has triggered the same problem on my  Dell Inc. XPS 15 9560/05FFDN

To see this problem:
boot and shutdown

Then I see a string of errors very similar to the above (these appear to
continue every 23 seconds until forcible shutdown via powerswitch).


Apr 30 17:59:38 dhcp19.astro.cornell.edu kernel: nouveau 0000:01:00.0: Refused to change power state, currently in D3
Apr 30 17:59:38 dhcp19.astro.cornell.edu kernel: nouveau 0000:01:00.0: Refused to change power state, currently in D3
Apr 30 17:59:38 dhcp19.astro.cornell.edu kernel: nouveau 0000:01:00.0: Refused to change power state, currently in D3
Apr 30 17:59:38 dhcp19.astro.cornell.edu kernel: nouveau 0000:01:00.0: DRM: resuming kernel object tree...
Apr 30 17:59:53 dhcp19.astro.cornell.edu realmd[1620]: quitting realmd service after timeout
Apr 30 17:59:53 dhcp19.astro.cornell.edu realmd[1620]: stopping service
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel: NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [plymouthd:2652]
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel: Modules linked in: rfcomm fuse ccm xt_CHECKSUM tun ip6t_rpfilter ip6t_REJECT nf_reject_ipv6 ip_set nfnetlink arc4 ebtable_broute bridge stp 
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel:  snd_usb_audio ghash_clmulni_intel snd_hda_core snd_usbmidi_lib snd_seq snd_hwdep ath snd_rawmidi snd_seq_device cfg80211 snd_pcm uvcvideo videobuf2_vmalloc vid
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel:  ttm i2c_algo_bit drm_kms_helper crc32c_intel serio_raw drm nvme nvme_core rtsx_pci i2c_hid video wmi fjes [last unloaded: nf_nat_ipv6]
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel: CPU: 3 PID: 2652 Comm: plymouthd Not tainted 4.10.12-200.fc25.x86_64 #1
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel: Hardware name: Dell Inc. XPS 15 9560/05FFDN, BIOS 1.2.4 03/29/2017
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel: task: ffff8b52b5fa4c00 task.stack: ffff9b670766c000
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel: RIP: 0010:nv04_timer_read+0x58/0x60 [nouveau]
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel: RSP: 0018:ffff9b670766f7d8 EFLAGS: 00000286 ORIG_RAX: ffffffffffffff10
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel: RAX: ffffffffffffffff RBX: 0000000000000000 RCX: 0000000000000018
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel: RDX: 000015eeb0a39e90 RSI: ffff9b671110a014 RDI: ffff9b6711009410
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel: RBP: ffff9b670766f7e8 R08: 0000000000000002 R09: ffff9b670766f7d4
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel: R10: 0000000000000000 R11: 0000000000000000 R12: ffffffff00000000
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel: R13: 00000000ffffffff R14: ffffffffffffffff R15: ffff8b52d558af00
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel: FS:  00007fc2c2161c40(0000) GS:ffff8b52ff4c0000(0000) knlGS:0000000000000000
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel: CR2: 000055a86cde5628 CR3: 000000083c9df000 CR4: 00000000003406e0
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel: Call Trace:
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel:  nvkm_timer_read+0xf/0x20 [nouveau]
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel:  nvkm_pmu_reset+0x71/0x160 [nouveau]
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel:  nvkm_pmu_preinit+0x12/0x20 [nouveau]
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel:  nvkm_subdev_preinit+0x34/0x110 [nouveau]
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel:  nvkm_device_init+0x62/0x280 [nouveau]
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel:  nvkm_udevice_init+0x48/0x60 [nouveau]
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel:  nvkm_object_init+0x40/0x190 [nouveau]
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel:  nvkm_object_init+0xb4/0x190 [nouveau]
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel:  nvkm_client_init+0xe/0x10 [nouveau]
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel:  nvkm_client_resume+0xe/0x10 [nouveau]
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel:  nvif_client_resume+0x17/0x20 [nouveau]
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel:  nouveau_do_resume+0x4b/0x130 [nouveau]
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel:  nouveau_pmops_runtime_resume+0x78/0x150 [nouveau]
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel:  ? pci_restore_standard_config+0x40/0x40
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel:  pci_pm_runtime_resume+0x7b/0xa0
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel:  __rpm_callback+0xc2/0x200
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel:  rpm_callback+0x24/0x80
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel:  ? pci_restore_standard_config+0x40/0x40
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel:  rpm_resume+0x4a4/0x6b0
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel:  __pm_runtime_resume+0x4e/0x80
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel:  nouveau_drm_open+0x3d/0x1e0 [nouveau]
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel:  ? selinux_capable+0x20/0x30
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel:  ? security_capable+0x48/0x60
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel:  drm_open+0x21c/0x430 [drm]
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel:  drm_stub_open+0xbb/0x100 [drm]
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel:  chrdev_open+0xb0/0x1c0
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel:  do_dentry_open+0x20a/0x2f0
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel:  ? cdev_put+0x30/0x30
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel:  vfs_open+0x4c/0x70
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel:  ? may_open+0x9b/0x100
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel:  path_openat+0x659/0x1420
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel:  do_filp_open+0x91/0x100
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel:  ? __alloc_fd+0x3f/0x170
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel:  do_sys_open+0x130/0x220
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel:  SyS_open+0x1e/0x20
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel:  entry_SYSCALL_64_fastpath+0x1a/0xa9
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel: RIP: 0033:0x7fc2c11709b0
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel: RSP: 002b:00007ffc55fdce18 EFLAGS: 00000246 ORIG_RAX: 0000000000000002
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel: RAX: ffffffffffffffda RBX: 0000000000000000 RCX: 00007fc2c11709b0
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel: RDX: 00007fc2c0177000 RSI: 0000000000000002 RDI: 000055c25c3bb390
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel: RBP: 00007ffc55fdcc40 R08: 00007fc2c1439db8 R09: 0000000000000005
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel: R10: 000055c25c3b0840 R11: 0000000000000246 R12: 0000000080001002
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel: R13: 00007fc2c1b668e0 R14: 00007fc2c1d7a45e R15: 00007fc2c21a9000
Apr 30 18:00:05 dhcp19.astro.cornell.edu kernel: Code: 94 00 00 e8 bb dd 06 d9 41 89 c5 48 8b 43 78 48 8d b8 10 94 00 00 e8 a8 dd 06 d9 41 39 c4 75 c5 49 c1 e4 20 44 89 e8 5b 4c 09 e0 <41> 5c 41 5d 5d c3 66 90
Apr 30 18:00:33 dhcp19.astro.cornell.edu kernel: NMI watchdog: BUG: soft lockup - CPU#3 stuck for 22s! [plymouthd:2652]
Apr 30 18:00:33 dhcp19.astro.cornell.edu kernel: Modules linked in: rfcomm fuse ccm xt_CHECKSUM tun ip6t_rpfilter ip6t_REJECT nf_reject_ipv6 ip_set nfnetlink arc4 ebtable_broute bridge stp llc ebtable_nat ip6t

etc...

Comment 5 Need Real Name 2017-05-05 18:05:40 UTC
Created attachment 1276661 [details]
syslog for entire boot/shutdown cycle

Comment 6 Need Real Name 2017-05-12 21:29:26 UTC
The problem persists for kernel

4.10.14-200.fc25.x86_64

Comment 7 Need Real Name 2017-05-12 21:37:43 UTC
For Dell XPS 15, the problem is independent of whether the power is from battery or from wall.

Comment 8 Need Real Name 2017-06-04 21:33:00 UTC
I am seeing the same thing for 4.11.3-200.fc25.x86_64

Comment 9 Need Real Name 2017-07-04 15:17:25 UTC
I see this problem for 4.11.7-200, 4.11.6-201 also

Change to CRITICAL

It is now a serious issue which occurs during normal uptime i.e.
before attempting to log out. The errors look identical to me:

Jul 04 10:57:24 dhcp19.astro.cornell.edu kernel: nouveau 0000:01:00.0: Refused to change power state, currently in D3
Jul 04 10:57:24 dhcp19.astro.cornell.edu kernel: nouveau 0000:01:00.0: Refused to change power state, currently in D3
Jul 04 10:57:24 dhcp19.astro.cornell.edu kernel: nouveau 0000:01:00.0: DRM: resuming object tree...
Jul 04 10:57:46 dhcp19.astro.cornell.edu gnome-terminal-[4264]: Unable to load blank_cursor from the cursor theme
Jul 04 10:57:46 dhcp19.astro.cornell.edu gnome-terminal-[4264]: Unable to load blank_cursor from the cursor theme
Jul 04 10:57:51 dhcp19.astro.cornell.edu kernel: NMI watchdog: BUG: soft lockup - CPU#7 stuck for 22s! [kworker/7:0:4223]
Jul 04 10:57:51 dhcp19.astro.cornell.edu kernel: Modules linked in: ccm asix usbnet mii rfcomm fuse xt_CHECKSUM ipt_MASQUERADE nf_nat_masquerade_ipv4 tun nf_conntrack_ne
Jul 04 10:57:51 dhcp19.astro.cornell.edu kernel:  dell_smbios dcdbas kvm snd_hda_codec_realtek snd_hda_codec_generic ath10k_pci ath10k_core irqbypass crct10dif_pclmul cr
Jul 04 10:57:51 dhcp19.astro.cornell.edu kernel:  int3400_thermal tpm acpi_pad acpi_thermal_rel hid_multitouch nouveau i915 rtsx_pci_sdmmc mmc_core mxm_wmi ttm i2c_algo_
Jul 04 10:57:51 dhcp19.astro.cornell.edu kernel: CPU: 7 PID: 4223 Comm: kworker/7:0 Not tainted 4.11.7-200.fc25.x86_64 #1
Jul 04 10:57:51 dhcp19.astro.cornell.edu kernel: Hardware name: Dell Inc. XPS 15 9560/05FFDN, BIOS 1.3.3 05/08/2017
Jul 04 10:57:51 dhcp19.astro.cornell.edu kernel: Workqueue: pm pm_runtime_work
Jul 04 10:57:51 dhcp19.astro.cornell.edu kernel: task: ffff9c9adad9cc00 task.stack: ffffbca651730000
Jul 04 10:57:51 dhcp19.astro.cornell.edu kernel: RIP: 0010:ioread32+0x30/0x40
Jul 04 10:57:51 dhcp19.astro.cornell.edu kernel: RSP: 0018:ffffbca651733b28 EFLAGS: 00000292 ORIG_RAX: ffffffffffffff10
Jul 04 10:57:51 dhcp19.astro.cornell.edu kernel: RAX: 00000000ffffffff RBX: ffff9c9ad5491400 RCX: 0000000000000018
Jul 04 10:57:51 dhcp19.astro.cornell.edu kernel: RDX: 00000ee173ccacab RSI: ffffbca64510a014 RDI: ffffbca645009400
Jul 04 10:57:51 dhcp19.astro.cornell.edu kernel: RBP: ffffbca651733b48 R08: 0000000000000002 R09: ffffbca651733b34
Jul 04 10:57:51 dhcp19.astro.cornell.edu kernel: R10: 0000000000000000 R11: 00000000000004f0 R12: 00000000ffffffff
Jul 04 10:57:51 dhcp19.astro.cornell.edu kernel: R13: ffff9c9ad55f3de0 R14: ffffffffffffffff R15: ffff9c9ad69b8c00
Jul 04 10:57:51 dhcp19.astro.cornell.edu kernel: FS:  0000000000000000(0000) GS:ffff9c9aff5c0000(0000) knlGS:0000000000000000
Jul 04 10:57:51 dhcp19.astro.cornell.edu kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jul 04 10:57:51 dhcp19.astro.cornell.edu kernel: CR2: 0000000001355000 CR3: 00000007ac832000 CR4: 00000000003406e0
Jul 04 10:57:51 dhcp19.astro.cornell.edu kernel: Call Trace:
Jul 04 10:57:51 dhcp19.astro.cornell.edu kernel:  ? nv04_timer_read+0x35/0x60 [nouveau]
Jul 04 10:57:51 dhcp19.astro.cornell.edu kernel:  nvkm_timer_read+0xf/0x20 [nouveau]
Jul 04 10:57:51 dhcp19.astro.cornell.edu kernel:  nvkm_pmu_reset+0x71/0x180 [nouveau]
Jul 04 10:57:51 dhcp19.astro.cornell.edu kernel:  nvkm_pmu_preinit+0x12/0x20 [nouveau]
Jul 04 10:57:51 dhcp19.astro.cornell.edu kernel:  nvkm_subdev_preinit+0x34/0x110 [nouveau]
Jul 04 10:57:51 dhcp19.astro.cornell.edu kernel:  nvkm_device_init+0x62/0x280 [nouveau]
Jul 04 10:57:51 dhcp19.astro.cornell.edu kernel:  nvkm_udevice_init+0x48/0x60 [nouveau]
Jul 04 10:57:51 dhcp19.astro.cornell.edu kernel:  nvkm_object_init+0x40/0x190 [nouveau]
Jul 04 10:57:51 dhcp19.astro.cornell.edu kernel:  nvkm_object_init+0xb4/0x190 [nouveau]
Jul 04 10:57:51 dhcp19.astro.cornell.edu kernel:  nvkm_client_resume+0xe/0x10 [nouveau]
Jul 04 10:57:51 dhcp19.astro.cornell.edu kernel:  nvif_client_resume+0x17/0x20 [nouveau]
Jul 04 10:57:51 dhcp19.astro.cornell.edu kernel:  nouveau_do_resume+0x40/0xe0 [nouveau]
Jul 04 10:57:51 dhcp19.astro.cornell.edu kernel:  nouveau_pmops_runtime_resume+0x78/0x150 [nouveau]
Jul 04 10:57:51 dhcp19.astro.cornell.edu kernel:  ? pci_restore_standard_config+0x40/0x40
Jul 04 10:57:51 dhcp19.astro.cornell.edu kernel:  pci_pm_runtime_resume+0x7b/0xa0
Jul 04 10:57:51 dhcp19.astro.cornell.edu kernel:  __rpm_callback+0xc2/0x200
Jul 04 10:57:51 dhcp19.astro.cornell.edu kernel:  rpm_callback+0x24/0x80
Jul 04 10:57:51 dhcp19.astro.cornell.edu kernel:  ? pci_restore_standard_config+0x40/0x40
Jul 04 10:57:51 dhcp19.astro.cornell.edu kernel:  rpm_resume+0x4a4/0x6b0
Jul 04 10:57:51 dhcp19.astro.cornell.edu kernel:  pm_runtime_work+0x50/0x90
Jul 04 10:57:51 dhcp19.astro.cornell.edu kernel:  process_one_work+0x197/0x450
Jul 04 10:57:51 dhcp19.astro.cornell.edu kernel:  worker_thread+0x4e/0x4a0
Jul 04 10:57:51 dhcp19.astro.cornell.edu kernel:  kthread+0x109/0x140
Jul 04 10:57:51 dhcp19.astro.cornell.edu kernel:  ? process_one_work+0x450/0x450
Jul 04 10:57:51 dhcp19.astro.cornell.edu kernel:  ? kthread_park+0x90/0x90
Jul 04 10:57:51 dhcp19.astro.cornell.edu kernel:  ret_from_fork+0x2c/0x40
Jul 04 10:57:51 dhcp19.astro.cornell.edu kernel: Code: 03 00 77 25 48 81 ff 00 00 01 00 76 05 0f b7 d7 ed c3 55 48 c7 c6 71 e1 c9 aa 48 89 e5 e8 19 ff ff ff b8 ff ff ff 
Jul 04 10:57:52 dhcp19.astro.cornell.edu abrt-dump-journal-oops[996]: abrt-dump-journal-oops: Found oopses: 1
Jul 04 10:57:52 dhcp19.astro.cornell.edu abrt-dump-journal-oops[996]: abrt-dump-journal-oops: Creating problem directories
Jul 04 10:57:52 dhcp19.astro.cornell.edu abrt-server[4427]: Can't find a meaningful backtrace for hashing in '.'
Jul 04 10:57:52 dhcp19.astro.cornell.edu abrt-server[4427]: Option 'DropNotReportableOopses' is not configured
Jul 04 10:57:52 dhcp19.astro.cornell.edu abrt-server[4427]: Preserving oops '.' because DropNotReportableOopses is 'no'
Jul 04 10:57:53 dhcp19.astro.cornell.edu abrt-dump-journal-oops[996]: Reported 1 kernel oopses to Abrt

And this repeats every 22 seconds until the machine is powered off.

Comment 10 Need Real Name 2017-07-10 01:59:48 UTC
Continues in 4.11.8-200.fc25.x86_64

Machine cannot power off without holding down the power
button for many seconds

Comment 11 Need Real Name 2017-07-10 02:00:46 UTC
This also leads to corrupted (discarded) journalctl logs

Comment 12 Sam Roza 2017-07-13 17:02:32 UTC
Problem still exists in 4.10.14-200.fc25.x86_64:

~~~
Jul 13 12:39:30 ypestis2 pkexec[12980]: root: Executing command [USER=root] [TTY=unknown] [CWD=/root] [COMMAND=/usr/libexec/gsd-backlight-helper --set-brightness 1438]
Jul 13 12:39:30 ypestis2 kernel: video LNXVIDEO:01: Cannot transition to power state D0 for parent in D3cold
Jul 13 12:39:30 ypestis2 kernel: nouveau 0000:01:00.0: Refused to change power state, currently in D3
Jul 13 12:39:30 ypestis2 kernel: video LNXVIDEO:01: Cannot transition to power state D0 for parent in D3cold
Jul 13 12:39:30 ypestis2 kernel: nouveau 0000:01:00.0: Refused to change power state, currently in D3
Jul 13 12:39:30 ypestis2 kernel: video LNXVIDEO:01: Cannot transition to power state D0 for parent in D3cold
Jul 13 12:39:30 ypestis2 kernel: nouveau 0000:01:00.0: Refused to change power state, currently in D3
Jul 13 12:39:30 ypestis2 kernel: nouveau 0000:01:00.0: DRM: resuming kernel object tree...
Jul 13 12:39:56 ypestis2 kernel: NMI watchdog: BUG: soft lockup - CPU#5 stuck for 23s! [Xorg:2225]
Jul 13 12:39:56 ypestis2 kernel: Modules linked in: rfcomm fuse xt_CHECKSUM ipt_MASQUERADE nf_nat_masquerade_ipv4 tun rpcsec_gss_krb5 nfsv4 dns_resolver nfs fscache ccm ip6t_rpfilter ip6t_REJECT nf_reject_ipv6 x
Jul 13 12:39:56 ypestis2 kernel:  videobuf2_v4l2 videobuf2_core intel_rapl_perf cfg80211 videodev joydev snd_hda_codec_hdmi media snd_hda_codec_realtek snd_hda_codec_generic snd_hda_intel snd_hda_codec snd_hda_c
Jul 13 12:39:56 ypestis2 kernel: CPU: 5 PID: 2225 Comm: Xorg Not tainted 4.10.14-200.fc25.x86_64 #1
Jul 13 12:39:56 ypestis2 kernel: Hardware name: LENOVO 20EGS0R600/20EGS0R600, BIOS GNET71WW (2.19 ) 02/05/2015
Jul 13 12:39:56 ypestis2 kernel: task: ffff8eab5e470000 task.stack: ffffb8a6088cc000
Jul 13 12:39:56 ypestis2 kernel: RIP: 0010:ioread32+0x30/0x40
Jul 13 12:39:56 ypestis2 kernel: RSP: 0018:ffffb8a6088cfae8 EFLAGS: 00000296 ORIG_RAX: ffffffffffffff10
Jul 13 12:39:56 ypestis2 kernel: RAX: 00000000ffffffff RBX: ffff8eab99e6fc00 RCX: 0000000000000018
Jul 13 12:39:56 ypestis2 kernel: RDX: 00000d12aaafe3a8 RSI: ffffb8a60910a014 RDI: ffffb8a609009410
Jul 13 12:39:56 ypestis2 kernel: RBP: ffffb8a6088cfb08 R08: 0000000000000002 R09: ffffb8a6088cfaf4
Jul 13 12:39:56 ypestis2 kernel: R10: 0000000000000000 R11: 0000000000000000 R12: ffff8eab99e6fc00
Jul 13 12:39:56 ypestis2 kernel: R13: ffff8eab95ced6c0 R14: ffffffffffffffff R15: ffff8eab95bd0900
Jul 13 12:39:56 ypestis2 kernel: FS:  00007f353f31dec0(0000) GS:ffff8eabbe340000(0000) knlGS:0000000000000000
Jul 13 12:39:56 ypestis2 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jul 13 12:39:56 ypestis2 kernel: CR2: 00007f05f6abf000 CR3: 000000081e3da000 CR4: 00000000001406e0
Jul 13 12:39:56 ypestis2 kernel: Call Trace:
Jul 13 12:39:56 ypestis2 kernel:  ? nv04_timer_read+0x22/0x60 [nouveau]
Jul 13 12:39:56 ypestis2 kernel:  nvkm_timer_read+0xf/0x20 [nouveau]
Jul 13 12:39:56 ypestis2 kernel:  nvkm_pmu_reset+0x71/0x160 [nouveau]
Jul 13 12:39:56 ypestis2 kernel:  nvkm_pmu_preinit+0x12/0x20 [nouveau]
Jul 13 12:39:56 ypestis2 kernel:  nvkm_subdev_preinit+0x34/0x110 [nouveau]
Jul 13 12:39:56 ypestis2 kernel:  nvkm_device_init+0x62/0x280 [nouveau]
Jul 13 12:39:56 ypestis2 kernel:  nvkm_udevice_init+0x48/0x60 [nouveau]
Jul 13 12:39:56 ypestis2 kernel:  nvkm_object_init+0x40/0x190 [nouveau]
Jul 13 12:39:56 ypestis2 kernel:  nvkm_object_init+0xb4/0x190 [nouveau]
Jul 13 12:39:56 ypestis2 kernel:  nvkm_client_init+0xe/0x10 [nouveau]
Jul 13 12:39:56 ypestis2 kernel:  nvkm_client_resume+0xe/0x10 [nouveau]
Jul 13 12:39:56 ypestis2 kernel:  nvif_client_resume+0x17/0x20 [nouveau]
Jul 13 12:39:56 ypestis2 kernel:  nouveau_do_resume+0x4b/0x130 [nouveau]
Jul 13 12:39:56 ypestis2 kernel:  nouveau_pmops_runtime_resume+0x78/0x150 [nouveau]
Jul 13 12:39:56 ypestis2 kernel:  ? pci_restore_standard_config+0x40/0x40
Jul 13 12:39:56 ypestis2 kernel:  pci_pm_runtime_resume+0x7b/0xa0
Jul 13 12:39:56 ypestis2 kernel:  __rpm_callback+0xc2/0x200
Jul 13 12:39:56 ypestis2 kernel:  rpm_callback+0x24/0x80
Jul 13 12:39:56 ypestis2 kernel:  ? pci_restore_standard_config+0x40/0x40
Jul 13 12:39:56 ypestis2 kernel:  rpm_resume+0x4a4/0x6b0
Jul 13 12:39:56 ypestis2 kernel:  __pm_runtime_resume+0x4e/0x80
Jul 13 12:39:56 ypestis2 kernel:  nouveau_drm_ioctl+0x3d/0xc0 [nouveau]
Jul 13 12:39:56 ypestis2 kernel:  do_vfs_ioctl+0xa3/0x5f0
Jul 13 12:39:56 ypestis2 kernel:  SyS_ioctl+0x79/0x90
Jul 13 12:39:56 ypestis2 kernel:  entry_SYSCALL_64_fastpath+0x1a/0xa9
Jul 13 12:39:56 ypestis2 kernel: RIP: 0033:0x7f353cbc5637
Jul 13 12:39:56 ypestis2 kernel: RSP: 002b:00007fff0deb7d98 EFLAGS: 00003246 ORIG_RAX: 0000000000000010
~~~

This is a *real* problem. It *must* be looked into. I cannot be rebooting my laptop several times a day if on battery power, or when I un-dock my laptop, or then the background dims, or if I want to turn the volume up while watching a video.

Comment 13 Need Real Name 2017-07-26 00:43:07 UTC
With kernel 4.11.8-200 the bug (with the same sort of traceback) occurs during normal operation, not just at power down. The system is now entirely unusable.

Comment 14 Need Real Name 2017-07-26 00:45:12 UTC
Sorry, make that 4.11.10-200

Comment 15 Fedora End Of Life 2017-11-16 15:06:14 UTC
This message is a reminder that Fedora 25 is nearing its end of life.
Approximately 4 (four) weeks from now Fedora will stop maintaining
and issuing updates for Fedora 25. It is Fedora's policy to close all
bug reports from releases that are no longer maintained. At that time
this bug will be closed as EOL if it remains open with a Fedora  'version'
of '25'.

Package Maintainer: If you wish for this bug to remain open because you
plan to fix it in a currently maintained version, simply change the 'version'
to a later Fedora version.

Thank you for reporting this issue and we are sorry that we were not
able to fix it before Fedora 25 is end of life. If you would still like
to see this bug fixed and are able to reproduce it against a later version
of Fedora, you are encouraged  change the 'version' to a later Fedora
version prior this bug is closed as described in the policy above.

Although we aim to fix as many bugs as possible during every release's
lifetime, sometimes those efforts are overtaken by events. Often a
more recent Fedora release includes newer upstream software that fixes
bugs or makes them obsolete.

Comment 16 Sam Roza 2017-12-08 00:31:50 UTC
Problem is worked around by adding 'nouveau.runpm=0' to the boot options.

Comment 17 Peter Lemenkov 2018-06-12 11:35:56 UTC

*** This bug has been marked as a duplicate of bug 1447677 ***


Note You need to log in before you can comment on or make changes to this bug.