Bug 1509294 - Random freezes with nouveau driver on Lenovo Thinkpad P50
Summary: Random freezes with nouveau driver on Lenovo Thinkpad P50
Keywords:
Status: CLOSED EOL
Alias: None
Product: Fedora
Classification: Fedora
Component: xorg-x11-drv-nouveau
Version: 33
Hardware: x86_64
OS: Linux
high
high
Target Milestone: ---
Assignee: Ben Skeggs
QA Contact: Fedora Extras Quality Assurance
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2017-11-03 13:41 UTC by Will Newton
Modified: 2021-11-30 19:15 UTC (History)
44 users (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2021-11-30 19:15:55 UTC
Type: Bug
Embargoed:


Attachments (Terms of Use)
Output of journalctl -k -b -1 --no-pager --no-hostname (96.68 KB, text/plain)
2018-04-30 17:06 UTC, Stefano Biagiotti
no flags Details

Description Will Newton 2017-11-03 13:41:50 UTC
Description of problem:

With both Wayland an Xorg after some time of using the system I get a lockup. The mouse freezes and some of the text gets drawn upside down and backwards on the screen. I have seen it happen in both Firefox and GNOME Terminal.

My hardware is a Thinkpad P50, with Nvidia graphics selected in the BIOS (not hybrid).

Version-Release number of selected component (if applicable):

kernel 4.13.9-200.fc26.x86_64

How reproducible:

Occurs several times per day but not obvious what triggers it.

Additional info:

dmesg logs:

Nov 03 13:11:12 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: TRAP ch 16 [00ff817000 systemd-logind[1081]]
Nov 03 13:11:12 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC0/TEX: 80000041
Nov 03 13:11:12 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC1/TEX: 80000041
Nov 03 13:11:12 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC2/TEX: 80000041
Nov 03 13:11:12 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC3/TEX: 80000041
Nov 03 13:11:12 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC4/TEX: 80000041
Nov 03 13:11:12 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: read fault at 00039ba000 engine 00 [GR] client 1e [GPC0/PE_7] reason 02 [PTE] on channel 16 [00ff817000 systemd-logind[1081]]
Nov 03 13:11:12 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: channel 16: killed
Nov 03 13:11:12 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: runlist 0: scheduled for recovery
Nov 03 13:11:12 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: engine 0: scheduled for recovery
Nov 03 13:11:12 localhost.localdomain kernel: nouveau 0000:01:00.0: systemd-logind[1081]: channel 16 killed!
Nov 03 13:11:24 localhost.localdomain kernel: [drm:drm_atomic_helper_swap_state [drm_kms_helper]] *ERROR* [CRTC:38:head-0] hw_done timed out
Nov 03 13:11:34 localhost.localdomain kernel: [drm:drm_atomic_helper_wait_for_dependencies [drm_kms_helper]] *ERROR* [CRTC:38:head-0] hw_done timed out
Nov 03 13:11:44 localhost.localdomain kernel: [drm:drm_atomic_helper_wait_for_dependencies [drm_kms_helper]] *ERROR* [CRTC:38:head-0] flip_done timed out

There are no errors visible in the Xorg log.

Comment 1 Jeremy Cline 2017-11-03 13:56:45 UTC
Hello,

Thank you for the bug report. This bug is in a video subsystem that has a kernel part. We track and work on these bugs via the driver package name instead of leaving them assigned to the kernel.

Comment 2 Will Newton 2017-11-06 10:14:16 UTC
The logs vary slightly between crashes:

Nov 06 10:04:02 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: TRAP ch 17 [00fd8ff000 Xorg[1858]]
Nov 06 10:04:02 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC2/TEX: 80000000
Nov 06 10:04:02 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC3/TEX: 80000000
Nov 06 10:04:02 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: read fault at 00028af000 engine 00 [GR] client 18 [GPC0/PE_5] reason 02 [PTE] on channel 17 [00fd8ff000 Xorg[1858]]
Nov 06 10:04:02 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: channel 17: killed
Nov 06 10:04:02 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: runlist 0: scheduled for recovery
Nov 06 10:04:02 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: engine 0: scheduled for recovery
Nov 06 10:04:02 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: engine 5: scheduled for recovery
Nov 06 10:04:02 localhost.localdomain kernel: nouveau 0000:01:00.0: Xorg[1858]: channel 17 killed!
Nov 06 10:04:19 localhost.localdomain kernel: [drm:drm_atomic_helper_swap_state [drm_kms_helper]] *ERROR* [CRTC:38:head-0] hw_done timed out
Nov 06 10:04:30 localhost.localdomain kernel: [drm:drm_atomic_helper_wait_for_dependencies [drm_kms_helper]] *ERROR* [CRTC:38:head-0] hw_done timed out
Nov 06 10:04:40 localhost.localdomain kernel: [drm:drm_atomic_helper_wait_for_dependencies [drm_kms_helper]] *ERROR* [CRTC:38:head-0] flip_done timed out
Nov 06 10:04:50 localhost.localdomain kernel: [drm:drm_atomic_helper_swap_state [drm_kms_helper]] *ERROR* [CRTC:38:head-0] hw_done timed out
Nov 06 10:05:00 localhost.localdomain kernel: [drm:drm_atomic_helper_wait_for_dependencies [drm_kms_helper]] *ERROR* [CRTC:38:head-0] hw_done timed out
Nov 06 10:05:10 localhost.localdomain kernel: [drm:drm_atomic_helper_wait_for_dependencies [drm_kms_helper]] *ERROR* [CRTC:38:head-0] flip_done timed out
Nov 06 10:05:21 localhost.localdomain kernel: [drm:drm_atomic_helper_swap_state [drm_kms_helper]] *ERROR* [CRTC:38:head-0] hw_done timed out
Nov 06 10:05:31 localhost.localdomain kernel: [drm:drm_atomic_helper_wait_for_dependencies [drm_kms_helper]] *ERROR* [CRTC:38:head-0] hw_done timed out
Nov 06 10:05:41 localhost.localdomain kernel: [drm:drm_atomic_helper_wait_for_dependencies [drm_kms_helper]] *ERROR* [CRTC:38:head-0] flip_done timed out
Nov 06 10:08:47 localhost.localdomain kernel: IPv6: ADDRCONF(NETDEV_UP): wlp4s0: link is not ready
Nov 06 10:09:19 localhost.localdomain kernel: [drm:drm_atomic_helper_swap_state [drm_kms_helper]] *ERROR* [CRTC:38:head-0] hw_done timed out
Nov 06 10:09:29 localhost.localdomain kernel: [drm:drm_atomic_helper_wait_for_dependencies [drm_kms_helper]] *ERROR* [CRTC:38:head-0] hw_done timed out
Nov 06 10:09:39 localhost.localdomain kernel: [drm:drm_atomic_helper_wait_for_dependencies [drm_kms_helper]] *ERROR* [CRTC:38:head-0] flip_done timed out
Nov 06 10:09:50 localhost.localdomain kernel: [drm:drm_atomic_helper_swap_state [drm_kms_helper]] *ERROR* [CRTC:38:head-0] hw_done timed out
Nov 06 10:10:00 localhost.localdomain kernel: [drm:drm_atomic_helper_wait_for_dependencies [drm_kms_helper]] *ERROR* [CRTC:38:head-0] hw_done timed out
Nov 06 10:10:10 localhost.localdomain kernel: [drm:drm_atomic_helper_wait_for_dependencies [drm_kms_helper]] *ERROR* [CRTC:38:head-0] flip_done timed out
Nov 06 10:10:30 localhost.localdomain kernel: [drm:drm_atomic_helper_swap_state [drm_kms_helper]] *ERROR* [CRTC:38:head-0] hw_done timed out
Nov 06 10:10:41 localhost.localdomain kernel: [drm:drm_atomic_helper_wait_for_dependencies [drm_kms_helper]] *ERROR* [CRTC:38:head-0] hw_done timed out

Let me know if there is any further diagnostic information it would be helpful for me to provide.

Comment 3 Will Newton 2017-11-06 15:15:07 UTC
Some more logs, with some different messages preceding the failure:

Nov 06 15:05:47 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: TRAP ch 17 [00fd8ff000 Xorg[1747]]
Nov 06 15:05:47 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/PROP trap: 00000020 [RT_HEIGHT_OVERRUN] x = 3760, y = 2112, format = 11, storage type = 0
Nov 06 15:05:47 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: TRAP ch 17 [00fd8ff000 Xorg[1747]]
Nov 06 15:05:47 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/PROP trap: 00000020 [RT_HEIGHT_OVERRUN] x = 2640, y = 2128, format = 11, storage type = 0
Nov 06 15:05:49 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: TRAP ch 17 [00fd8ff000 Xorg[1747]]
Nov 06 15:05:49 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/PROP trap: 00000020 [RT_HEIGHT_OVERRUN] x = 0, y = 2112, format = 11, storage type = 0
Nov 06 15:05:49 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: TRAP ch 17 [00fd8ff000 Xorg[1747]]
Nov 06 15:05:49 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/PROP trap: 00000020 [RT_HEIGHT_OVERRUN] x = 624, y = 2128, format = 11, storage type = 0
Nov 06 15:05:49 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: TRAP ch 17 [00fd8ff000 Xorg[1747]]
Nov 06 15:05:49 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/PROP trap: 00000020 [RT_HEIGHT_OVERRUN] x = 2864, y = 2144, format = 11, storage type = 0
Nov 06 15:05:49 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: TRAP ch 17 [00fd8ff000 Xorg[1747]]
Nov 06 15:05:49 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/PROP trap: 00000020 [RT_HEIGHT_OVERRUN] x = 0, y = 2112, format = 11, storage type = 0
Nov 06 15:05:49 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: TRAP ch 17 [00fd8ff000 Xorg[1747]]
Nov 06 15:05:49 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/PROP trap: 00000020 [RT_HEIGHT_OVERRUN] x = 1072, y = 2128, format = 11, storage type = 0
Nov 06 15:05:49 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: TRAP ch 17 [00fd8ff000 Xorg[1747]]
Nov 06 15:05:49 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/PROP trap: 00000020 [RT_HEIGHT_OVERRUN] x = 2432, y = 2144, format = 11, storage type = 0
Nov 06 15:05:50 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: TRAP ch 17 [00fd8ff000 Xorg[1747]]
Nov 06 15:05:50 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/PROP trap: 00000020 [RT_HEIGHT_OVERRUN] x = 3760, y = 2112, format = 11, storage type = 0
Nov 06 15:05:50 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: TRAP ch 17 [00fd8ff000 Xorg[1747]]
Nov 06 15:05:50 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/PROP trap: 00000020 [RT_HEIGHT_OVERRUN] x = 2384, y = 2128, format = 11, storage type = 0
Nov 06 15:05:50 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: TRAP ch 17 [00fd8ff000 Xorg[1747]]
Nov 06 15:05:50 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/PROP trap: 00000020 [RT_HEIGHT_OVERRUN] x = 1504, y = 2144, format = 11, storage type = 0
Nov 06 15:06:42 localhost.localdomain kernel: nouveau 0000:01:00.0: disp: 0x00006671[0]: INIT_GENERIC_CONDITON: unknown 0x07
Nov 06 15:06:42 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: write fault at 00029a5000 engine 00 [GR] client 0f [GPC0/PROP_0] reason 02 [PTE] on channel 17 [00fd8ff000 Xorg[1747]]
Nov 06 15:06:42 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: channel 17: killed
Nov 06 15:06:42 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: runlist 0: scheduled for recovery
Nov 06 15:06:42 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: engine 0: scheduled for recovery
Nov 06 15:06:42 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: engine 5: scheduled for recovery
Nov 06 15:06:42 localhost.localdomain kernel: nouveau 0000:01:00.0: Xorg[1747]: channel 17 killed!

This still occurs with 4.13.10-200.fc26.x86_64

Comment 4 Brian Kaye 2017-11-07 02:55:22 UTC
I have a Lenovo P50 Laptop with a 4K screen. I experience this problem frequently when using mplayer or other multimedia applications. The symptoms are variable but inevitably the system freezes and a power off/reboot is required. 
I have had this problem 3 times in the past hour.

Kernel is 

4.13.10-200.fc26.x86_64

Nouveau driver is

xorg-x11-drv-nouveau-1.0.15-1.fc26.x86_64

Final lines from journalctl:


Nov 06 21:41:30 titan kernel: nouveau 0000:01:00.0: gr: GPC0/TPC1/TEX: 80000000
Nov 06 21:41:30 titan kernel: nouveau 0000:01:00.0: fifo: read fault at 000448b000 engine 00 [GR] client 0a [GPC0/T1_3] reason 02 [PTE] on channel 6 [007f2c5000 Xorg[884]]
Nov 06 21:41:30 titan kernel: nouveau 0000:01:00.0: fifo: channel 6: killed
Nov 06 21:41:30 titan kernel: nouveau 0000:01:00.0: fifo: runlist 0: scheduled for recovery
Nov 06 21:41:30 titan kernel: nouveau 0000:01:00.0: fifo: engine 0: scheduled for recovery
Nov 06 21:41:30 titan kernel: nouveau 0000:01:00.0: Xorg[884]: channel 6 killed!
Nov 06 21:41:41 titan kernel: [drm:drm_atomic_helper_swap_state [drm_kms_helper]] *ERROR* [CRTC:38:head-0] hw_done timed out
Nov 06 21:41:45 titan NetworkManager[800]: <info>  [1510018905.3144] device (wlp4s0): supplicant interface state: inactive -> scanning
Nov 06 21:41:51 titan kernel: [drm:drm_atomic_helper_wait_for_dependencies [drm_kms_helper]] *ERROR* [CRTC:38:head-0] hw_done timed out
Nov 06 21:41:59 titan systemd[1]: Starting dnf makecache...
lines 2355-2393/2393 (END)

Comment 5 Will Newton 2017-11-07 15:02:09 UTC
I tried a couple of workarounds but none worked so far.

I tried switching to hybrid graphics rather than discrete (the P50 allows selection of hybrid or discrete but cannot select integrated graphics in the BIOS) but with hybrid enabled Fedora 26 doesn't boot.

I also tried the Nvidia binary driver but that was also unsuccessful (boot failed to bring up the display).

So in summary, it would be great to get nouveau stable on this machine. ;-)

Comment 6 Will Newton 2017-11-07 16:12:22 UTC
These are all the logs from the nouveau driver across a boot:

Nov 06 16:02:57 localhost.localdomain kernel: nouveau: detected PR support, will not use DSM
Nov 06 16:02:57 localhost.localdomain kernel: nouveau 0000:01:00.0: NVIDIA GM107 (117300a2)
Nov 06 16:02:57 localhost.localdomain kernel: nouveau 0000:01:00.0: bios: version 82.07.9d.00.14
Nov 06 16:02:57 localhost.localdomain kernel: nouveau 0000:01:00.0: fb: 4096 MiB GDDR5
Nov 06 16:02:57 localhost.localdomain kernel: nouveau 0000:01:00.0: bus: MMIO read of 00000000 FAULT at 001228 [ IBUS ]
Nov 06 16:02:57 localhost.localdomain kernel: nouveau 0000:01:00.0: bus: MMIO read of 00000000 FAULT at 10ac08 [ IBUS ]
Nov 06 16:02:57 localhost.localdomain kernel: nouveau 0000:01:00.0: DRM: VRAM: 4096 MiB
Nov 06 16:02:57 localhost.localdomain kernel: nouveau 0000:01:00.0: DRM: GART: 1048576 MiB
Nov 06 16:02:57 localhost.localdomain kernel: nouveau 0000:01:00.0: DRM: TMDS table version 2.0
Nov 06 16:02:57 localhost.localdomain kernel: nouveau 0000:01:00.0: DRM: DCB version 4.0
Nov 06 16:02:57 localhost.localdomain kernel: nouveau 0000:01:00.0: DRM: DCB outp 00: 04800fb6 04420010
Nov 06 16:02:57 localhost.localdomain kernel: nouveau 0000:01:00.0: DRM: DCB outp 01: 02011fa6 04420010
Nov 06 16:02:57 localhost.localdomain kernel: nouveau 0000:01:00.0: DRM: DCB outp 02: 02011f62 00020010
Nov 06 16:02:57 localhost.localdomain kernel: nouveau 0000:01:00.0: DRM: DCB outp 03: 08022fc6 04420010
Nov 06 16:02:57 localhost.localdomain kernel: nouveau 0000:01:00.0: DRM: DCB outp 04: 08022f82 00020010
Nov 06 16:02:57 localhost.localdomain kernel: nouveau 0000:01:00.0: DRM: DCB outp 05: 01033fd6 04420020
Nov 06 16:02:57 localhost.localdomain kernel: nouveau 0000:01:00.0: DRM: DCB outp 06: 01033f92 00020020
Nov 06 16:02:57 localhost.localdomain kernel: nouveau 0000:01:00.0: DRM: DCB conn 00: 00002047
Nov 06 16:02:57 localhost.localdomain kernel: nouveau 0000:01:00.0: DRM: DCB conn 01: 00001146
Nov 06 16:02:57 localhost.localdomain kernel: nouveau 0000:01:00.0: DRM: DCB conn 02: 00010246
Nov 06 16:02:57 localhost.localdomain kernel: nouveau 0000:01:00.0: DRM: DCB conn 03: 00020346
Nov 06 16:02:57 localhost.localdomain kernel: nouveau 0000:01:00.0: DRM: MM: using COPY for buffer copies
Nov 06 16:02:58 localhost.localdomain kernel: nouveau 0000:01:00.0: DRM: allocated 3840x2160 fb: 0x60000, bo ffff9d7b027bd000
Nov 06 16:02:58 localhost.localdomain kernel: fbcon: nouveaufb (fb0) is primary device
Nov 06 16:02:58 localhost.localdomain kernel: nouveau 0000:01:00.0: disp: 0x00006671[0]: INIT_GENERIC_CONDITON: unknown 0x07
Nov 06 16:02:58 localhost.localdomain kernel: nouveau 0000:01:00.0: fb0: nouveaufb frame buffer device
Nov 06 16:02:58 localhost.localdomain kernel: [drm] Initialized nouveau 1.3.1 20120801 for 0000:01:00.0 on minor 0
Nov 06 17:47:39 localhost.localdomain kernel: nouveau 0000:01:00.0: disp: 0x00006671[0]: INIT_GENERIC_CONDITON: unknown 0x07
Nov 06 18:11:25 localhost.localdomain kernel: nouveau 0000:01:00.0: disp: 0x00006671[0]: INIT_GENERIC_CONDITON: unknown 0x07
Nov 07 09:53:01 localhost.localdomain kernel: nouveau 0000:01:00.0: disp: 0x00006671[0]: INIT_GENERIC_CONDITON: unknown 0x07
Nov 07 16:07:02 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: TRAP ch 17 [00fd8ff000 Xorg[1773]]
Nov 07 16:07:02 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC0/TEX: 80000009
Nov 07 16:07:02 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC1/TEX: 80000009
Nov 07 16:07:02 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC2/TEX: 80000041
Nov 07 16:07:02 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC3/TEX: 80000009
Nov 07 16:07:02 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC4/TEX: 80000041
Nov 07 16:07:02 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: read fault at 0001a80000 engine 00 [GR] client 04 [GPC0/T1_1] reason 02 [PTE] on channel 17 [00fd8ff000 Xorg[1773]]
Nov 07 16:07:02 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: channel 17: killed
Nov 07 16:07:02 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: runlist 0: scheduled for recovery
Nov 07 16:07:02 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: engine 0: scheduled for recovery
Nov 07 16:07:02 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: engine 5: scheduled for recovery
Nov 07 16:07:02 localhost.localdomain kernel: nouveau 0000:01:00.0: Xorg[1773]: channel 17 killed!

Note that there are two FAULT errors printed during the boot process but this doesn't seem to cause a problem in itself.

Comment 7 Will Newton 2017-11-09 11:02:03 UTC
I've just seen this crash with kernel 4.13.11-200:

Nov 09 10:51:40 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: FB_FLUSH_TIMEOUT
Nov 09 10:51:40 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: FB_FLUSH_TIMEOUT
Nov 09 10:51:40 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: FB_FLUSH_TIMEOUT
Nov 09 10:51:40 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: FB_FLUSH_TIMEOUT
Nov 09 10:51:40 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: FB_FLUSH_TIMEOUT
Nov 09 10:51:40 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: FB_FLUSH_TIMEOUT
Nov 09 10:51:40 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: CHSW_ERROR 00000001
Nov 09 10:51:40 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: FB_FLUSH_TIMEOUT
Nov 09 10:51:40 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: CHSW_ERROR 00000002
Nov 09 10:51:40 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: FB_FLUSH_TIMEOUT
Nov 09 10:51:40 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: CHSW_ERROR 00000002
Nov 09 10:51:40 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: FB_FLUSH_TIMEOUT
Nov 09 10:51:40 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: CHSW_ERROR 00000002
Nov 09 10:51:40 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: FB_FLUSH_TIMEOUT
Nov 09 10:51:40 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: CHSW_ERROR 00000002
Nov 09 10:51:40 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: FB_FLUSH_TIMEOUT
Nov 09 10:51:40 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: CHSW_ERROR 00000002
Nov 09 10:51:40 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: CHSW_ERROR 00000002
Nov 09 10:51:40 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: CHSW_ERROR 00000002
Nov 09 10:51:40 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: CHSW_ERROR 00000002

The last message repeats forever until I shutdown.

Let me know if there is any more relevant information that would be helpful.

Comment 8 Will Newton 2017-11-09 17:04:23 UTC
Another slightly different trace:

Nov 09 16:00:01 localhost.localdomain kernel: nouveau 0000:01:00.0: disp: 0x00006671[0]: INIT_GENERIC_CONDITON: unknown 0x07
Nov 09 16:28:20 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: TRAP ch 17 [00fd8ff000 Xorg[1671]]
Nov 09 16:28:20 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC0/TEX: 80000041
Nov 09 16:28:20 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC1/TEX: 80000041
Nov 09 16:28:20 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC2/TEX: 80000041
Nov 09 16:28:20 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC3/TEX: 80000041
Nov 09 16:28:20 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC4/TEX: 80000041
Nov 09 16:28:20 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: read fault at 0001b2d000 engine 00 [GR] client 07 [GPC0/T1_2] reason 02 [PTE] on channel 17 [00fd8ff000 Xorg[1671]]
Nov 09 16:28:20 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: channel 17: killed
Nov 09 16:28:20 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: runlist 0: scheduled for recovery
Nov 09 16:28:20 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: engine 0: scheduled for recovery
Nov 09 16:28:20 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: engine 5: scheduled for recovery
Nov 09 16:28:20 localhost.localdomain kernel: nouveau 0000:01:00.0: Xorg[1671]: channel 17 killed!

Comment 9 Will Newton 2017-11-10 09:40:35 UTC
I updated the BIOS to the latest version but that doesn't seem to have changed the behaviour at all.

I am almost certain that the problem is correlated with system load. For example, light web browsing seems to allow a reasonable uptime but if I start a large build or repo sync of the Android tree then a freeze is almost inevitable.

Note that the system continues running it is just the graphics that freeze.

Output of lspci:

00:00.0 Host bridge: Intel Corporation Xeon E3-1200 v5/E3-1500 v5/6th Gen Core Processor Host Bridge/DRAM Registers (rev 07)
00:01.0 PCI bridge: Intel Corporation Xeon E3-1200 v5/E3-1500 v5/6th Gen Core Processor PCIe Controller (x16) (rev 07)
00:14.0 USB controller: Intel Corporation Sunrise Point-H USB 3.0 xHCI Controller (rev 31)
00:14.2 Signal processing controller: Intel Corporation Sunrise Point-H Thermal subsystem (rev 31)
00:16.0 Communication controller: Intel Corporation Sunrise Point-H CSME HECI #1 (rev 31)
00:16.3 Serial controller: Intel Corporation Sunrise Point-H KT Redirection (rev 31)
00:17.0 SATA controller: Intel Corporation Sunrise Point-H SATA controller [AHCI mode] (rev 31)
00:1c.0 PCI bridge: Intel Corporation Sunrise Point-H PCI Express Root Port #1 (rev f1)
00:1c.2 PCI bridge: Intel Corporation Sunrise Point-H PCI Express Root Port #3 (rev f1)
00:1c.4 PCI bridge: Intel Corporation Sunrise Point-H PCI Express Root Port #5 (rev f1)
00:1d.0 PCI bridge: Intel Corporation Sunrise Point-H PCI Express Root Port #13 (rev f1)
00:1f.0 ISA bridge: Intel Corporation Sunrise Point-H LPC Controller (rev 31)
00:1f.2 Memory controller: Intel Corporation Sunrise Point-H PMC (rev 31)
00:1f.3 Audio device: Intel Corporation Sunrise Point-H HD Audio (rev 31)
00:1f.4 SMBus: Intel Corporation Sunrise Point-H SMBus (rev 31)
00:1f.6 Ethernet controller: Intel Corporation Ethernet Connection (2) I219-LM (rev 31)
01:00.0 VGA compatible controller: NVIDIA Corporation GM107GLM [Quadro M2000M] (rev a2)
01:00.1 Audio device: NVIDIA Corporation Device 0fbc (rev a1)
04:00.0 Network controller: Intel Corporation Wireless 8260 (rev 3a)
3e:00.0 Unassigned class [ff00]: Realtek Semiconductor Co., Ltd. RTS525A PCI Express Card Reader (rev 01)

Comment 10 Will Newton 2017-11-10 12:21:48 UTC
Logs with nouveau debug enabled:

Nov 10 12:17:16 localhost.localdomain kernel: nouveau 0000:01:00.0: therm: FAN target request: 36%
Nov 10 12:17:16 localhost.localdomain kernel: nouveau 0000:01:00.0: therm: FAN update: 36
Nov 10 12:17:17 localhost.localdomain kernel: nouveau 0000:01:00.0: therm: FAN target request: 36%
Nov 10 12:17:17 localhost.localdomain kernel: nouveau 0000:01:00.0: therm: FAN update: 36
Nov 10 12:17:18 localhost.localdomain kernel: nouveau 0000:01:00.0: therm: FAN target request: 34%
Nov 10 12:17:18 localhost.localdomain kernel: nouveau 0000:01:00.0: therm: FAN target: 34
Nov 10 12:17:18 localhost.localdomain kernel: nouveau 0000:01:00.0: therm: FAN update: 34
Nov 10 12:17:19 localhost.localdomain kernel: nouveau 0000:01:00.0: therm: FAN target request: 36%
Nov 10 12:17:19 localhost.localdomain kernel: nouveau 0000:01:00.0: therm: FAN target: 36
Nov 10 12:17:19 localhost.localdomain kernel: nouveau 0000:01:00.0: therm: FAN update: 36
Nov 10 12:17:19 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: TRAP ch 17 [00fd8ff000 Xorg[1748]]
Nov 10 12:17:19 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/PROP trap: 00000020 [RT_HEIGHT_OVERRUN] x = 26, y = 2112, format = 11, storage type = 0
Nov 10 12:17:19 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: TRAP ch 17 [00fd8ff000 Xorg[1748]]
Nov 10 12:17:19 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/PROP trap: 00000020 [RT_HEIGHT_OVERRUN] x = 26, y = 2112, format = 11, storage type = 0
Nov 10 12:17:20 localhost.localdomain kernel: nouveau 0000:01:00.0: therm: FAN target request: 36%
Nov 10 12:17:20 localhost.localdomain kernel: nouveau 0000:01:00.0: therm: FAN update: 36
Nov 10 12:17:20 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: TRAP ch 17 [00fd8ff000 Xorg[1748]]
Nov 10 12:17:20 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/PROP trap: 00000020 [RT_HEIGHT_OVERRUN] x = 26, y = 2112, format = 11, storage type = 0
Nov 10 12:17:20 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: TRAP ch 17 [00fd8ff000 Xorg[1748]]
Nov 10 12:17:20 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/PROP trap: 00000020 [RT_HEIGHT_OVERRUN] x = 26, y = 2112, format = 11, storage type = 0
Nov 10 12:17:21 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: TRAP ch 17 [00fd8ff000 Xorg[1748]]
Nov 10 12:17:21 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/PROP trap: 00000020 [RT_HEIGHT_OVERRUN] x = 26, y = 2112, format = 11, storage type = 0
Nov 10 12:17:21 localhost.localdomain kernel: nouveau 0000:01:00.0: therm: FAN target request: 36%
Nov 10 12:17:21 localhost.localdomain kernel: nouveau 0000:01:00.0: therm: FAN update: 36
Nov 10 12:17:21 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: write fault at 0000000000 engine 00 [GR] client 0f [GPC0/PROP_0] reason 02 [PTE] on channel 17 [00fd8ff000 Xorg[1748]]
Nov 10 12:17:21 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: channel 17: killed
Nov 10 12:17:21 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: runlist 0: scheduled for recovery
Nov 10 12:17:21 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: engine 00: busy 1 faulted 1 chsw 0 save 0 load 1 chid 17*-> chid 17 
Nov 10 12:17:21 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: engine 00: busy 1 faulted 1 chsw 0 save 0 load 1 chid 17*-> chid 17 
Nov 10 12:17:21 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: engine 0: scheduled for recovery
Nov 10 12:17:21 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: engine 05: busy 0 faulted 0 chsw 0 save 0 load 1 chid 17*-> chid 17 
Nov 10 12:17:21 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: engine 05: busy 0 faulted 0 chsw 0 save 0 load 1 chid 17*-> chid 17 
Nov 10 12:17:21 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: engine 05: busy 0 faulted 1 chsw 0 save 0 load 1 chid 17*-> chid 17 
Nov 10 12:17:21 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: engine 5: scheduled for recovery
Nov 10 12:17:21 localhost.localdomain kernel: nouveau 0000:01:00.0: Xorg[1748]: channel 17 killed!
Nov 10 12:17:21 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: released GPCCS falcon
Nov 10 12:17:21 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: released FECS falcon
Nov 10 12:17:21 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: acquired FECS falcon
Nov 10 12:17:21 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: acquired GPCCS falcon
Nov 10 12:17:22 localhost.localdomain kernel: nouveau 0000:01:00.0: therm: FAN target request: 36%
Nov 10 12:17:22 localhost.localdomain kernel: nouveau 0000:01:00.0: therm: FAN update: 36

Comment 11 Will Newton 2017-11-13 13:18:06 UTC
Setting nouveau.runpm=0 didn't seem to stop the crash, although weirdly the system did seem to limp along for a few seconds longer than normal after the display corruption began:

Nov 13 13:07:18 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: TRAP ch 16 [00ff817000 systemd-logind[1019]]
Nov 13 13:07:18 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC0/TEX: 80000000
Nov 13 13:07:18 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC1/TEX: 80000009
Nov 13 13:07:18 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC2/TEX: 80000000
Nov 13 13:07:18 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC3/TEX: 80000009
Nov 13 13:07:18 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC4/TEX: 80000009
Nov 13 13:07:19 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: TRAP ch 16 [00ff817000 systemd-logind[1019]]
Nov 13 13:07:19 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC0/TEX: 80000009
Nov 13 13:07:19 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC1/TEX: 80000009
Nov 13 13:07:19 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC2/TEX: 80000009
Nov 13 13:07:19 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC3/TEX: 80000009
Nov 13 13:07:19 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC4/TEX: 80000009
Nov 13 13:07:20 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: TRAP ch 16 [00ff817000 systemd-logind[1019]]
Nov 13 13:07:20 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC0/TEX: 80000000
Nov 13 13:07:20 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC1/TEX: 80000009
Nov 13 13:07:20 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC2/TEX: 80000009
Nov 13 13:07:20 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC3/TEX: 80000000
Nov 13 13:07:20 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC4/TEX: 80000009
Nov 13 13:07:22 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: TRAP ch 16 [00ff817000 systemd-logind[1019]]
Nov 13 13:07:22 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC0/TEX: 80000009
Nov 13 13:07:22 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC1/TEX: 80000009
Nov 13 13:07:22 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC2/TEX: 80000009
Nov 13 13:07:22 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC3/TEX: 80000009
Nov 13 13:07:22 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC4/TEX: 80000009
Nov 13 13:07:50 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: TRAP ch 16 [00ff817000 systemd-logind[1019]]
Nov 13 13:07:50 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC0/TEX: 80000009
Nov 13 13:07:50 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC1/TEX: 80000009
Nov 13 13:07:50 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC2/TEX: 80000009
Nov 13 13:07:50 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC3/TEX: 80000009
Nov 13 13:07:50 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC4/TEX: 80000009
Nov 13 13:07:50 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: TRAP ch 16 [00ff817000 systemd-logind[1019]]
Nov 13 13:07:50 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC0/TEX: 80000000
Nov 13 13:07:50 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC1/TEX: 80000009
Nov 13 13:07:50 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC2/TEX: 80000000
Nov 13 13:07:50 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC3/TEX: 80000000
Nov 13 13:07:50 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC4/TEX: 80000000
Nov 13 13:07:52 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: TRAP ch 16 [00ff817000 systemd-logind[1019]]
Nov 13 13:07:52 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC0/TEX: 80000041
Nov 13 13:07:52 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC1/TEX: 80000041
Nov 13 13:07:52 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC2/TEX: 80000041
Nov 13 13:07:52 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC3/TEX: 80000041
Nov 13 13:07:52 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC4/TEX: 80000041
Nov 13 13:07:52 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: read fault at 00039bf000 engine 00 [GR] client 01 [GPC0/T1_0] reason 02 [PTE] on channel 16 [00ff817000 systemd-logind[1019]]
Nov 13 13:07:52 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: channel 16: killed
Nov 13 13:07:52 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: runlist 0: scheduled for recovery
Nov 13 13:07:52 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: engine 0: scheduled for recovery
Nov 13 13:07:52 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: engine 5: scheduled for recovery
Nov 13 13:07:52 localhost.localdomain kernel: nouveau 0000:01:00.0: systemd-logind[1019]: channel 16 killed!

My offer to help in any way to diagnose this further still stands. I don't have the knowledge or time to dig into the driver code and figure this out from first principles however.

Comment 12 Brian Kaye 2017-11-17 00:53:16 UTC
Nov 16 11:31:53 titan audit[1]: SERVICE_STOP pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=dnf-makecache comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
Nov 16 11:45:10 titan kernel: nouveau 0000:01:00.0: gr: TRAP ch 15 [007e4fd000 Xorg[918]]
Nov 16 11:45:10 titan kernel: nouveau 0000:01:00.0: gr: GPC0/TPC0/TEX: 80000000
Nov 16 11:45:10 titan kernel: nouveau 0000:01:00.0: gr: GPC0/TPC2/TEX: 80000009
Nov 16 11:45:10 titan kernel: nouveau 0000:01:00.0: gr: GPC0/TPC3/TEX: 80000000



...The last 3 lines are repeated many times followed by:

Nov 16 11:47:10 titan kernel: nouveau 0000:01:00.0: fifo: read fault at 00015cb000 engine 00 [GR] client 01 [GPC0/T1_0] reason 02 [PTE] on channel 15 [007e4fd000 Xorg[918]]
Nov 16 11:47:10 titan kernel: nouveau 0000:01:00.0: fifo: channel 15: killed
Nov 16 11:47:10 titan kernel: nouveau 0000:01:00.0: fifo: runlist 0: scheduled for recovery
Nov 16 11:47:10 titan kernel: nouveau 0000:01:00.0: fifo: engine 0: scheduled for recovery
Nov 16 11:47:10 titan kernel: nouveau 0000:01:00.0: Xorg[918]: channel 15 killed!
Nov 16 11:47:29 titan kernel: [drm:drm_atomic_helper_swap_state [drm_kms_helper]] *ERROR* [CRTC:38:head-0] hw_done timed out
Nov 16 11:47:39 titan kernel: [drm:drm_atomic_helper_wait_for_dependencies [drm_kms_helper]] *ERROR* [CRTC:38:head-0] hw_done timed out
Nov 16 11:47:49 titan kernel: [drm:drm_atomic_helper_wait_for_dependencies [drm_kms_helper]] *ERROR* [CRTC:38:head-0] flip_done timed out

Comment 13 Will Newton 2017-11-17 12:57:06 UTC
This is still happening with Fedora 27 and wayland:

Nov 17 12:17:20 localhost.localdomain kernel: nouveau 0000:01:00.0: disp: 0x00006671[0]: INIT_GENERIC_CONDITON: unknown 0x07
Nov 17 12:45:53 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: TRAP ch 13 [00ff817000 systemd-logind[1084]]
Nov 17 12:45:53 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC0/TEX: 80000041
Nov 17 12:45:53 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC1/TEX: 80000041
Nov 17 12:45:53 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC2/TEX: 80000041
Nov 17 12:45:53 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC3/TEX: 80000041
Nov 17 12:45:53 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC4/TEX: 80000041
Nov 17 12:45:53 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: read fault at 0005a54000 engine 00 [GR] client 15 [GPC0/PE_4] reason 02 [PTE] on channel 13 [00ff817000 systemd-logind[1084]]
Nov 17 12:45:53 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: channel 13: killed
Nov 17 12:45:53 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: runlist 0: scheduled for recovery
Nov 17 12:45:53 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: engine 0: scheduled for recovery
Nov 17 12:45:53 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: engine 5: scheduled for recovery
Nov 17 12:45:53 localhost.localdomain kernel: nouveau 0000:01:00.0: systemd-logind[1084]: channel 13 killed!
Nov 17 12:45:57 localhost.localdomain kernel: nouveau 0000:01:00.0: Xwayland[1908]: nv50cal_space: -16

Comment 14 amashah 2017-11-17 22:42:39 UTC
We most certainly have the same issue Will.

Here are my collected logs which likely duplicate yours.  Out of curiosity, are you using the P50 with a docking station when these lockups occur? I have noticed it frequently when docked, however I have had it a couple while not docked as well.  It seems to be getting worse as of late.. 


Nov 16 12:11:28  gsd-media-keys[1791]: Unable to get default source
Nov 16 12:11:28  gsd-color[1783]: unable to get EDID for xrandr-eDP-1: unable to get EDID for output
Nov 16 12:11:28  gnome-shell[1412]: Failed to apply DRM plane transform 0: Invalid argument
Nov 16 12:11:28  gnome-shell[1412]: Failed to apply DRM plane transform 0: Invalid argument
Nov 16 12:11:28  gnome-shell[1412]: Failed to apply DRM plane transform 0: Invalid argument
Nov 16 12:11:28  gnome-shell[1412]: Failed to apply DRM plane transform 0: Invalid argument
Nov 16 12:11:28  kernel: nouveau 0000:01:00.0: disp: 0x000064a8[0]: INIT_GENERIC_CONDITON: unknown 0x07
Nov 16 12:11:29  kernel: nouveau 0000:01:00.0: disp: 0x00006671[0]: INIT_GENERIC_CONDITON: unknown 0x07
Nov 16 12:11:29  kernel: nouveau 0000:01:00.0: disp: 0x000064a8[0]: INIT_GENERIC_CONDITON: unknown 0x07
Nov 16 12:11:29  kernel: nouveau 0000:01:00.0: disp: 0x000064a8[0]: INIT_GENERIC_CONDITON: unknown 0x07
Nov 16 12:11:29  gsd-color[1783]: no xrandr-eDP-1 device found: Failed to find output xrandr-eDP-1



-----------------------



[   22.083936] nouveau 0000:01:00.0: disp: 0x000064a8[0]: INIT_GENERIC_CONDITON: unknown 0x07
[   22.245498] e1000e: enp0s31f6 NIC Link is Up 1000 Mbps Full Duplex, Flow Control: Rx/Tx
[   22.245601] IPv6: ADDRCONF(NETDEV_CHANGE): enp0s31f6: link becomes ready
[   22.323148] nouveau 0000:01:00.0: disp: 0x00006671[0]: INIT_GENERIC_CONDITON: unknown 0x07
[   22.461500] nouveau 0000:01:00.0: disp: 0x000064a8[0]: INIT_GENERIC_CONDITON: unknown 0x07
[   22.699517] nouveau 0000:01:00.0: disp: 0x000064a8[0]: INIT_GENERIC_CONDITON: unknown 0x07


------------------------


[   22.245601] IPv6: ADDRCONF(NETDEV_CHANGE): enp0s31f6: link becomes ready
[   22.323148] nouveau 0000:01:00.0: disp: 0x00006671[0]: INIT_GENERIC_CONDITON: unknown 0x07
[   22.461500] nouveau 0000:01:00.0: disp: 0x000064a8[0]: INIT_GENERIC_CONDITON: unknown 0x07
[   22.699517] nouveau 0000:01:00.0: disp: 0x000064a8[0]: INIT_GENERIC_CONDITON: unknown 0x07
[   23.050614] ------------[ cut here ]------------
[   23.050640] WARNING: CPU: 4 PID: 436 at drivers/gpu/drm/nouveau/include/nvkm/subdev/i2c.h:169 nvkm_dp_train_pattern+0x117/0x130 [nouveau]
[   23.050641] Modules linked in: xt_CHECKSUM ipt_MASQUERADE nf_nat_masquerade_ipv4 tun nf_conntrack_netbios_ns nf_conntrack_broadcast xt_CT ip6t_rpfilter ip6t_REJECT nf_reject_ipv6 xt_conntrack ip_set nfnetlink ebtable_nat ebtable_broute bridge stp llc ip6table_nat nf_conntrack_ipv6 nf_defrag_ipv6 nf_nat_ipv6 ip6table_mangle ip6table_raw ip6table_security iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 nf_nat nf_conntrack libcrc32c iptable_mangle iptable_raw iptable_security ebtable_filter ebtables ip6table_filter ip6_tables cmac binfmt_misc bnep sunrpc arc4 snd_hda_codec_hdmi iTCO_wdt iTCO_vendor_support mei_wdt intel_rapl x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel iwlmvm kvm mac80211 irqbypass intel_cstate snd_hda_codec_realtek intel_uncore snd_hda_codec_generic intel_rapl_perf
[   23.050661]  snd_hda_intel snd_hda_codec btusb snd_hda_core btrtl btbcm iwlwifi snd_hwdep btintel snd_seq bluetooth snd_seq_device snd_pcm thinkpad_acpi uvcvideo cfg80211 snd_timer videobuf2_vmalloc videobuf2_memops rtsx_pci_ms videobuf2_v4l2 videobuf2_core memstick wmi_bmof i2c_i801 videodev joydev snd mei_me media ecdh_generic mei soundcore intel_pch_thermal rfkill shpchp tpm_tis tpm_tis_core tpm dm_crypt hid_logitech_hidpp hid_logitech_dj rtsx_pci_sdmmc mmc_core nouveau crct10dif_pclmul crc32_pclmul crc32c_intel mxm_wmi ghash_clmulni_intel i2c_algo_bit drm_kms_helper e1000e ttm serio_raw drm ptp nvme pps_core rtsx_pci nvme_core wmi video
[   23.050681] CPU: 4 PID: 436 Comm: kworker/u16:3 Not tainted 4.13.10-200.fc26.x86_64 #1
[   23.050681] Hardware name: LENOVO 20EQS64N0B/20EQS64N0B, BIOS N1EET71W (1.44 ) 08/31/2017
[   23.050701] Workqueue: nvkm-disp gf119_disp_super [nouveau]
[   23.050702] task: ffff94077b04a6c0 task.stack: ffffbad643850000
[   23.050720] RIP: 0010:nvkm_dp_train_pattern+0x117/0x130 [nouveau]
[   23.050721] RSP: 0018:ffffbad643853c70 EFLAGS: 00010297
[   23.050722] RAX: 0000000000000000 RBX: ffff94077b76c800 RCX: 0000000000000000
[   23.050722] RDX: 0000000000000001 RSI: ffffbad64500e534 RDI: 0000000001009000
[   23.050723] RBP: ffffbad643853c98 R08: ffffbad643853c75 R09: ffffbad643853c77
[   23.050723] R10: 0000000000000000 R11: 0000000000000010 R12: 0000000000000002
[   23.050724] R13: ffff94077a8b4800 R14: 0000000000000000 R15: 0000000000000000
[   23.050724] FS:  0000000000000000(0000) GS:ffff9407a3d00000(0000) knlGS:0000000000000000
[   23.050725] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[   23.050726] CR2: 000055cd7cbad748 CR3: 000000076ae09000 CR4: 00000000003406e0
[   23.050726] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[   23.050727] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[   23.050727] Call Trace:
[   23.050744]  nvkm_dp_acquire+0xb1d/0xcd0 [nouveau]
[   23.050761]  nv50_disp_super_2_2+0x5d/0x470 [nouveau]
[   23.050774]  ? nvkm_devinit_pll_set+0xf/0x20 [nouveau]
[   23.050790]  gf119_disp_super+0x19c/0x2f0 [nouveau]
[   23.050793]  process_one_work+0x193/0x3c0
[   23.050794]  worker_thread+0x4a/0x3a0
[   23.050795]  kthread+0x125/0x140
[   23.050796]  ? process_one_work+0x3c0/0x3c0
[   23.050798]  ? kthread_park+0x60/0x60
[   23.050799]  ? do_syscall_64+0x67/0x140
[   23.050801]  ret_from_fork+0x25/0x30
[   23.050802] Code: 5d c3 4c 8d 4d df 4c 8d 45 dd b9 02 01 00 00 ba 09 00 00 00 be 01 00 00 00 4c 89 ef e8 13 96 fd ff 85 c0 75 08 80 7d df 01 74 02 <0f> ff 4c 89 ef e8 ff 93 fd ff e9 62 ff ff ff e8 25 c1 da c1 0f 
[   23.050819] ---[ end trace 3eed2fc104e1faf5 ]---
[   23.051361] nouveau 0000:01:00.0: disp: outp 00:0006:0f44: training failed
[   24.018812] nouveau 0000:01:00.0: disp: 0x00006671[0]: INIT_GENERIC_CONDITON: unknown 0x07
[   24.059747] nouveau 0000:01:00.0: disp: 0x000064a8[0]: INIT_GENERIC_CONDITON: unknown 0x07
[   24.106100] nouveau 0000:01:00.0: disp: 0x000064a8[0]: INIT_GENERIC_CONDITON: unknown 0x07
[   24.152434] nouveau 0000:01:00.0: disp: 0x000064a8[0]: INIT_GENERIC_CONDITON: unknown 0x07
[   31.238301] systemd-journald[822]: File /var/log/journal/2d527a653933486b8f8b825accf05f57/user-1000.journal corrupted or uncleanly shut down, renaming and replacing.
[   31.665868] fuse init (API version 7.26)
[   33.077090] Bluetooth: RFCOMM TTY layer initialized
[   33.077101] Bluetooth: RFCOMM socket layer initialized
[   33.077144] Bluetooth: RFCOMM ver 1.11
[   33.846061] rfkill: input handler disabled
[   37.288120] logitech-hidpp-device 0003:046D:401B.0006: HID++ 2.0 device connected.
[  753.899412] nouveau 0000:01:00.0: disp: 0x000064a8[0]: INIT_GENERIC_CONDITON: unknown 0x07
[  753.942957] nouveau 0000:01:00.0: disp: 0x000064a8[0]: INIT_GENERIC_CONDITON: unknown 0x07
[  754.120261] nouveau 0000:01:00.0: disp: 0x000064a8[0]: INIT_GENERIC_CONDITON: unknown 0x07
[  754.366448] nouveau 0000:01:00.0: disp: 0x00006671[0]: INIT_GENERIC_CONDITON: unknown 0x07
[ 6910.378331] nouveau 0000:01:00.0: disp: 0x000064a8[0]: INIT_GENERIC_CONDITON: unknown 0x07
[ 6910.568186] nouveau 0000:01:00.0: disp: 0x000064a8[0]: INIT_GENERIC_CONDITON: unknown 0x07
[ 6910.879628] nouveau 0000:01:00.0: disp: 0x000064a8[0]: INIT_GENERIC_CONDITON: unknown 0x07
[ 6911.123081] nouveau 0000:01:00.0: disp: 0x00006671[0]: INIT_GENERIC_CONDITON: unknown 0x07
[ 6911.577347] ------------[ cut here ]------------
[ 6911.577405] WARNING: CPU: 5 PID: 7853 at drivers/gpu/drm/nouveau/include/nvkm/subdev/i2c.h:169 nvkm_dp_train_sense+0xd9/0x200 [nouveau]
[ 6911.577405] Modules linked in: rfcomm fuse xt_CHECKSUM ipt_MASQUERADE nf_nat_masquerade_ipv4 tun nf_conntrack_netbios_ns nf_conntrack_broadcast xt_CT ip6t_rpfilter ip6t_REJECT nf_reject_ipv6 xt_conntrack ip_set nfnetlink ebtable_nat ebtable_broute bridge stp llc ip6table_nat nf_conntrack_ipv6 nf_defrag_ipv6 nf_nat_ipv6 ip6table_mangle ip6table_raw ip6table_security iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 nf_nat nf_conntrack libcrc32c iptable_mangle iptable_raw iptable_security ebtable_filter ebtables ip6table_filter ip6_tables cmac binfmt_misc bnep sunrpc arc4 snd_hda_codec_hdmi iTCO_wdt iTCO_vendor_support mei_wdt intel_rapl x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel iwlmvm kvm mac80211 irqbypass intel_cstate snd_hda_codec_realtek intel_uncore snd_hda_codec_generic
[ 6911.577427]  intel_rapl_perf snd_hda_intel snd_hda_codec btusb snd_hda_core btrtl btbcm iwlwifi snd_hwdep btintel snd_seq bluetooth snd_seq_device snd_pcm thinkpad_acpi uvcvideo cfg80211 snd_timer videobuf2_vmalloc videobuf2_memops rtsx_pci_ms videobuf2_v4l2 videobuf2_core memstick wmi_bmof i2c_i801 videodev joydev snd mei_me media ecdh_generic mei soundcore intel_pch_thermal rfkill shpchp tpm_tis tpm_tis_core tpm dm_crypt hid_logitech_hidpp hid_logitech_dj rtsx_pci_sdmmc mmc_core nouveau crct10dif_pclmul crc32_pclmul crc32c_intel mxm_wmi ghash_clmulni_intel i2c_algo_bit drm_kms_helper e1000e ttm serio_raw drm ptp nvme pps_core rtsx_pci nvme_core wmi video
[ 6911.577449] CPU: 5 PID: 7853 Comm: kworker/u16:5 Tainted: G        W       4.13.10-200.fc26.x86_64 #1
[ 6911.577449] Hardware name: LENOVO 20EQS64N0B/20EQS64N0B, BIOS N1EET71W (1.44 ) 08/31/2017
[ 6911.577469] Workqueue: nvkm-disp gf119_disp_super [nouveau]
[ 6911.577470] task: ffff940712ab0000 task.stack: ffffbad64a9a4000
[ 6911.577488] RIP: 0010:nvkm_dp_train_sense+0xd9/0x200 [nouveau]
[ 6911.577489] RSP: 0018:ffffbad64a9a7c58 EFLAGS: 00010297
[ 6911.577490] RAX: 0000000000000000 RBX: ffff94077a8b4800 RCX: 0000000000000000
[ 6911.577490] RDX: 0000000000000006 RSI: ffffbad64500e534 RDI: 0000000001009005
[ 6911.577491] RBP: ffffbad64a9a7c98 R08: ffffbad64a9a7d40 R09: ffffbad64a9a7c66
[ 6911.577491] R10: 0000000000000000 R11: 0000000000000010 R12: ffff94077b76c800
[ 6911.577492] R13: ffffbad64a9a7d38 R14: 0000000000000000 R15: 0000000000000000
[ 6911.577493] FS:  0000000000000000(0000) GS:ffff9407a3d40000(0000) knlGS:0000000000000000
[ 6911.577493] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 6911.577494] CR2: 000056255d3efb18 CR3: 000000076ae09000 CR4: 00000000003406e0
[ 6911.577495] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 6911.577495] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[ 6911.577496] Call Trace:
[ 6911.577513]  nvkm_dp_acquire+0x587/0xcd0 [nouveau]
[ 6911.577531]  nv50_disp_super_2_2+0x5d/0x470 [nouveau]
[ 6911.577534]  ? pick_next_task_fair+0x137/0x550
[ 6911.577536]  ? __switch_to+0x1fc/0x4a0
[ 6911.577552]  gf119_disp_super+0x19c/0x2f0 [nouveau]
[ 6911.577554]  process_one_work+0x193/0x3c0
[ 6911.577555]  worker_thread+0x4a/0x3a0
[ 6911.577556]  kthread+0x125/0x140
[ 6911.577557]  ? process_one_work+0x3c0/0x3c0
[ 6911.577559]  ? kthread_park+0x60/0x60
[ 6911.577560]  ? kthread_park+0x60/0x60
[ 6911.577562]  ret_from_fork+0x25/0x30
[ 6911.577563] Code: b9 02 02 00 00 ba 09 00 00 00 be 01 00 00 00 48 89 df 49 89 c0 48 89 45 c0 e8 04 92 fd ff 85 c0 41 89 c7 75 5d 80 7d ce 06 74 02 <0f> ff 48 89 df e8 ed 8f fd ff 45 84 f6 75 55 49 8b 44 24 08 83 
[ 6911.577580] ---[ end trace 3eed2fc104e1faf6 ]---
[ 6911.577733] nouveau 0000:01:00.0: disp: outp 00:0006:0f44: training failed
[ 6912.191101] nouveau 0000:01:00.0: disp: 0x000064a8[0]: INIT_GENERIC_CONDITON: unknown 0x07
[ 6912.226312] nouveau 0000:01:00.0: disp: 0x000064a8[0]: INIT_GENERIC_CONDITON: unknown 0x07
[ 6912.814793] nouveau 0000:01:00.0: disp: 0x000064a8[0]: INIT_GENERIC_CONDITON: unknown 0x07
[11733.067864] nouveau 0000:01:00.0: disp: 0x000064a8[0]: INIT_GENERIC_CONDITON: unknown 0x07
[11733.177269] nouveau 0000:01:00.0: disp: 0x000064a8[0]: INIT_GENERIC_CONDITON: unknown 0x07
[11733.349730] nouveau 0000:01:00.0: disp: 0x000064a8[0]: INIT_GENERIC_CONDITON: unknown 0x07
[11733.598189] nouveau 0000:01:00.0: disp: 0x00006671[0]: INIT_GENERIC_CONDITON: unknown 0x07

-----------------------------------------------------



Oct 23 14:56:17 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: read fault at 470da27000 engine 00 [GR] client 0d [GPC0/GCC] reason 00 [PDE] on channel 20 [00fd8b5000 Xorg[1972]]
Oct 23 14:56:17 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: channel 20: killed
Oct 23 14:56:17 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: runlist 0: scheduled for recovery
Oct 23 14:56:17 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: engine 0: scheduled for recovery
Oct 23 14:56:17 localhost.localdomain kernel: nouveau 0000:01:00.0: Xorg[1972]: channel 20 killed!
Oct 23 14:56:27 localhost.localdomain kernel: [drm:drm_atomic_helper_swap_state [drm_kms_helper]] *ERROR* [CRTC:38:head-0] hw_done timed out
Oct 23 14:56:38 localhost.localdomain kernel: [drm:drm_atomic_helper_wait_for_dependencies [drm_kms_helper]] *ERROR* [CRTC:38:head-0] hw_done timed out
Oct 23 14:56:48 localhost.localdomain kernel: [drm:drm_atomic_helper_wait_for_dependencies [drm_kms_helper]] *ERROR* [CRTC:38:head-0] flip_done timed out
Oct 23 14:56:48 localhost.localdomain systemd[1]: Starting Cleanup of Temporary Directories...
Oct 23 14:56:48 localhost.localdomain systemd[1]: Started Cleanup of Temporary Directories.
Oct 23 14:56:48 localhost.localdomain audit[1]: SERVICE_START pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=systemd-tmpfiles-clean comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? ter
Oct 23 14:56:48 localhost.localdomain audit[1]: SERVICE_STOP pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=systemd-tmpfiles-clean comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? term
Oct 23 14:56:58 localhost.localdomain kernel: [drm:drm_atomic_helper_swap_state [drm_kms_helper]] *ERROR* [CRTC:38:head-0] hw_done timed out

Comment 15 Brian Kaye 2017-11-18 02:54:40 UTC
I don't have a docking station. I can go several days without one and then have a couple within a couple of minutes. What we need is some way to get a trace when it starts. The cursor freezes, sound if any continues for a few seconds then total hang. No keyboard input at all is recognized.

Comment 16 Will Newton 2017-11-20 10:18:43 UTC
I don't have a docking station either. The crashes are still ongoing:

Nov 20 10:14:30 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: TRAP ch 13 [00ff817000 systemd-logind[1121]]
Nov 20 10:14:30 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/PROP trap: 00000100 [RT_STORAGE_TYPE_MISMATCH] x = 3832, y = 2054, format = 2a, storage type = 17
Nov 20 10:14:30 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: TRAP ch 13 [00ff817000 systemd-logind[1121]]
Nov 20 10:14:30 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC0/TEX: 80000041
Nov 20 10:14:30 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC1/TEX: 80000041
Nov 20 10:14:30 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC2/TEX: 80000041
Nov 20 10:14:30 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC3/TEX: 80000041
Nov 20 10:14:30 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC4/TEX: 80000041
Nov 20 10:14:30 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: read fault at 0006946000 engine 00 [GR] client 07 [GPC0/T1_2] reason 02 [PTE] on channel 13 [00ff817000 systemd-logind[1121]]
Nov 20 10:14:30 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: channel 13: killed
Nov 20 10:14:30 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: runlist 0: scheduled for recovery
Nov 20 10:14:30 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: engine 0: scheduled for recovery
Nov 20 10:14:30 localhost.localdomain kernel: nouveau 0000:01:00.0: fifo: engine 5: scheduled for recovery
Nov 20 10:14:30 localhost.localdomain kernel: nouveau 0000:01:00.0: systemd-logind[1121]: channel 13 killed!

Comment 17 Brian Kaye 2017-11-20 14:46:13 UTC
Is there a way to change the severity of this bug? Since its currently "unspecified" perhaps the Red Hat folks are not paying attention.

Comment 18 Will Newton 2017-11-20 14:51:04 UTC
I created an upstream bug report here: https://bugs.freedesktop.org/show_bug.cgi?id=103721

Comment 19 Peter Larsen 2017-11-20 21:45:19 UTC
(In reply to Will Newton from comment #18)
> I created an upstream bug report here:
> https://bugs.freedesktop.org/show_bug.cgi?id=103721

Looks like this is an old and unresolved issue with no traction? https://bugs.freedesktop.org/show_bug.cgi?id=100567

I've got this issue on several platforms. Not just Lenovo.

Comment 20 Will Newton 2017-11-21 09:52:24 UTC
I'm not so sure. The bug here seems to be characterized by the read fault PTE message (which seems present in almost all the traces) and the bug you referenced seems to cause a CTXSW_TIMEOUT message which is not seen in any of these traces.

That said I don't have any knowledge of the driver architecture so if someone with that knowledge thinks they are the same issue then they should be merged.

Comment 22 Jim Scarborough 2018-01-30 13:03:41 UTC
This may be related to or the same as bug 1527669.

Comment 23 Will Newton 2018-02-19 10:38:10 UTC
This issue is still present with kernel-4.15.3-300.fc27.x86_64

The below upstream issue suggests updating Mesa may help, although I haven't had chance to try that: https://bugs.freedesktop.org/show_bug.cgi?id=105045

Comment 24 Will Newton 2018-02-19 16:00:07 UTC
I've seen a lot of lockups today (> 10) which is making it very hard to use this laptop with Fedora. The logs when the lockup happens seem to have changed, for example:

Feb 19 15:47:24 localhost.localdomain kernel: swiotlb_tbl_map_single: 63 callbacks suppressed
Feb 19 15:47:24 localhost.localdomain kernel: nouveau 0000:01:00.0: swiotlb buffer is full (sz: 2097152 bytes)
Feb 19 15:47:24 localhost.localdomain kernel: swiotlb: coherent allocation failed for device 0000:01:00.0 size=2097152
Feb 19 15:47:24 localhost.localdomain kernel: CPU: 7 PID: 1866 Comm: gnome-shell Not tainted 4.15.3-300.fc27.x86_64 #1
Feb 19 15:47:24 localhost.localdomain kernel: Hardware name: LENOVO 20EN0007UK/20EN0007UK, BIOS N1EET73W (1.46 ) 09/28/2017
Feb 19 15:47:24 localhost.localdomain kernel: Call Trace:
Feb 19 15:47:24 localhost.localdomain kernel:  dump_stack+0x5c/0x85
Feb 19 15:47:24 localhost.localdomain kernel:  swiotlb_alloc_coherent+0xe0/0x150
Feb 19 15:47:24 localhost.localdomain kernel:  ttm_dma_pool_get_pages+0x20e/0x5e0 [ttm]
Feb 19 15:47:24 localhost.localdomain kernel:  ttm_dma_populate+0x24d/0x340 [ttm]
Feb 19 15:47:24 localhost.localdomain kernel:  ttm_tt_bind+0x29/0x60 [ttm]
Feb 19 15:47:24 localhost.localdomain kernel:  ttm_bo_handle_move_mem+0x5da/0x610 [ttm]
Feb 19 15:47:24 localhost.localdomain kernel:  ttm_bo_validate+0x135/0x150 [ttm]
Feb 19 15:47:24 localhost.localdomain kernel:  ttm_bo_init_reserved+0x385/0x430 [ttm]
Feb 19 15:47:24 localhost.localdomain kernel:  ttm_bo_init+0x2f/0x90 [ttm]
Feb 19 15:47:24 localhost.localdomain kernel:  ? nouveau_bo_invalidate_caches+0x10/0x10 [nouveau]
Feb 19 15:47:24 localhost.localdomain kernel:  ? _cond_resched+0x15/0x40
Feb 19 15:47:24 localhost.localdomain kernel:  nouveau_bo_new+0x416/0x590 [nouveau]
Feb 19 15:47:24 localhost.localdomain kernel:  ? nouveau_bo_invalidate_caches+0x10/0x10 [nouveau]
Feb 19 15:47:24 localhost.localdomain kernel:  ? nouveau_gem_new+0x120/0x120 [nouveau]
Feb 19 15:47:24 localhost.localdomain kernel:  nouveau_gem_new+0x5d/0x120 [nouveau]
Feb 19 15:47:24 localhost.localdomain kernel:  nouveau_gem_ioctl_new+0x51/0xd0 [nouveau]
Feb 19 15:47:24 localhost.localdomain kernel:  drm_ioctl_kernel+0x5b/0xb0 [drm]
Feb 19 15:47:24 localhost.localdomain kernel:  drm_ioctl+0x2d5/0x370 [drm]
Feb 19 15:47:24 localhost.localdomain kernel:  ? nouveau_gem_new+0x120/0x120 [nouveau]
Feb 19 15:47:24 localhost.localdomain kernel:  nouveau_drm_ioctl+0x64/0xc0 [nouveau]
Feb 19 15:47:24 localhost.localdomain kernel:  do_vfs_ioctl+0xa4/0x620
Feb 19 15:47:24 localhost.localdomain kernel:  SyS_ioctl+0x74/0x80
Feb 19 15:47:24 localhost.localdomain kernel:  do_syscall_64+0x75/0x180
Feb 19 15:47:24 localhost.localdomain kernel:  entry_SYSCALL_64_after_hwframe+0x21/0x86
Feb 19 15:47:24 localhost.localdomain kernel: RIP: 0033:0x7f3f9310b8e7
Feb 19 15:47:24 localhost.localdomain kernel: RSP: 002b:00007ffccc1e0ea8 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
Feb 19 15:47:24 localhost.localdomain kernel: RAX: ffffffffffffffda RBX: 000055a973cc1800 RCX: 00007f3f9310b8e7
Feb 19 15:47:24 localhost.localdomain kernel: RDX: 00007ffccc1e0f00 RSI: 00000000c0306480 RDI: 000000000000000c
Feb 19 15:47:24 localhost.localdomain kernel: RBP: 00007ffccc1e0f00 R08: 0000000000000004 R09: 0000000000000006
Feb 19 15:47:24 localhost.localdomain kernel: R10: ffffffffffffffb0 R11: 0000000000000246 R12: 00000000c0306480
Feb 19 15:47:24 localhost.localdomain kernel: R13: 000000000000000c R14: 000055a974074748 R15: 000055a970e74950
Feb 19 15:47:27 localhost.localdomain kernel: nouveau 0000:01:00.0: swiotlb buffer is full (sz: 2097152 bytes)
Feb 19 15:47:27 localhost.localdomain kernel: swiotlb: coherent allocation failed for device 0000:01:00.0 size=2097152
Feb 19 15:47:27 localhost.localdomain kernel: CPU: 5 PID: 1866 Comm: gnome-shell Not tainted 4.15.3-300.fc27.x86_64 #1
Feb 19 15:47:27 localhost.localdomain kernel: Hardware name: LENOVO 20EN0007UK/20EN0007UK, BIOS N1EET73W (1.46 ) 09/28/2017
Feb 19 15:47:27 localhost.localdomain kernel: Call Trace:
Feb 19 15:47:27 localhost.localdomain kernel:  dump_stack+0x5c/0x85
Feb 19 15:47:27 localhost.localdomain kernel:  swiotlb_alloc_coherent+0xe0/0x150
Feb 19 15:47:27 localhost.localdomain kernel:  ttm_dma_pool_get_pages+0x20e/0x5e0 [ttm]
Feb 19 15:47:27 localhost.localdomain kernel:  ttm_dma_populate+0x24d/0x340 [ttm]
Feb 19 15:47:27 localhost.localdomain kernel:  ttm_tt_bind+0x29/0x60 [ttm]
Feb 19 15:47:27 localhost.localdomain kernel:  ttm_bo_handle_move_mem+0x5da/0x610 [ttm]
Feb 19 15:47:27 localhost.localdomain kernel:  ttm_bo_validate+0x135/0x150 [ttm]
Feb 19 15:47:27 localhost.localdomain kernel:  ttm_bo_init_reserved+0x385/0x430 [ttm]
Feb 19 15:47:27 localhost.localdomain kernel:  ttm_bo_init+0x2f/0x90 [ttm]
Feb 19 15:47:27 localhost.localdomain kernel:  ? nouveau_bo_invalidate_caches+0x10/0x10 [nouveau]
Feb 19 15:47:27 localhost.localdomain kernel:  ? _cond_resched+0x15/0x40
Feb 19 15:47:27 localhost.localdomain kernel:  nouveau_bo_new+0x416/0x590 [nouveau]
Feb 19 15:47:27 localhost.localdomain kernel:  ? nouveau_bo_invalidate_caches+0x10/0x10 [nouveau]
Feb 19 15:47:27 localhost.localdomain kernel:  ? nouveau_gem_new+0x120/0x120 [nouveau]
Feb 19 15:47:27 localhost.localdomain kernel:  nouveau_gem_new+0x5d/0x120 [nouveau]
Feb 19 15:47:27 localhost.localdomain kernel:  nouveau_gem_ioctl_new+0x51/0xd0 [nouveau]
Feb 19 15:47:27 localhost.localdomain kernel:  drm_ioctl_kernel+0x5b/0xb0 [drm]
Feb 19 15:47:27 localhost.localdomain kernel:  drm_ioctl+0x2d5/0x370 [drm]
Feb 19 15:47:27 localhost.localdomain kernel:  ? nouveau_gem_new+0x120/0x120 [nouveau]
Feb 19 15:47:27 localhost.localdomain kernel:  nouveau_drm_ioctl+0x64/0xc0 [nouveau]
Feb 19 15:47:27 localhost.localdomain kernel:  do_vfs_ioctl+0xa4/0x620
Feb 19 15:47:27 localhost.localdomain kernel:  SyS_ioctl+0x74/0x80
Feb 19 15:47:27 localhost.localdomain kernel:  do_syscall_64+0x75/0x180
Feb 19 15:47:27 localhost.localdomain kernel:  entry_SYSCALL_64_after_hwframe+0x21/0x86
Feb 19 15:47:27 localhost.localdomain kernel: RIP: 0033:0x7f3f9310b8e7
Feb 19 15:47:27 localhost.localdomain kernel: RSP: 002b:00007ffccc1e0ea8 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
Feb 19 15:47:27 localhost.localdomain kernel: RAX: ffffffffffffffda RBX: 000055a973cc1800 RCX: 00007f3f9310b8e7
Feb 19 15:47:27 localhost.localdomain kernel: RDX: 00007ffccc1e0f00 RSI: 00000000c0306480 RDI: 000000000000000c
Feb 19 15:47:27 localhost.localdomain kernel: RBP: 00007ffccc1e0f00 R08: 0000000000000004 R09: 0000000000000006
Feb 19 15:47:27 localhost.localdomain kernel: R10: ffffffffffffffb0 R11: 0000000000000246 R12: 00000000c0306480
Feb 19 15:47:27 localhost.localdomain kernel: R13: 000000000000000c R14: 000055a97383b538 R15: 000055a970e74950
Feb 19 15:47:28 localhost.localdomain kernel: nouveau 0000:01:00.0: swiotlb buffer is full (sz: 2097152 bytes)
Feb 19 15:47:28 localhost.localdomain kernel: swiotlb: coherent allocation failed for device 0000:01:00.0 size=2097152
Feb 19 15:47:28 localhost.localdomain kernel: CPU: 3 PID: 1866 Comm: gnome-shell Not tainted 4.15.3-300.fc27.x86_64 #1
Feb 19 15:47:28 localhost.localdomain kernel: Hardware name: LENOVO 20EN0007UK/20EN0007UK, BIOS N1EET73W (1.46 ) 09/28/2017
Feb 19 15:47:28 localhost.localdomain kernel: Call Trace:
Feb 19 15:47:28 localhost.localdomain kernel:  dump_stack+0x5c/0x85
Feb 19 15:47:28 localhost.localdomain kernel:  swiotlb_alloc_coherent+0xe0/0x150
Feb 19 15:47:28 localhost.localdomain kernel:  ttm_dma_pool_get_pages+0x20e/0x5e0 [ttm]
Feb 19 15:47:28 localhost.localdomain kernel:  ttm_dma_populate+0x24d/0x340 [ttm]
Feb 19 15:47:28 localhost.localdomain kernel:  ttm_tt_bind+0x29/0x60 [ttm]
Feb 19 15:47:28 localhost.localdomain kernel:  ttm_bo_handle_move_mem+0x5da/0x610 [ttm]
Feb 19 15:47:28 localhost.localdomain kernel:  ttm_bo_validate+0x135/0x150 [ttm]
Feb 19 15:47:28 localhost.localdomain kernel:  ttm_bo_init_reserved+0x385/0x430 [ttm]
Feb 19 15:47:28 localhost.localdomain kernel:  ttm_bo_init+0x2f/0x90 [ttm]
Feb 19 15:47:28 localhost.localdomain kernel:  ? nouveau_bo_invalidate_caches+0x10/0x10 [nouveau]
Feb 19 15:47:28 localhost.localdomain kernel:  ? _cond_resched+0x15/0x40
Feb 19 15:47:28 localhost.localdomain kernel:  nouveau_bo_new+0x416/0x590 [nouveau]
Feb 19 15:47:28 localhost.localdomain kernel:  ? nouveau_bo_invalidate_caches+0x10/0x10 [nouveau]
Feb 19 15:47:28 localhost.localdomain kernel:  ? nouveau_gem_new+0x120/0x120 [nouveau]
Feb 19 15:47:28 localhost.localdomain kernel:  nouveau_gem_new+0x5d/0x120 [nouveau]
Feb 19 15:47:28 localhost.localdomain kernel:  nouveau_gem_ioctl_new+0x51/0xd0 [nouveau]
Feb 19 15:47:28 localhost.localdomain kernel:  drm_ioctl_kernel+0x5b/0xb0 [drm]
Feb 19 15:47:28 localhost.localdomain kernel:  drm_ioctl+0x2d5/0x370 [drm]
Feb 19 15:47:28 localhost.localdomain kernel:  ? nouveau_gem_new+0x120/0x120 [nouveau]
Feb 19 15:47:28 localhost.localdomain kernel:  nouveau_drm_ioctl+0x64/0xc0 [nouveau]
Feb 19 15:47:28 localhost.localdomain kernel:  do_vfs_ioctl+0xa4/0x620
Feb 19 15:47:28 localhost.localdomain kernel:  SyS_ioctl+0x74/0x80
Feb 19 15:47:28 localhost.localdomain kernel:  do_syscall_64+0x75/0x180
Feb 19 15:47:28 localhost.localdomain kernel:  entry_SYSCALL_64_after_hwframe+0x21/0x86
Feb 19 15:47:28 localhost.localdomain kernel: RIP: 0033:0x7f3f9310b8e7
Feb 19 15:47:28 localhost.localdomain kernel: RSP: 002b:00007ffccc1e0ea8 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
Feb 19 15:47:28 localhost.localdomain kernel: RAX: ffffffffffffffda RBX: 000055a973cc1800 RCX: 00007f3f9310b8e7
Feb 19 15:47:28 localhost.localdomain kernel: RDX: 00007ffccc1e0f00 RSI: 00000000c0306480 RDI: 000000000000000c
Feb 19 15:47:28 localhost.localdomain kernel: RBP: 00007ffccc1e0f00 R08: 0000000000000004 R09: 0000000000000006
Feb 19 15:47:28 localhost.localdomain kernel: R10: ffffffffffffffb0 R11: 0000000000000246 R12: 00000000c0306480
Feb 19 15:47:28 localhost.localdomain kernel: R13: 000000000000000c R14: 000055a9740e4ad8 R15: 000055a970e74950

And on previous boots I also saw:

Feb 19 15:23:30 localhost.localdomain kernel: nouveau 0000:01:00.0: Xwayland[2077]: nv50cal_space: -16
Feb 19 15:23:30 localhost.localdomain kernel: nouveau 0000:01:00.0: Xwayland[2077]: nv50cal_space: -16

So this may be several issues or just one, I'm not sure. Either way a fix or a workaround would be extremely valuable.

Comment 25 Will Newton 2018-02-20 10:15:22 UTC
The problem persists with Mesa 17.3.4 installed.

Comment 26 Jim Scarborough 2018-02-26 16:24:01 UTC
4.14.18-300.fc27.x86_64 is working better for me.  I have been able to get the external display on a docking station to come on from time to time (docking while active (not suspended), I think) and I've had uptime of 11 days so far, knock on PCB.

Comment 27 Will Newton 2018-02-26 16:31:37 UTC
I have found Xorg is much more stable than Wayland at the moment with this driver, ~1 crash per day versus >10 crashes per day, depending on workload. However when Xorg crashes I don't see anything in the kernel logs, so I'm not sure if the issue is the same or not.

Did you upgrade or downgrade your kernel to that revision? I am still seeing crashes with the 4.15 kernel in Fedora.

Comment 28 Brian Kaye 2018-03-07 00:27:39 UTC
I switched to the nvidia drivers a week or so ago and have not had a single freeze-up. Got tired of fighting.Unfortunately the drivers are not signed so you have to disable secure boot. Running kernel 4.15.6-200.fc26.x86_64

Comment 29 Peter Larsen 2018-03-07 19:59:14 UTC
(In reply to Brian Kaye from comment #28)
> I switched to the nvidia drivers a week or so ago and have not had a single
> freeze-up. Got tired of fighting.Unfortunately the drivers are not signed so
> you have to disable secure boot. Running kernel 4.15.6-200.fc26.x86_64

I've done that on the desktop where I have an NVidia card in, and it too resolved ALL freezes immediately.

Comment 30 Stefano Biagiotti 2018-04-30 17:06:04 UTC
Created attachment 1428905 [details]
Output of journalctl -k -b -1 --no-pager --no-hostname

Same here although on Fedora 27 and different hardware.

Display adapter is (from lspci -nn):
01:00.0 VGA compatible controller [0300]: NVIDIA Corporation GT215 [GeForce GT 320] [10de:0ca2] (rev a2)

Packages are kernel-4.16.4-200.fc27.x86_64 and xorg-x11-drv-nouveau-1.0.15-3.fc27.x86_64.

Comment 31 Fedora End Of Life 2018-05-03 07:57:43 UTC
This message is a reminder that Fedora 26 is nearing its end of life.
Approximately 4 (four) weeks from now Fedora will stop maintaining
and issuing updates for Fedora 26. It is Fedora's policy to close all
bug reports from releases that are no longer maintained. At that time
this bug will be closed as EOL if it remains open with a Fedora  'version'
of '26'.

Package Maintainer: If you wish for this bug to remain open because you
plan to fix it in a currently maintained version, simply change the 'version'
to a later Fedora version.

Thank you for reporting this issue and we are sorry that we were not
able to fix it before Fedora 26 is end of life. If you would still like
to see this bug fixed and are able to reproduce it against a later version
of Fedora, you are encouraged  change the 'version' to a later Fedora
version prior this bug is closed as described in the policy above.

Although we aim to fix as many bugs as possible during every release's
lifetime, sometimes those efforts are overtaken by events. Often a
more recent Fedora release includes newer upstream software that fixes
bugs or makes them obsolete.

Comment 32 Corey Ashford 2018-05-03 08:29:34 UTC
(In reply to Fedora End Of Life from comment #31)
> This message is a reminder that Fedora 26 is nearing its end of life.

This bug should be updated to at least Fedora 27, as it's still occurring.

Comment 33 Will Newton 2018-05-03 08:40:45 UTC
I can confirm this issue is still present in F27, I haven't tried F28 yet.

Comment 34 Will Newton 2018-05-09 12:17:48 UTC
Still present in F28.

May 09 13:10:48 localhost.localdomain kernel: nouveau 0000:01:00.0: swiotlb buffer is full (sz: 2097152 bytes)
May 09 13:10:48 localhost.localdomain kernel: nouveau 0000:01:00.0: swiotlb: coherent allocation failed, size=2097152
May 09 13:10:48 localhost.localdomain kernel: CPU: 4 PID: 1966 Comm: Xorg Not tainted 4.16.6-302.fc28.x86_64 #1
May 09 13:10:48 localhost.localdomain kernel: Hardware name: LENOVO 20EN0007UK/20EN0007UK, BIOS N1EET73W (1.46 ) 09/28/2017
May 09 13:10:48 localhost.localdomain kernel: Call Trace:
May 09 13:10:48 localhost.localdomain kernel:  dump_stack+0x5c/0x85
May 09 13:10:48 localhost.localdomain kernel:  swiotlb_alloc_coherent+0x1c3/0x1e0
May 09 13:10:48 localhost.localdomain kernel:  ttm_dma_pool_get_pages+0x21a/0x620 [ttm]
May 09 13:10:48 localhost.localdomain kernel:  ttm_dma_populate+0xdd/0x390 [ttm]
May 09 13:10:48 localhost.localdomain kernel:  ttm_tt_bind+0x2e/0x60 [ttm]
May 09 13:10:48 localhost.localdomain kernel:  ttm_bo_handle_move_mem+0x4cd/0x530 [ttm]
May 09 13:10:48 localhost.localdomain kernel:  ttm_bo_validate+0x119/0x130 [ttm]
May 09 13:10:48 localhost.localdomain kernel:  ? drm_add_edid_modes+0x1046/0x1840 [drm]
May 09 13:10:48 localhost.localdomain kernel:  ttm_bo_init_reserved+0x334/0x380 [ttm]
May 09 13:10:48 localhost.localdomain kernel:  ? ttm_bo_init+0x62/0xd0 [ttm]
May 09 13:10:48 localhost.localdomain kernel:  ? nouveau_bo_invalidate_caches+0x10/0x10 [nouveau]
May 09 13:10:48 localhost.localdomain kernel:  ? nouveau_bo_new+0x401/0x580 [nouveau]
May 09 13:10:48 localhost.localdomain kernel:  ? nouveau_bo_invalidate_caches+0x10/0x10 [nouveau]
May 09 13:10:48 localhost.localdomain kernel:  ? nouveau_gem_new+0x120/0x120 [nouveau]
May 09 13:10:48 localhost.localdomain kernel:  ? nouveau_gem_new+0x5d/0x120 [nouveau]
May 09 13:10:48 localhost.localdomain kernel:  ? nouveau_gem_ioctl_new+0x53/0xe0 [nouveau]
May 09 13:10:48 localhost.localdomain kernel:  ? drm_ioctl_kernel+0x5b/0xb0 [drm]
May 09 13:10:48 localhost.localdomain kernel:  ? drm_ioctl+0x1c0/0x380 [drm]
May 09 13:10:48 localhost.localdomain kernel:  ? nouveau_gem_new+0x120/0x120 [nouveau]
May 09 13:10:48 localhost.localdomain kernel:  ? nouveau_drm_ioctl+0x65/0xc0 [nouveau]
May 09 13:10:48 localhost.localdomain kernel:  ? do_vfs_ioctl+0xa4/0x610
May 09 13:10:48 localhost.localdomain kernel:  ? SyS_ioctl+0x74/0x80
May 09 13:10:48 localhost.localdomain kernel:  ? do_syscall_64+0x74/0x180
May 09 13:10:48 localhost.localdomain kernel:  ? entry_SYSCALL_64_after_hwframe+0x3d/0xa2
May 09 13:13:49 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: TRAP ch 13 [00fe117000 Xorg[1966]]
May 09 13:13:49 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC0/TEX: 80000009
May 09 13:13:49 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC1/TEX: 80000009
May 09 13:13:49 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC2/TEX: 80000009
May 09 13:13:49 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: TRAP ch 13 [00fe117000 Xorg[1966]]
May 09 13:13:49 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC1/TEX: 80000000
May 09 13:13:49 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC2/TEX: 80000009
May 09 13:13:49 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: TRAP ch 13 [00fe117000 Xorg[1966]]

Comment 35 Will Newton 2018-06-06 09:39:24 UTC
FWIW this is still present and locking up regularly in 4.16.13-300.fc28.x86_64:

Jun 06 10:15:50 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: TRAP ch 13 [00fe117000 Xorg[1930]]
Jun 06 10:15:50 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC0/TEX: 80000009
Jun 06 10:15:50 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC2/TEX: 80000000
Jun 06 10:15:50 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: TRAP ch 13 [00fe117000 Xorg[1930]]
Jun 06 10:15:50 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC1/TEX: 80000000
Jun 06 10:15:50 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC2/TEX: 80000009
Jun 06 10:15:50 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: TRAP ch 13 [00fe117000 Xorg[1930]]
Jun 06 10:15:50 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC1/TEX: 80000009
Jun 06 10:15:50 localhost.localdomain kernel: nouveau 0000:01:00.0: gr: GPC0/TPC2/TEX: 80000009

Comment 36 Mario Fusco 2018-06-10 15:45:22 UTC
I am experiencing the same also on F28. A couple of times it happened today while listening music with VLC. Not sure if this is related, but I wasn't doing any other relevant activity other than browsing.

Comment 37 Jim Scarborough 2018-07-09 16:33:48 UTC
4.14.18-300.fc27.x86_64 was substantially more reliable than 4.17.3-200.fc28.x86_64 which has been crashing on me several times a day.  

It may also be of note that Chrome gets messed up, with the tabs, location bar, and bookmarks bar getting obscured by large black rectangles sometimes with some random graphics.  I can restart the Chrome window to fix it.

I have seen a similar failure in the KDE bar which shows clock, icons, and apps.  It occasionally gets corrupted and each window or tray icon replaced by some random slice of some graphic.

Comment 38 Jim Scarborough 2018-07-30 19:29:56 UTC
Judging by my rebooting patterns, this could be related to bug 1584463.  I have noticed crashes more often when there's some audio or video going.

Comment 39 lkjsldfads 2018-08-08 15:51:36 UTC
I am experiencing the same problem with Fedora 28.
Kernel: 4.17.11-200.fc28.x86_64
GPU:    GTX 780-ti

Comment 40 Will Newton 2018-09-27 14:52:40 UTC
I am still seeing the problem with the latest Fedora 28 (4.18.9-200.fc28.x86_64) but I will no longer have access to the hardware from tomorrow, it will not be missed.

Comment 41 Ben Cotton 2019-05-02 19:23:59 UTC
This message is a reminder that Fedora 28 is nearing its end of life.
On 2019-May-28 Fedora will stop maintaining and issuing updates for
Fedora 28. It is Fedora's policy to close all bug reports from releases
that are no longer maintained. At that time this bug will be closed as
EOL if it remains open with a Fedora 'version' of '28'.

Package Maintainer: If you wish for this bug to remain open because you
plan to fix it in a currently maintained version, simply change the 'version' 
to a later Fedora version.

Thank you for reporting this issue and we are sorry that we were not 
able to fix it before Fedora 28 is end of life. If you would still like 
to see this bug fixed and are able to reproduce it against a later version 
of Fedora, you are encouraged  change the 'version' to a later Fedora 
version prior this bug is closed as described in the policy above.

Although we aim to fix as many bugs as possible during every release's 
lifetime, sometimes those efforts are overtaken by events. Often a 
more recent Fedora release includes newer upstream software that fixes 
bugs or makes them obsolete.

Comment 42 Ben Cotton 2019-05-02 20:40:18 UTC
This message is a reminder that Fedora 28 is nearing its end of life.
On 2019-May-28 Fedora will stop maintaining and issuing updates for
Fedora 28. It is Fedora's policy to close all bug reports from releases
that are no longer maintained. At that time this bug will be closed as
EOL if it remains open with a Fedora 'version' of '28'.

Package Maintainer: If you wish for this bug to remain open because you
plan to fix it in a currently maintained version, simply change the 'version' 
to a later Fedora version.

Thank you for reporting this issue and we are sorry that we were not 
able to fix it before Fedora 28 is end of life. If you would still like 
to see this bug fixed and are able to reproduce it against a later version 
of Fedora, you are encouraged  change the 'version' to a later Fedora 
version prior this bug is closed as described in the policy above.

Although we aim to fix as many bugs as possible during every release's 
lifetime, sometimes those efforts are overtaken by events. Often a 
more recent Fedora release includes newer upstream software that fixes 
bugs or makes them obsolete.

Comment 43 Ben Cotton 2019-05-28 22:52:44 UTC
Fedora 28 changed to end-of-life (EOL) status on 2019-05-28. Fedora 28 is
no longer maintained, which means that it will not receive any further
security or bug fix updates. As a result we are closing this bug.

If you can reproduce this bug against a currently maintained version of
Fedora please feel free to reopen this bug against that version. If you
are unable to reopen this bug, please file a new report against the
current release. If you experience problems, please add a comment to this
bug.

Thank you for reporting this bug and we are sorry it could not be fixed.

Comment 44 icewater 2020-05-13 18:51:26 UTC
Under F32 on a Lenovo P51, I am still seeing this happen rather frequently when using totem to play a video, usually (I think) if I tab away to another application.  

The display will freeze and the mouse/keyboard will not respond.  I have to restart with the power button.

Is there a bug for this for F32?

Comment 45 Sanjay Upadhyay 2021-06-10 21:10:39 UTC
I have a p50,

I have Fedora 33 installed with latest update - 

xorg-x11-drv-nouveau-1.0.17-1.fc33.x86_64

with kernel 5.12.8-200.fc33.x86_64 -

I still see ocassional freezing and needing a cold reboot.

Logs - 

Apr 06 08:50:50 ramnaam kernel: nouveau 0000:01:00.0: fifo: CHSW_ERROR 00000002
Apr 06 08:50:51 ramnaam kernel: nouveau 0000:01:00.0: fifo: CHSW_ERROR 00000002
Apr 06 08:50:51 ramnaam kernel: nouveau 0000:01:00.0: fifo: CHSW_ERROR 00000002
Apr 06 08:50:51 ramnaam kernel: nouveau 0000:01:00.0: fifo: CHSW_ERROR 00000002
Apr 06 08:50:51 ramnaam kernel: nouveau 0000:01:00.0: fifo: CHSW_ERROR 00000002
Apr 06 08:50:51 ramnaam kernel: nouveau 0000:01:00.0: fifo: CHSW_ERROR 00000002
Apr 06 08:50:51 ramnaam kernel: nouveau 0000:01:00.0: fifo: CHSW_ERROR 00000002
Apr 06 08:50:51 ramnaam kernel: nouveau 0000:01:00.0: fifo: CHSW_ERROR 00000002

I am reopening as in fedora 33 its still happening. 

I do see this in logs - 
at hang 
Jun 11 00:12:35 ramnaam kernel: nouveau 0000:01:00.0: fifo: fault 00 [READ] at 000000000041f000 engine 00 [gr] client 10 [HUB/PD] reason 02 [PTE] on channel 6 [00ff294000 Xwayland[3123]]
Jun 11 00:12:35 ramnaam kernel: nouveau 0000:01:00.0: fifo: channel 6: killed
Jun 11 00:12:35 ramnaam kernel: nouveau 0000:01:00.0: fifo: runlist 0: scheduled for recovery
Jun 11 00:12:35 ramnaam kernel: nouveau 0000:01:00.0: fifo: engine 0: scheduled for recovery
Jun 11 00:14:39 ramnaam cupsd[1047]: REQUEST localhost - - "POST / HTTP/1.1" 200 186 Renew-Subscription client-error-not-found
Jun 11 00:17:21 ramnaam com.slack.Slack.desktop[377010]: Cannot upload crash dump: failed to open
Jun 11 00:17:21 ramnaam com.slack.Slack.desktop[377010]: --2021-06-11 00:17:21--  https://slack.com/apps/sentryproxy/api/5277886/minidump/?sentry_key=fd30fe469dbf4aec9db40548e5acf91e
Jun 11 00:17:22 ramnaam com.slack.Slack.desktop[377010]: Resolving slack.com (slack.com)... 15.206.34.128
Jun 11 00:17:22 ramnaam com.slack.Slack.desktop[377010]: Connecting to slack.com (slack.com)|15.206.34.128|:443... connected.
Jun 11 00:17:22 ramnaam com.slack.Slack.desktop[377010]: HTTP request sent, awaiting response... 200 OK
Jun 11 00:17:22 ramnaam com.slack.Slack.desktop[377010]: Length: unspecified [text/html]
Jun 11 00:17:22 ramnaam com.slack.Slack.desktop[377010]: Saving to: ‘/dev/fd/4’
Jun 11 00:17:22 ramnaam com.slack.Slack.desktop[377010]:      0K
Jun 11 00:17:22 ramnaam com.slack.Slack.desktop[377010]:  Failed to get crash dump id.
Jun 11 00:17:22 ramnaam com.slack.Slack.desktop[377010]:  Report Id:  57d0ec0e-e225-42
Jun 11 00:17:26 ramnaam com.slack.Slack.desktop[377010]:       libva error: vaGetDriverNameByIndex() failed with unknown libva error, driver_name = (null)
Jun 11 00:18:30 ramnaam kernel: ------------[ cut here ]------------
Jun 11 00:18:30 ramnaam kernel: WARNING: CPU: 3 PID: 512802 at drivers/gpu/drm/nouveau/nouveau_bo.c:921 nouveau_bo_move_ntfy.constprop.0+0xfa/0x150 [nouveau]
Jun 11 00:18:30 ramnaam kernel: Modules linked in: ath9k_htc ath9k_common ath9k_hw ath tun uinput rfcomm ccm xt_conntrack xt_MASQUERADE nf_conntrack_netlink xt_addrtype br_netfilter bridge stp llc nft_objref nf_conntrack_netbios_ns nf_c>
Jun 11 00:18:30 ramnaam kernel:  snd_intel_sdw_acpi btbcm libarc4 btintel vfat fat snd_hda_codec iwlwifi videobuf2_memops videobuf2_v4l2 irqbypass snd_hda_core videobuf2_common rapl intel_cstate snd_hwdep bluetooth videodev joydev intel>
Jun 11 00:18:30 ramnaam kernel: CPU: 3 PID: 512802 Comm: kworker/3:0 Tainted: G        W  OE     5.12.8-200.fc33.x86_64 #1
Jun 11 00:18:30 ramnaam kernel: Hardware name: LENOVO 20EQS64N1D/20EQS64N1D, BIOS N1EET86W (1.59 ) 08/28/2019
Jun 11 00:18:30 ramnaam kernel: Workqueue: pm pm_runtime_work
Jun 11 00:18:30 ramnaam kernel: RIP: 0010:nouveau_bo_move_ntfy.constprop.0+0xfa/0x150 [nouveau]
Jun 11 00:18:30 ramnaam kernel: Code: db 4d 85 e4 0f 84 50 ff ff ff 49 83 3c 24 00 74 58 49 8b 44 24 08 48 c1 e0 0c 48 89 83 f0 02 00 00 5b 5d 41 5c 41 5d 41 5e c3 <0f> 0b eb be 0f b6 83 08 03 00 00 d0 e8 83 e0 1f 38 45 49 75 8d 48
Jun 11 00:18:30 ramnaam kernel: RSP: 0018:ffffb744c8b07a90 EFLAGS: 00010286
Jun 11 00:18:30 ramnaam kernel: RAX: 00000000fffffff0 RBX: ffff9a1a5b161400 RCX: 0000000000000000
Jun 11 00:18:30 ramnaam kernel: RDX: ffff9a1c6ab02748 RSI: 0000000000000282 RDI: ffff9a20131c7700
Jun 11 00:18:30 ramnaam kernel: RBP: ffff9a1b1b903a40 R08: ffff9a1c877b0070 R09: 0000000000000000
Jun 11 00:18:30 ramnaam kernel: R10: 0000000000000000 R11: 0000000000000000 R12: ffffb744c8b07c18
Jun 11 00:18:30 ramnaam kernel: R13: ffff9a1a5b1616f8 R14: ffff9a1a5b1616c0 R15: ffffb744c8b07c18
Jun 11 00:18:30 ramnaam kernel: FS:  0000000000000000(0000) GS:ffff9a21c3cc0000(0000) knlGS:0000000000000000
Jun 11 00:18:30 ramnaam kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jun 11 00:18:30 ramnaam kernel: CR2: 000030d30b581000 CR3: 0000000780a10003 CR4: 00000000003706e0
Jun 11 00:18:30 ramnaam kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Jun 11 00:18:30 ramnaam kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Jun 11 00:18:30 ramnaam kernel: Call Trace:
Jun 11 00:18:30 ramnaam kernel:  nouveau_bo_move+0x43/0x990 [nouveau]
Jun 11 00:18:30 ramnaam kernel:  ? ttm_pool_alloc+0x17a/0x5e0 [ttm]
Jun 11 00:18:30 ramnaam kernel:  ttm_bo_handle_move_mem+0x90/0x170 [ttm]
Jun 11 00:18:30 ramnaam kernel:  ttm_bo_evict+0x10d/0x160 [ttm]
Jun 11 00:18:30 ramnaam kernel:  ttm_mem_evict_first+0x106/0x3b0 [ttm]
Jun 11 00:18:30 ramnaam kernel:  ttm_resource_manager_evict_all+0x9d/0x190 [ttm]
Jun 11 00:18:30 ramnaam kernel:  nouveau_do_suspend+0x82/0x180 [nouveau]
Jun 11 00:18:30 ramnaam kernel:  nouveau_pmops_runtime_suspend+0x3b/0xb0 [nouveau]
Jun 11 00:18:30 ramnaam kernel:  pci_pm_runtime_suspend+0x5e/0x170
Jun 11 00:18:30 ramnaam kernel:  ? pci_dev_put+0x20/0x20
Jun 11 00:18:30 ramnaam kernel:  ? pci_dev_put+0x20/0x20
Jun 11 00:18:30 ramnaam kernel:  __rpm_callback+0x81/0x140
Jun 11 00:18:30 ramnaam kernel:  ? pci_dev_put+0x20/0x20
Jun 11 00:18:30 ramnaam kernel:  rpm_callback+0x1f/0x70
Jun 11 00:18:30 ramnaam kernel:  ? pci_dev_put+0x20/0x20
Jun 11 00:18:30 ramnaam kernel:  rpm_suspend+0x137/0x6c0
Jun 11 00:18:30 ramnaam kernel:  ? __switch_to_asm+0x42/0x70
Jun 11 00:18:30 ramnaam kernel:  ? __switch_to+0x114/0x450
Jun 11 00:18:30 ramnaam kernel:  pm_runtime_work+0x8e/0x90
Jun 11 00:18:30 ramnaam kernel:  process_one_work+0x1ec/0x380
Jun 11 00:18:30 ramnaam kernel:  worker_thread+0x53/0x3e0
Jun 11 00:18:30 ramnaam kernel:  ? process_one_work+0x380/0x380
Jun 11 00:18:30 ramnaam kernel:  kthread+0x11b/0x140
Jun 11 00:18:30 ramnaam kernel:  ? __kthread_bind_mask+0x60/0x60
Jun 11 00:18:30 ramnaam kernel:  ret_from_fork+0x22/0x30
Jun 11 00:18:30 ramnaam kernel: ---[ end trace 16826addd8657bbf ]---
Jun 11 00:18:31 ramnaam abrt-dump-journal-oops[954]: abrt-dump-journal-oops: Found oopses: 1
Jun 11 00:18:31 ramnaam abrt-dump-journal-oops[954]: abrt-dump-journal-oops: Creating problem directories
Jun 11 00:18:31 ramnaam abrt-server[512832]: Oops looks like a problem in kernel module, new component xorg-x11-drv-nouveau
Jun 11 00:18:32 ramnaam abrt-notification[512850]: System encountered a non-fatal error in nouveau_bo_move()
Jun 11 00:18:32 ramnaam abrt-dump-journal-oops[954]: Reported 1 kernel oopses to Abrt
Jun 11 00:18:41 ramnaam com.slack.Slack.desktop[377010]: Cannot upload crash dump: failed to open
Jun 11 00:18:41 ramnaam com.slack.Slack.desktop[377010]: --2021-06-11 00:18:41--  https://slack.com/apps/sentryproxy/api/5277886/minidump/?sentry_key=fd30fe469dbf4aec9db40548e5acf91e
Jun 11 00:18:41 ramnaam com.slack.Slack.desktop[377010]: Resolving slack.com (slack.com)... 15.206.34.128
Jun 11 00:18:41 ramnaam com.slack.Slack.desktop[377010]: Connecting to slack.com (slack.com)|15.206.34.128|:443... connected.
Jun 11 00:18:42 ramnaam com.slack.Slack.desktop[377010]: HTTP request sent, awaiting response... 200 OK
Jun 11 00:18:42 ramnaam com.slack.Slack.desktop[377010]: Length: unspecified [text/html]
Jun 11 00:18:42 ramnaam com.slack.Slack.desktop[377010]: Saving to: ‘/dev/fd/4’
Jun 11 00:18:42 ramnaam com.slack.Slack.desktop[377010]:      0K
Jun 11 00:18:42 ramnaam com.slack.Slack.desktop[377010]:  Failed to get crash dump id.
Jun 11 00:18:42 ramnaam com.slack.Slack.desktop[377010]:  Report Id:  9de2efc4-a5b7-42
Jun 11 00:18:45 ramnaam kernel: [TTM] Buffer eviction failed
Jun 11 00:19:04 ramnaam kernel: nouveau 0000:01:00.0: Xwayland[3123]: failed to idle channel 2 [Xwayland[3123]]
Jun 11 00:19:19 ramnaam kernel: nouveau 0000:01:00.0: Xwayland[3123]: failed to idle channel 2 [Xwayland[3123]]
Jun 11 00:19:19 ramnaam kernel: nouveau 0000:01:00.0: fifo: fault 00 [READ] at 0000000000013000 engine 07 [HOST0] client 07 [HUB/HOST_CPU] reason 02 [PTE] on channel 2 [00ff8f9000 Xwayland[3123]]
Jun 11 00:19:19 ramnaam kernel: nouveau 0000:01:00.0: fifo: channel 2: killed
Jun 11 00:19:19 ramnaam kernel: nouveau 0000:01:00.0: fifo: runlist 0: scheduled for recovery
...
...
Jun 11 01:33:02 ramnaam kernel: nouveau 0000:01:00.0: Xwayland[3123]: nv50cal_space: -16
Jun 11 01:33:02 ramnaam gnome-shell[3123]: nouveau: kernel rejected pushbuf: Device or resource busy
Jun 11 01:33:02 ramnaam gnome-shell[3123]: nouveau: ch6: krec 0 pushes 1 bufs 8 relocs 0
Jun 11 01:33:02 ramnaam gnome-shell[3123]: nouveau: ch6: buf 00000000 00000004 00000004 00000004 00000000
Jun 11 01:33:02 ramnaam gnome-shell[3123]: nouveau: ch6: buf 00000001 00000008 00000002 00000002 00000002
Jun 11 01:33:02 ramnaam gnome-shell[3123]: nouveau: ch6: buf 00000002 0000000a 00000002 00000002 00000000
Jun 11 01:33:02 ramnaam gnome-shell[3123]: nouveau: ch6: buf 00000003 00000006 00000004 00000000 00000004
Jun 11 01:33:02 ramnaam gnome-shell[3123]: nouveau: ch6: buf 00000004 00000007 00000002 00000002 00000000
Jun 11 01:33:02 ramnaam gnome-shell[3123]: nouveau: ch6: buf 00000005 00000103 00000002 00000000 00000002
Jun 11 01:33:02 ramnaam gnome-shell[3123]: nouveau: ch6: buf 00000006 0000000b 00000004 00000004 00000000
Jun 11 01:33:02 ramnaam gnome-shell[3123]: nouveau: ch6: buf 00000007 00000020 00000002 00000000 00000002
Jun 11 01:33:02 ramnaam gnome-shell[3123]: nouveau: ch6: psh 00000000 000007fb0c 000007ffd8
un 11 01:33:02 ramnaam gnome-shell[3123]: nouveau:         0x200203fd
Jun 11 01:33:02 ramnaam gnome-shell[3123]: nouveau:         0x00640000
Jun 11 01:33:02 ramnaam gnome-shell[3123]: nouveau:         0x00080000
Jun 11 01:33:02 ramnaam gnome-shell[3123]: nouveau:         0x20090200
Jun 11 01:33:02 ramnaam gnome-shell[3123]: nouveau:         0x00000000
Jun 11 01:33:02 ramnaam gnome-shell[3123]: nouveau:         0x01e74000



at reboot
Jun 11 02:14:47 ramnaam kernel: nouveau: detected PR support, will not use DSM
Jun 11 02:14:47 ramnaam kernel: checking generic (b1000000 7e9000) vs hw (b2000000 1000000)
Jun 11 02:14:47 ramnaam kernel: checking generic (b1000000 7e9000) vs hw (a0000000 10000000)
Jun 11 02:14:47 ramnaam kernel: checking generic (b1000000 7e9000) vs hw (b0000000 2000000)
Jun 11 02:14:47 ramnaam kernel: fb0: switching to nouveaufb from EFI VGA
Jun 11 02:14:47 ramnaam kernel: nouveau 0000:01:00.0: vgaarb: deactivate vga console
Jun 11 02:14:47 ramnaam kernel: nouveau 0000:01:00.0: NVIDIA GM107 (117310a2)
...
Jun 11 02:14:47 ramnaam kernel: nouveau 0000:01:00.0: bios: version 82.07.9d.00.1f
Jun 11 02:14:47 ramnaam kernel: clocksource: Switched to clocksource tsc
Jun 11 02:14:47 ramnaam kernel: nouveau 0000:01:00.0: fb: 4096 MiB GDDR5
Jun 11 02:14:47 ramnaam kernel: nouveau 0000:01:00.0: bus: MMIO read of 00000000 FAULT at 001228 [ PRIVRING ]
Jun 11 02:14:47 ramnaam systemd-udevd[361]: Using default interface naming scheme 'v245'.
Jun 11 02:14:47 ramnaam kernel: [TTM] Zone  kernel: Available graphics memory: 16384234 KiB
Jun 11 02:14:47 ramnaam kernel: [TTM] Zone   dma32: Available graphics memory: 2097152 KiB
Jun 11 02:14:47 ramnaam kernel: nouveau 0000:01:00.0: DRM: VRAM: 4096 MiB
Jun 11 02:14:47 ramnaam kernel: nouveau 0000:01:00.0: DRM: GART: 1048576 MiB
Jun 11 02:14:47 ramnaam kernel: nouveau 0000:01:00.0: DRM: TMDS table version 2.0
Jun 11 02:14:47 ramnaam kernel: nouveau 0000:01:00.0: DRM: DCB version 4.0
Jun 11 02:14:47 ramnaam kernel: nouveau 0000:01:00.0: DRM: DCB outp 00: 04800fb6 04420010
Jun 11 02:14:47 ramnaam kernel: nouveau 0000:01:00.0: DRM: DCB outp 01: 02011fa6 04420010
Jun 11 02:14:47 ramnaam kernel: nouveau 0000:01:00.0: DRM: DCB outp 02: 02011f62 00020010
Jun 11 02:14:47 ramnaam kernel: nouveau 0000:01:00.0: DRM: DCB outp 03: 08022fc6 04420010
Jun 11 02:14:47 ramnaam kernel: nouveau 0000:01:00.0: DRM: DCB outp 04: 08022f82 00020010
Jun 11 02:14:47 ramnaam kernel: nouveau 0000:01:00.0: DRM: DCB outp 05: 01033fd6 04420020
Jun 11 02:14:47 ramnaam kernel: nouveau 0000:01:00.0: DRM: DCB outp 06: 01033f92 00020020
Jun 11 02:14:47 ramnaam kernel: nouveau 0000:01:00.0: DRM: DCB conn 00: 00002047
Jun 11 02:14:47 ramnaam kernel: nouveau 0000:01:00.0: DRM: DCB conn 01: 00001146
Jun 11 02:14:47 ramnaam kernel: nouveau 0000:01:00.0: DRM: DCB conn 02: 00010246
Jun 11 02:14:47 ramnaam kernel: nouveau 0000:01:00.0: DRM: DCB conn 03: 00020346
Jun 11 02:14:47 ramnaam kernel: nouveau 0000:01:00.0: DRM: MM: using COPY for buffer copies
Jun 11 02:14:47 ramnaam kernel: psmouse serio2: trackpoint: IBM TrackPoint firmware: 0x0e, buttons: 3/3
Jun 11 02:14:48 ramnaam rngd[250]: [jitter]: Enabling JITTER rng support
Jun 11 02:14:48 ramnaam rngd[250]: [jitter]: Initialized
Jun 11 02:14:48 ramnaam rngd[250]: [pkcs11]: Unable to load pkcs11 engine: (null)
Jun 11 02:14:48 ramnaam rngd[250]: [pkcs11]: Initialization Failed
Jun 11 02:14:48 ramnaam kernel: nouveau 0000:01:00.0: DRM: allocated 1920x1080 fb: 0x80000, bo 000000005a9567e0
Jun 11 02:14:48 ramnaam kernel: fbcon: nouveaudrmfb (fb0) is primary device
Jun 11 02:14:48 ramnaam kernel: fbcon: Deferring console take-over
Jun 11 02:14:48 ramnaam kernel: nouveau 0000:01:00.0: [drm] fb0: nouveaudrmfb frame buffer device
Jun 11 02:14:48 ramnaam kernel: [drm] Initialized nouveau 1.3.1 20120801 for 0000:01:00.0 on minor 0
Jun 11 02:14:48 ramnaam kernel: nouveau 0000:01:00.0: DRM: Disabling PCI power management to avoid bug

I see an older xserver -nouveau bug dating 2016 and still unresolved - https://bugs.freedesktop.org/show_bug.cgi?id=93629

Comment 46 Wayne Walker 2021-07-07 00:33:11 UTC
@supadhya : Your logs looks almost identical to mine.  Your problem seems to have started when mine did (mine started on 2021-06-16, but I hadn't run dnf for about a week.

I have a bug open for mine :  1979758

Comment 47 Sanjay Upadhyay 2021-07-13 05:45:46 UTC
no other info to give. 
I see this happening on -> p50 -> gnome -> sleep/wakeup times or long time running. 
Since I have moved to i3 WM and it only happens with google chrome now, so I am using i3 WM with firefox, things are a bit more stable.

Comment 48 Ben Cotton 2021-11-04 13:38:21 UTC
This message is a reminder that Fedora 33 is nearing its end of life.
Fedora will stop maintaining and issuing updates for Fedora 33 on 2021-11-30.
It is Fedora's policy to close all bug reports from releases that are no longer
maintained. At that time this bug will be closed as EOL if it remains open with a
Fedora 'version' of '33'.

Package Maintainer: If you wish for this bug to remain open because you
plan to fix it in a currently maintained version, simply change the 'version' 
to a later Fedora version.

Thank you for reporting this issue and we are sorry that we were not 
able to fix it before Fedora 33 is end of life. If you would still like 
to see this bug fixed and are able to reproduce it against a later version 
of Fedora, you are encouraged  change the 'version' to a later Fedora 
version prior this bug is closed as described in the policy above.

Although we aim to fix as many bugs as possible during every release's 
lifetime, sometimes those efforts are overtaken by events. Often a 
more recent Fedora release includes newer upstream software that fixes 
bugs or makes them obsolete.

Comment 49 Ben Cotton 2021-11-04 14:07:58 UTC
This message is a reminder that Fedora 33 is nearing its end of life.
Fedora will stop maintaining and issuing updates for Fedora 33 on 2021-11-30.
It is Fedora's policy to close all bug reports from releases that are no longer
maintained. At that time this bug will be closed as EOL if it remains open with a
Fedora 'version' of '33'.

Package Maintainer: If you wish for this bug to remain open because you
plan to fix it in a currently maintained version, simply change the 'version' 
to a later Fedora version.

Thank you for reporting this issue and we are sorry that we were not 
able to fix it before Fedora 33 is end of life. If you would still like 
to see this bug fixed and are able to reproduce it against a later version 
of Fedora, you are encouraged  change the 'version' to a later Fedora 
version prior this bug is closed as described in the policy above.

Although we aim to fix as many bugs as possible during every release's 
lifetime, sometimes those efforts are overtaken by events. Often a 
more recent Fedora release includes newer upstream software that fixes 
bugs or makes them obsolete.

Comment 50 Ben Cotton 2021-11-04 15:04:56 UTC
This message is a reminder that Fedora 33 is nearing its end of life.
Fedora will stop maintaining and issuing updates for Fedora 33 on 2021-11-30.
It is Fedora's policy to close all bug reports from releases that are no longer
maintained. At that time this bug will be closed as EOL if it remains open with a
Fedora 'version' of '33'.

Package Maintainer: If you wish for this bug to remain open because you
plan to fix it in a currently maintained version, simply change the 'version' 
to a later Fedora version.

Thank you for reporting this issue and we are sorry that we were not 
able to fix it before Fedora 33 is end of life. If you would still like 
to see this bug fixed and are able to reproduce it against a later version 
of Fedora, you are encouraged  change the 'version' to a later Fedora 
version prior this bug is closed as described in the policy above.

Although we aim to fix as many bugs as possible during every release's 
lifetime, sometimes those efforts are overtaken by events. Often a 
more recent Fedora release includes newer upstream software that fixes 
bugs or makes them obsolete.

Comment 51 Ben Cotton 2021-11-30 19:15:55 UTC
Fedora 33 changed to end-of-life (EOL) status on 2021-11-30. Fedora 33 is
no longer maintained, which means that it will not receive any further
security or bug fix updates. As a result we are closing this bug.

If you can reproduce this bug against a currently maintained version of
Fedora please feel free to reopen this bug against that version. If you
are unable to reopen this bug, please file a new report against the
current release. If you experience problems, please add a comment to this
bug.

Thank you for reporting this bug and we are sorry it could not be fixed.


Note You need to log in before you can comment on or make changes to this bug.