Bug 2274749 - [nouveau] failures prevent use of external (usb-c) LCD/display across suspend/resume cycles on Thinkpad P1 (gen2)
Summary: [nouveau] failures prevent use of external (usb-c) LCD/display across suspen...
Keywords:
Status: CLOSED EOL
Alias: None
Product: Fedora
Classification: Fedora
Component: kernel
Version: 40
Hardware: x86_64
OS: Linux
unspecified
medium
Target Milestone: ---
Assignee: Kernel Maintainer List
QA Contact: Fedora Extras Quality Assurance
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2024-04-12 17:00 UTC by Satish Balay
Modified: 2025-05-16 08:04 UTC (History)
15 users (show)

Fixed In Version:
Clone Of:
Environment:
Last Closed: 2025-05-16 08:04:46 UTC
Type: ---
Embargoed:


Attachments (Terms of Use)
dmesg (146.52 KB, text/plain)
2024-04-12 17:00 UTC, Satish Balay
no flags Details
dmesg-rawhide-kernel.txt (149.48 KB, text/plain)
2024-04-12 17:11 UTC, Satish Balay
no flags Details


Links
System ID Private Priority Status Summary Last Updated
freedesktop.org Gitlab drm nouveau issues 352 0 None opened Reattaching USB-C HDMI dongle crashes driver 2024-09-19 17:01:19 UTC

Description Satish Balay 2024-04-12 17:00:09 UTC
1. Please describe the problem:

I'm encountering various issues with F40 upgrade - likely triggered by nouveau failures in kernel

2. What is the Version-Release number of the kernel:

kernel-6.8.5-301.fc40.x86_64

3. Did it work previously in Fedora? If so, what kernel version did the issue
   *first* appear?  Old kernels are available for download at
   https://koji.fedoraproject.org/koji/packageinfo?packageID=8 :

Fedora 39 - before upgrade to Fedora 40 worked fine. Kernel used previously:
kernel-6.7.10-200.fc39.x86_64

4. Can you reproduce this issue? If so, please provide the steps to reproduce
   the issue below:

- On Thinkpad P1 Gen2, upgrade from F39 to F40
- Update to latest pkgs - including latest F40 kernel
- reboot

5. Does this problem occur with the latest Rawhide kernel? To install the
   Rawhide kernel, run ``sudo dnf install fedora-repos-rawhide`` followed by
   ``sudo dnf update --enablerepo=rawhide kernel``:

Yet to try

6. Are you running any modules that not shipped with directly Fedora's kernel?:

No

7. Please attach the kernel logs. You can get the complete kernel log
   for a boot with ``journalctl --no-hostname -k > dmesg.txt``. If the
   issue occurred on a previous boot, use the journalctl ``-b`` flag.

Will attach.

Reproducible: Always

Comment 1 Satish Balay 2024-04-12 17:00:50 UTC
Created attachment 2026650 [details]
dmesg

Comment 2 Satish Balay 2024-04-12 17:11:13 UTC
Created attachment 2026652 [details]
dmesg-rawhide-kernel.txt

Issue persists with kernel-6.9.0-0.rc3.20240409git20cb38a7af88.31.fc41.x86_64

Comment 3 Satish Balay 2024-05-01 15:26:33 UTC
To update: kernel-6.9.0-0.rc6.51.fc41.x86_64 is working better now

- The above error has not come up yet.
- I'm now able to connect/use my external display via usb-c [this didn't work before]

I'm now seeing a different stack though [perhaps its a different issue]

May 01 10:04:15 p1 kernel: WARNING: CPU: 4 PID: 6212 at drivers/gpu/drm/nouveau/dispnv50/disp.c:1675 nv50_sor_atomic_enable+0x7e1/0x860 [nouveau]

Comment 4 Satish Balay 2024-05-02 14:48:12 UTC
To update: for the past one day - I get:

$ journalctl -k -b -1 |grep 'WARNING: '
May 01 10:04:15 p1 kernel: WARNING: CPU: 4 PID: 6212 at drivers/gpu/drm/nouveau/dispnv50/disp.c:1675 nv50_sor_atomic_enable+0x7e1/0x860 [nouveau]
May 01 19:32:34 p1 kernel: WARNING: CPU: 4 PID: 30671 at drivers/gpu/drm/nouveau/dispnv50/disp.c:1675 nv50_sor_atomic_enable+0x7e1/0x860 [nouveau]
May 02 07:42:40 p1 kernel: WARNING: CPU: 11 PID: 33673 at kernel/power/suspend_test.c:53 suspend_test_finish+0x74/0x80
May 02 08:50:31 p1 kernel: WARNING: CPU: 9 PID: 2852 at drivers/gpu/drm/nouveau/nvkm/subdev/gsp/r535.c:112 r535_gsp_msgq_wait+0x1ad/0x1d0 [nouveau]
May 02 08:50:31 p1 kernel: WARNING: CPU: 9 PID: 2852 at drivers/gpu/drm/nouveau/nvkm/engine/disp/r535.c:395 r535_sor_dp_audio+0x116/0x130 [nouveau]

Note: r535_gsp_msgq_wait did re-appear.

And subsequently - the machine froze on suspend - so issue(s) persist.

Comment 5 Satish Balay 2024-05-15 14:19:03 UTC
using: kernel-6.9.0-64.fc41.x86_64

Issue with using external (usb-c) monitor persist - but don't see r535_gsp_msgq_wait in the messages (yet).

I guess suspend/resume triggers some errors - that prevent subsequent connection to the external monitor

$ journalctl -k  |grep 'WARNING: '
May 14 16:07:54 p1 kernel: WARNING: CPU: 7 PID: 85942 at drivers/gpu/drm/nouveau/dispnv50/disp.c:1675 nv50_sor_atomic_enable+0x7e2/0x860 [nouveau]
May 14 17:48:16 p1 kernel: WARNING: CPU: 4 PID: 106799 at kernel/power/suspend_test.c:53 suspend_test_finish+0x74/0x80

Comment 6 Satish Balay 2024-05-31 17:26:24 UTC
$ journalctl -k  -b -1 | head -1
May 31 12:11:29 p1 kernel: Linux version 6.10.0-0.rc1.20240531git4a4be1ad3a6e.21.fc41.x86_64 (mockbuild@1a4207f1ad9d444f9b3af705d3c56c96) (gcc (GCC) 14.1.1 20240522 (Red Hat 14.1.1-4), GNU ld version 2.42.50.20240513) #1 SMP PREEMPT_DYNAMIC Fri May 31 13:39:24 UTC 2024

$ journalctl -k  -b -1 |grep 'WARNING: '
May 31 12:14:51 p1 kernel: WARNING: CPU: 1 PID: 4357 at drivers/gpu/drm/nouveau/dispnv50/disp.c:1675 nv50_sor_atomic_enable+0x7e2/0x860 [nouveau]
May 31 12:16:26 p1 kernel: WARNING: CPU: 5 PID: 929 at drivers/gpu/drm/nouveau/nvkm/subdev/gsp/r535.c:112 r535_gsp_msgq_wait+0x1ad/0x1d0 [nouveau]
May 31 12:16:26 p1 kernel: WARNING: CPU: 5 PID: 929 at drivers/gpu/drm/nouveau/nvkm/subdev/gsp/r535.c:2057 r535_gsp_fini+0x30f/0x340 [nouveau]


i.e issues persist for me with latest 6.10-rc

And likely related to (or triggered by?) https://bugzilla.redhat.com/show_bug.cgi?id=2275153

Comment 7 Satish Balay 2024-06-24 19:47:22 UTC
$ journalctl -k  |head -1
Jun 24 13:42:24 p1 kernel: Linux version 6.10.0-0.rc5.43.fc41.x86_64 (mockbuild@f81a602120654e7fb44d556e88cd2f47) (gcc (GCC) 14.1.1 20240620 (Red Hat 14.1.1-6), GNU ld version 2.42.50.20240531) #1 SMP PREEMPT_DYNAMIC Mon Jun 24 17:24:24 UTC 2024
$ journalctl -k  |grep 'WARNING: '
Jun 24 14:01:45 p1 kernel: WARNING: CPU: 7 PID: 5519 at drivers/gpu/drm/nouveau/dispnv50/disp.c:1675 nv50_sor_atomic_enable+0x7e2/0x860 [nouveau]


Even though there is one strack trace here - this kernel has been stable [so far] wrt external LCD usage (connect/disconnect along with  suspend/resume) - likely earlier issues were primarily due to  https://bugzilla.redhat.com/show_bug.cgi?id=2275153 - and now thats fixed [in 6.10.0-0.rc5].

Comment 8 Satish Balay 2024-06-24 21:10:29 UTC
I spoke too soon. 

$ journalctl -k  -b -1 |grep 'WARNING: '
Jun 24 14:01:45 p1 kernel: WARNING: CPU: 7 PID: 5519 at drivers/gpu/drm/nouveau/dispnv50/disp.c:1675 nv50_sor_atomic_enable+0x7e2/0x860 [nouveau]
Jun 24 15:35:01 p1 kernel: WARNING: CPU: 10 PID: 15455 at drivers/gpu/drm/nouveau/dispnv50/disp.c:1675 nv50_sor_atomic_enable+0x7e2/0x860 [nouveau]
Jun 24 15:38:18 p1 kernel: WARNING: CPU: 3 PID: 15764 at drivers/gpu/drm/nouveau/nvkm/subdev/gsp/r535.c:112 r535_gsp_msgq_wait+0x1ad/0x1d0 [nouveau]
Jun 24 15:38:32 p1 kernel: WARNING: CPU: 1 PID: 15764 at drivers/gpu/drm/nouveau/nvkm/subdev/gsp/r535.c:112 r535_gsp_msgq_wait+0x1ad/0x1d0 [nouveau]
Jun 24 15:38:46 p1 kernel: WARNING: CPU: 9 PID: 15764 at drivers/gpu/drm/nouveau/nvkm/subdev/gsp/r535.c:112 r535_gsp_msgq_wait+0x1ad/0x1d0 [nouveau]
Jun 24 15:38:59 p1 kernel: WARNING: CPU: 3 PID: 15764 at drivers/gpu/drm/nouveau/nvkm/subdev/gsp/r535.c:112 r535_gsp_msgq_wait+0x1ad/0x1d0 [nouveau]
Jun 24 15:39:13 p1 kernel: WARNING: CPU: 10 PID: 15764 at drivers/gpu/drm/nouveau/nvkm/subdev/gsp/r535.c:112 r535_gsp_msgq_wait+0x1ad/0x1d0 [nouveau]
Jun 24 15:39:27 p1 kernel: WARNING: CPU: 4 PID: 15764 at drivers/gpu/drm/nouveau/nvkm/subdev/gsp/r535.c:112 r535_gsp_msgq_wait+0x1ad/0x1d0 [nouveau]
Jun 24 15:39:40 p1 kernel: WARNING: CPU: 3 PID: 15764 at drivers/gpu/drm/nouveau/nvkm/subdev/gsp/r535.c:112 r535_gsp_msgq_wait+0x1ad/0x1d0 [nouveau]
Jun 24 15:39:54 p1 kernel: WARNING: CPU: 2 PID: 15764 at drivers/gpu/drm/nouveau/nvkm/subdev/gsp/r535.c:112 r535_gsp_msgq_wait+0x1ad/0x1d0 [nouveau]
Jun 24 15:40:07 p1 kernel: WARNING: CPU: 3 PID: 15764 at drivers/gpu/drm/nouveau/nvkm/subdev/gsp/r535.c:112 r535_gsp_msgq_wait+0x1ad/0x1d0 [nouveau]
Jun 24 15:40:20 p1 kernel: WARNING: CPU: 2 PID: 16217 at drivers/gpu/drm/nouveau/nvkm/subdev/gsp/r535.c:112 r535_gsp_msgq_wait+0x1ad/0x1d0 [nouveau]

With:

Jun 24 15:38:18 p1 kernel: nouveau 0000:01:00.0: Xorg[15764]: VMM allocation failed: -110
Jun 24 15:38:31 p1 kernel: usb 1-4-port4: cannot reset (err = -71)
Jun 24 15:38:31 p1 kernel: usb 1-4-port4: cannot reset (err = -71)
...

And both displays stopped working - requiring a reboot.

Checking earlier logs:

$ journalctl |grep 'VMM allocation failed'
May 02 15:40:58 p1 kernel: nouveau 0000:01:00.0: chrome[34303]: VMM allocation failed: -110
May 02 15:41:14 p1 kernel: nouveau 0000:01:00.0: chrome[34303]: VMM allocation failed: -110
May 10 08:57:07 p1 kernel: nouveau 0000:01:00.0: plymouthd[45231]: VMM allocation failed: -110
May 10 08:59:51 p1 kernel: nouveau 0000:01:00.0: plymouthd[45231]: VMM allocation failed: -110
Jun 24 15:38:18 p1 kernel: nouveau 0000:01:00.0: Xorg[15764]: VMM allocation failed: -110
Jun 24 15:40:20 p1 kernel: nouveau 0000:01:00.0: plymouthd[16217]: VMM allocation failed: -110

Comment 9 Satish Balay 2024-07-29 15:19:00 UTC
My inability to use an external (usb-c) monitor with Thinkpad P1 Gen 2 (across suspend/resume cycles) persists with 6.11.0-0.rc1

$ journalctl -k |head -1
Jul 29 10:09:34 p1 kernel: Linux version 6.11.0-0.rc1.20240729gitdc1c8034e31b.16.fc41.x86_64 (mockbuild@8a7a492a2be847b9889593d8ce342245) (gcc (GCC) 14.1.1 20240701 (Red Hat 14.1.1-7), GNU ld version 2.42.90.20240720) #1 SMP PREEMPT_DYNAMIC Mon Jul 29 14:18:39 UTC 2024

$ journalctl -k |grep WARNING:
Jul 29 10:11:14 p1 kernel: WARNING: CPU: 8 PID: 4183 at drivers/gpu/drm/nouveau/dispnv50/disp.c:1675 nv50_sor_atomic_enable+0x7c4/0x840 [nouveau]
Jul 29 10:12:27 p1 kernel: WARNING: CPU: 1 PID: 4010 at drivers/gpu/drm/nouveau/nvkm/subdev/gsp/r535.c:114 r535_gsp_msgq_wait+0x1af/0x1d0 [nouveau]
Jul 29 10:12:27 p1 kernel: WARNING: CPU: 1 PID: 4010 at drivers/gpu/drm/nouveau/nvkm/subdev/gsp/r535.c:2371 r535_gsp_fini+0x30f/0x340 [nouveau]

$ journalctl -k |grep 'kernel: nouveau '
Jul 29 10:09:37 p1 kernel: nouveau 0000:01:00.0: enabling device (0000 -> 0003)
Jul 29 10:09:37 p1 kernel: nouveau 0000:01:00.0: NVIDIA TU117 (167000a1)
Jul 29 10:09:37 p1 kernel: nouveau 0000:01:00.0: bios: version 90.17.20.00.14
Jul 29 10:09:37 p1 kernel: nouveau 0000:01:00.0: DRM: VRAM: 4096 MiB
Jul 29 10:09:37 p1 kernel: nouveau 0000:01:00.0: DRM: GART: 536870912 MiB
Jul 29 10:09:37 p1 kernel: nouveau 0000:01:00.0: DRM: MM: using COPY for buffer copies
Jul 29 10:09:37 p1 kernel: nouveau 0000:01:00.0: [drm] Cannot find any crtc or sizes
Jul 29 10:09:37 p1 kernel: nouveau 0000:01:00.0: DRM: Disabling PCI power management to avoid bug
Jul 29 10:09:37 p1 kernel: nouveau 0000:01:00.0: [drm] Cannot find any crtc or sizes
Jul 29 10:09:37 p1 kernel: nouveau 0000:01:00.0: [drm] Cannot find any crtc or sizes
Jul 29 10:09:37 p1 kernel: nouveau 0000:01:00.0: [drm] Cannot find any crtc or sizes
Jul 29 10:09:37 p1 kernel: nouveau 0000:01:00.0: [drm] Cannot find any crtc or sizes
Jul 29 10:10:21 p1 kernel: nouveau 0000:01:00.0: [drm] fb1: nouveaudrmfb frame buffer device
Jul 29 10:11:14 p1 kernel: nouveau 0000:01:00.0: gsp: cli:0xc1d00002 obj:0x00730000 ctrl cmd:0x00731343 failed: 0x00000001
Jul 29 10:11:14 p1 kernel: nouveau 0000:01:00.0: DRM: [DRM/00000009:kmsOutp] [DP_TRAIN retrain:0 mst:0 lttprs:1 post_lt_adj:0 nr:4 bw:540000] (ret:-5)
Jul 29 10:11:14 p1 kernel: nouveau 0000:01:00.0: gsp: cli:0xc1d00002 obj:0x00730000 ctrl cmd:0x00731343 failed: 0x00000001
Jul 29 10:11:14 p1 kernel: nouveau 0000:01:00.0: DRM: [DRM/00000009:kmsOutp] [DP_TRAIN retrain:0 mst:0 lttprs:1 post_lt_adj:0 nr:4 bw:270000] (ret:-5)
Jul 29 10:11:14 p1 kernel: nouveau 0000:01:00.0: gsp: cli:0xc1d00002 obj:0x00730000 ctrl cmd:0x00731343 failed: 0x00000001
Jul 29 10:11:14 p1 kernel: nouveau 0000:01:00.0: DRM: [DRM/00000009:kmsOutp] [DP_TRAIN retrain:0 mst:0 lttprs:1 post_lt_adj:0 nr:4 bw:162000] (ret:-5)
Jul 29 10:11:14 p1 kernel: nouveau 0000:01:00.0: gsp: cli:0xc1d00002 obj:0x00730000 ctrl cmd:0x00731343 failed: 0x00000001
Jul 29 10:11:14 p1 kernel: nouveau 0000:01:00.0: DRM: [DRM/00000009:kmsOutp] [DP_TRAIN retrain:0 mst:0 lttprs:1 post_lt_adj:0 nr:2 bw:540000] (ret:-5)
Jul 29 10:11:14 p1 kernel: nouveau 0000:01:00.0: gsp: cli:0xc1d00002 obj:0x00730000 ctrl cmd:0x00731343 failed: 0x00000001
Jul 29 10:11:14 p1 kernel: nouveau 0000:01:00.0: DRM: [DRM/00000009:kmsOutp] [DP_TRAIN retrain:0 mst:0 lttprs:1 post_lt_adj:0 nr:2 bw:270000] (ret:-5)
Jul 29 10:11:14 p1 kernel: nouveau 0000:01:00.0: gsp: cli:0xc1d00002 obj:0x00730000 ctrl cmd:0x00731343 failed: 0x00000001
Jul 29 10:11:14 p1 kernel: nouveau 0000:01:00.0: DRM: [DRM/00000009:kmsOutp] [DP_TRAIN retrain:0 mst:0 lttprs:1 post_lt_adj:0 nr:2 bw:162000] (ret:-5)
Jul 29 10:11:14 p1 kernel: nouveau 0000:01:00.0: gsp: cli:0xc1d00002 obj:0x00730000 ctrl cmd:0x00731343 failed: 0x00000001
Jul 29 10:11:14 p1 kernel: nouveau 0000:01:00.0: DRM: [DRM/00000009:kmsOutp] [DP_TRAIN retrain:0 mst:0 lttprs:1 post_lt_adj:0 nr:1 bw:540000] (ret:-5)
Jul 29 10:11:14 p1 kernel: nouveau 0000:01:00.0: gsp: cli:0xc1d00002 obj:0x00730000 ctrl cmd:0x00731343 failed: 0x00000001
Jul 29 10:11:14 p1 kernel: nouveau 0000:01:00.0: DRM: [DRM/00000009:kmsOutp] [DP_TRAIN retrain:0 mst:0 lttprs:1 post_lt_adj:0 nr:1 bw:270000] (ret:-5)
Jul 29 10:11:14 p1 kernel: nouveau 0000:01:00.0: gsp: cli:0xc1d00002 obj:0x00730000 ctrl cmd:0x00731343 failed: 0x00000001
Jul 29 10:11:14 p1 kernel: nouveau 0000:01:00.0: DRM: [DRM/00000009:kmsOutp] [DP_TRAIN retrain:0 mst:0 lttprs:1 post_lt_adj:0 nr:1 bw:162000] (ret:-5)
Jul 29 10:11:16 p1 kernel: nouveau 0000:01:00.0: DRM: core notifier timeout
Jul 29 10:12:27 p1 kernel: nouveau 0000:01:00.0: gsp:msg fn:103 len:0x60/0x40 res:0x65 resp:0x65
Jul 29 10:12:27 p1 kernel: nouveau 0000:01:00.0: gsp: fini failed, -110
Jul 29 10:12:27 p1 kernel: nouveau 0000:01:00.0: DRM: Client resume failed with error: -22
Jul 29 10:12:27 p1 kernel: nouveau 0000:01:00.0: DRM: resume failed with: -22
Jul 29 10:12:34 p1 kernel: nouveau 0000:01:00.0: DRM: Dropped ACPI reprobe event due to RPM error: -22
Jul 29 10:12:39 p1 kernel: nouveau 0000:01:00.0: DRM: Dropped ACPI reprobe event due to RPM error: -22

Comment 10 Satish Balay 2024-09-16 21:36:48 UTC
This failure persists with 6.11.0-63.fc42.x86_64. i.e:

- boot/login
- connect external monitor via usb-c
- disconnect external monitor

This gives:

Sep 16 16:29:52 p1 kernel: usb 1-4: USB disconnect, device number 4
Sep 16 16:29:52 p1 kernel: usb 1-4.1: USB disconnect, device number 5
Sep 16 16:29:52 p1 kernel: usb 1-4.4: USB disconnect, device number 7
Sep 16 16:29:52 p1 kernel: usb 1-4.5: USB disconnect, device number 6
Sep 16 16:29:53 p1 kernel: nouveau 0000:01:00.0: gsp: cli:0xc1d00002 obj:0x00730000 ctrl cmd:0x00731343 failed: 0x00000001
Sep 16 16:29:53 p1 kernel: nouveau 0000:01:00.0: DRM: [DRM/00000009:kmsOutp] [DP_TRAIN retrain:0 mst:0 lttprs:1 post_lt_adj:0 nr:4 bw:540000] (ret:-5)
Sep 16 16:29:53 p1 kernel: nouveau 0000:01:00.0: gsp: cli:0xc1d00002 obj:0x00730000 ctrl cmd:0x00731343 failed: 0x00000001
Sep 16 16:29:53 p1 kernel: nouveau 0000:01:00.0: DRM: [DRM/00000009:kmsOutp] [DP_TRAIN retrain:0 mst:0 lttprs:1 post_lt_adj:0 nr:4 bw:270000] (ret:-5)
Sep 16 16:29:53 p1 kernel: nouveau 0000:01:00.0: gsp: cli:0xc1d00002 obj:0x00730000 ctrl cmd:0x00731343 failed: 0x00000001
Sep 16 16:29:53 p1 kernel: nouveau 0000:01:00.0: DRM: [DRM/00000009:kmsOutp] [DP_TRAIN retrain:0 mst:0 lttprs:1 post_lt_adj:0 nr:4 bw:162000] (ret:-5)
Sep 16 16:29:53 p1 kernel: nouveau 0000:01:00.0: gsp: cli:0xc1d00002 obj:0x00730000 ctrl cmd:0x00731343 failed: 0x00000001
Sep 16 16:29:53 p1 kernel: nouveau 0000:01:00.0: DRM: [DRM/00000009:kmsOutp] [DP_TRAIN retrain:0 mst:0 lttprs:1 post_lt_adj:0 nr:2 bw:540000] (ret:-5)
Sep 16 16:29:53 p1 kernel: nouveau 0000:01:00.0: gsp: cli:0xc1d00002 obj:0x00730000 ctrl cmd:0x00731343 failed: 0x00000001
Sep 16 16:29:53 p1 kernel: nouveau 0000:01:00.0: DRM: [DRM/00000009:kmsOutp] [DP_TRAIN retrain:0 mst:0 lttprs:1 post_lt_adj:0 nr:2 bw:270000] (ret:-5)
Sep 16 16:29:53 p1 kernel: nouveau 0000:01:00.0: gsp: cli:0xc1d00002 obj:0x00730000 ctrl cmd:0x00731343 failed: 0x00000001
Sep 16 16:29:53 p1 kernel: nouveau 0000:01:00.0: DRM: [DRM/00000009:kmsOutp] [DP_TRAIN retrain:0 mst:0 lttprs:1 post_lt_adj:0 nr:2 bw:162000] (ret:-5)
Sep 16 16:29:53 p1 kernel: nouveau 0000:01:00.0: gsp: cli:0xc1d00002 obj:0x00730000 ctrl cmd:0x00731343 failed: 0x00000001
Sep 16 16:29:53 p1 kernel: nouveau 0000:01:00.0: DRM: [DRM/00000009:kmsOutp] [DP_TRAIN retrain:0 mst:0 lttprs:1 post_lt_adj:0 nr:1 bw:540000] (ret:-5)
Sep 16 16:29:53 p1 kernel: nouveau 0000:01:00.0: gsp: cli:0xc1d00002 obj:0x00730000 ctrl cmd:0x00731343 failed: 0x00000001
Sep 16 16:29:53 p1 kernel: nouveau 0000:01:00.0: DRM: [DRM/00000009:kmsOutp] [DP_TRAIN retrain:0 mst:0 lttprs:1 post_lt_adj:0 nr:1 bw:270000] (ret:-5)
Sep 16 16:29:53 p1 kernel: nouveau 0000:01:00.0: gsp: cli:0xc1d00002 obj:0x00730000 ctrl cmd:0x00731343 failed: 0x00000001
Sep 16 16:29:53 p1 kernel: nouveau 0000:01:00.0: DRM: [DRM/00000009:kmsOutp] [DP_TRAIN retrain:0 mst:0 lttprs:1 post_lt_adj:0 nr:1 bw:162000] (ret:-5)
Sep 16 16:29:53 p1 kernel: ------------[ cut here ]------------
Sep 16 16:29:53 p1 kernel: WARNING: CPU: 7 PID: 3837 at drivers/gpu/drm/nouveau/dispnv50/disp.c:1675 nv50_sor_atomic_enable+0x7c4/0x840 [nouveau]
Sep 16 16:29:53 p1 kernel: Modules linked in: hid_lenovo uinput rfcomm snd_seq_dummy snd_hrtimer nf_conntrack_netbios_ns nf_conntrack_broadcast nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4 nf_reject_ipv6 nft_reject nft_ct nft_chain_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 ip_set nf_tables qrtr bnep sunrpc rmi_smbus rmi_core binfmt_misc snd_sof_pci_intel_cnl snd_sof_intel_hda_generic soundwire_intel soundwire_cadence vfat fat snd_sof_intel_hda_common snd_sof_intel_hda_mlink snd_sof_intel_hda intel_uncore_frequency intel_uncore_frequency_common snd_sof_pci intel_pmc_core_pltdrv intel_pmc_core snd_sof_xtensa_dsp intel_vsec snd_sof pmt_telemetry pmt_class intel_tcc_cooling iwlmvm x86_pkg_temp_thermal intel_powerclamp snd_sof_utils coretemp soundwire_generic_allocation r8153_ecm soundwire_bus iTCO_wdt spi_nor cdc_ether intel_pmc_bxt kvm_intel mtd iTCO_vendor_support ee1004 mei_wdt mei_hdcp mei_pxp intel_rapl_msr usbnet mac80211 snd_soc_avs kvm snd_soc_hda_codec snd_soc_skl libarc4
Sep 16 16:29:53 p1 kernel:  snd_soc_hdac_hda snd_hda_ext_core snd_soc_sst_ipc snd_soc_sst_dsp snd_soc_acpi_intel_match rapl snd_soc_acpi intel_cstate snd_hda_codec_conexant snd_soc_core snd_hda_codec_generic intel_uncore snd_hda_codec_hdmi snd_compress ac97_bus snd_pcm_dmaengine pcspkr snd_hda_intel think_lmi iwlwifi snd_intel_dspcfg firmware_attributes_class snd_intel_sdw_acpi intel_wmi_thunderbolt wmi_bmof uvcvideo snd_hda_codec btusb uvc snd_hda_core videobuf2_vmalloc btrtl videobuf2_memops processor_thermal_device_pci_legacy btintel videobuf2_v4l2 processor_thermal_device snd_hwdep processor_thermal_wt_hint snd_ctl_led snd_seq r8152 btbcm videobuf2_common processor_thermal_rfim btmtk cfg80211 videodev mii snd_seq_device processor_thermal_rapl thinkpad_acpi i2c_i801 bluetooth e1000e intel_rapl_common mc spi_intel_pci thunderbolt spi_intel snd_pcm i2c_smbus sparse_keymap mei_me platform_profile processor_thermal_wt_req processor_thermal_power_floor rfkill snd_timer mei processor_thermal_mbox idma64 intel_soc_dts_iosf
Sep 16 16:29:53 p1 kernel:  intel_pch_thermal snd int3403_thermal soundcore int340x_thermal_zone acpi_pad int3400_thermal acpi_thermal_rel joydev squashfs loop dm_multipath nfnetlink zram dm_crypt i915 nouveau crct10dif_pclmul crc32_pclmul crc32c_intel drm_ttm_helper polyval_clmulni polyval_generic gpu_sched drm_gpuvm drm_exec mxm_wmi rtsx_pci_sdmmc drm_buddy i2c_algo_bit ghash_clmulni_intel mmc_core video ttm sha512_ssse3 nvme sha256_ssse3 drm_display_helper nvme_core sha1_ssse3 ucsi_acpi nvme_auth rtsx_pci typec_ucsi cec typec wmi pinctrl_cannonlake serio_raw scsi_dh_rdac scsi_dh_emc scsi_dh_alua ip6_tables ip_tables fuse
Sep 16 16:29:53 p1 kernel: CPU: 7 UID: 0 PID: 3837 Comm: (sd-close) Not tainted 6.11.0-63.fc42.x86_64 #1
Sep 16 16:29:53 p1 kernel: Hardware name: LENOVO 20QT0016US/20QT0016US, BIOS N2OET65W (1.52 ) 08/05/2024
Sep 16 16:29:53 p1 kernel: RIP: 0010:nv50_sor_atomic_enable+0x7c4/0x840 [nouveau]
Sep 16 16:29:53 p1 kernel: Code: 49 89 c0 0f 88 b6 e1 02 00 0f b6 83 02 01 00 00 e9 27 fe ff ff ba 00 00 00 00 83 e9 0d 0f 48 ca ba 27 00 00 00 e9 85 fc ff ff <0f> 0b e9 aa fc ff ff 0f 0b e9 a3 fc ff ff 0f 0b e9 9c fc ff ff ba
Sep 16 16:29:53 p1 kernel: RSP: 0018:ffffab59466a39f8 EFLAGS: 00010293
Sep 16 16:29:53 p1 kernel: RAX: 0000000000000000 RBX: ffff9e9d13db6000 RCX: 0000000000000000
Sep 16 16:29:53 p1 kernel: RDX: 0000000000000027 RSI: 000000001fc8bfd0 RDI: 0000000000000008
Sep 16 16:29:53 p1 kernel: RBP: ffff9e9d01ab7000 R08: 00000000202fbf00 R09: 0000000000000004
Sep 16 16:29:53 p1 kernel: R10: 0000000000000000 R11: 0000000000000f00 R12: ffff9e9d13da3008
Sep 16 16:29:53 p1 kernel: R13: 00000000202fbf00 R14: 0a7c5ac471b47843 R15: 0000000000000000
Sep 16 16:29:53 p1 kernel: FS:  00007f409c039e00(0000) GS:ffff9ea02c580000(0000) knlGS:0000000000000000
Sep 16 16:29:53 p1 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Sep 16 16:29:53 p1 kernel: CR2: 00007ffc5a5bd3b0 CR3: 00000002019c4005 CR4: 00000000003706f0
Sep 16 16:29:53 p1 kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Sep 16 16:29:53 p1 kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Sep 16 16:29:53 p1 kernel: Call Trace:
Sep 16 16:29:53 p1 kernel:  <TASK>
Sep 16 16:29:53 p1 kernel:  ? nv50_sor_atomic_enable+0x7c4/0x840 [nouveau]
Sep 16 16:29:53 p1 kernel:  ? __warn.cold+0x8e/0xe8
Sep 16 16:29:53 p1 kernel:  ? nv50_sor_atomic_enable+0x7c4/0x840 [nouveau]
Sep 16 16:29:53 p1 kernel:  ? report_bug+0xff/0x140
Sep 16 16:29:53 p1 kernel:  ? handle_bug+0x3c/0x80
Sep 16 16:29:53 p1 kernel:  ? exc_invalid_op+0x17/0x70
Sep 16 16:29:53 p1 kernel:  ? asm_exc_invalid_op+0x1a/0x20
Sep 16 16:29:53 p1 kernel:  ? nv50_sor_atomic_enable+0x7c4/0x840 [nouveau]
Sep 16 16:29:53 p1 kernel:  ? nv50_sor_atomic_enable+0x254/0x840 [nouveau]
Sep 16 16:29:53 p1 kernel:  ? wait_for_completion_timeout+0x13b/0x170
Sep 16 16:29:53 p1 kernel:  ? drm_calc_timestamping_constants+0x10b/0x180
Sep 16 16:29:53 p1 kernel:  nv50_disp_atomic_commit_tail+0x247/0x790 [nouveau]
Sep 16 16:29:53 p1 kernel:  nv50_disp_atomic_commit+0x188/0x1b0 [nouveau]
Sep 16 16:29:53 p1 kernel:  drm_atomic_commit+0xa6/0xe0
Sep 16 16:29:53 p1 kernel:  ? __pfx___drm_printfn_info+0x10/0x10
Sep 16 16:29:53 p1 kernel:  drm_client_modeset_commit_atomic+0x203/0x250
Sep 16 16:29:53 p1 kernel:  drm_client_modeset_commit_locked+0x5a/0x160
Sep 16 16:29:53 p1 kernel:  drm_client_modeset_commit+0x25/0x40
Sep 16 16:29:53 p1 kernel:  drm_fb_helper_lastclose+0x49/0x80
Sep 16 16:29:53 p1 kernel:  drm_fbdev_ttm_client_restore+0x11/0x20 [drm_ttm_helper]
Sep 16 16:29:53 p1 kernel:  drm_client_dev_restore+0x69/0xc0
Sep 16 16:29:53 p1 kernel:  drm_release+0xbe/0xc0
Sep 16 16:29:53 p1 kernel:  __fput+0xee/0x2c0
Sep 16 16:29:53 p1 kernel:  __x64_sys_close+0x3c/0x80
Sep 16 16:29:53 p1 kernel:  do_syscall_64+0x82/0x160
Sep 16 16:29:53 p1 kernel:  ? do_fault+0x26e/0x470
Sep 16 16:29:53 p1 kernel:  ? __handle_mm_fault+0x7eb/0x1040
Sep 16 16:29:53 p1 kernel:  ? exc_page_fault+0x7e/0x180
Sep 16 16:29:53 p1 kernel:  ? __count_memcg_events+0x75/0x130
Sep 16 16:29:53 p1 kernel:  ? count_memcg_events.constprop.0+0x1a/0x30
Sep 16 16:29:53 p1 kernel:  ? handle_mm_fault+0x21b/0x330
Sep 16 16:29:53 p1 kernel:  ? do_user_addr_fault+0x55a/0x7b0
Sep 16 16:29:53 p1 kernel:  ? clear_bhb_loop+0x25/0x80
Sep 16 16:29:53 p1 kernel:  ? clear_bhb_loop+0x25/0x80
Sep 16 16:29:53 p1 kernel:  ? clear_bhb_loop+0x25/0x80
Sep 16 16:29:53 p1 kernel:  entry_SYSCALL_64_after_hwframe+0x76/0x7e
Sep 16 16:29:53 p1 kernel: RIP: 0033:0x7f409b717f74
Sep 16 16:29:53 p1 kernel: Code: 00 f7 d8 64 89 01 48 83 c8 ff c3 66 2e 0f 1f 84 00 00 00 00 00 90 f3 0f 1e fa 80 3d d5 60 0e 00 00 74 13 b8 03 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 3c c3 0f 1f 00 55 48 89 e5 48 83 ec 10 89 7d
Sep 16 16:29:53 p1 kernel: RSP: 002b:00007ffc69079fd8 EFLAGS: 00000202 ORIG_RAX: 0000000000000003
Sep 16 16:29:53 p1 kernel: RAX: ffffffffffffffda RBX: 00000000000000c2 RCX: 00007f409b717f74
Sep 16 16:29:53 p1 kernel: RDX: 0000000000000000 RSI: 00007f409baff26e RDI: 00000000000000c2
Sep 16 16:29:53 p1 kernel: RBP: 00007ffc69079ff0 R08: 0000000000000007 R09: 0000000000000007
Sep 16 16:29:53 p1 kernel: R10: 00007f409b72a406 R11: 0000000000000202 R12: 00000000000000c2
Sep 16 16:29:53 p1 kernel: R13: 0000000000000411 R14: 0000000000000000 R15: 0000000000000000
Sep 16 16:29:53 p1 kernel:  </TASK>
Sep 16 16:29:53 p1 kernel: ---[ end trace 0000000000000000 ]---
Sep 16 16:29:55 p1 kernel: nouveau 0000:01:00.0: DRM: core notifier timeout
Sep 16 16:29:57 p1 kernel: nouveau 0000:01:00.0: DRM: core notifier timeout
...

$ journalctl -k |head -1
Sep 16 16:28:15 p1 kernel: Linux version 6.11.0-63.fc42.x86_64 (mockbuild@270c597366cc4c1d96ed094db712ac94) (gcc (GCC) 14.2.1 20240912 (Red Hat 14.2.1-3), GNU ld version 2.43.1-1.fc42) #1 SMP PREEMPT_DYNAMIC Sun Sep 15 17:14:12 UTC 2024
$ journalctl -k |grep WARNING
Sep 16 16:29:53 p1 kernel: WARNING: CPU: 7 PID: 3837 at drivers/gpu/drm/nouveau/dispnv50/disp.c:1675 nv50_sor_atomic_enable+0x7c4/0x840 [nouveau]
Sep 16 16:30:48 p1 kernel: WARNING: CPU: 8 PID: 1187 at drivers/gpu/drm/nouveau/nvkm/subdev/gsp/r535.c:114 r535_gsp_msgq_wait+0x1af/0x1d0 [nouveau]
Sep 16 16:30:48 p1 kernel: WARNING: CPU: 8 PID: 1187 at drivers/gpu/drm/nouveau/nvkm/subdev/gsp/r535.c:2371 r535_gsp_fini+0x30f/0x340 [nouveau]

Comment 11 Satish Balay 2025-01-13 18:07:12 UTC
Issue persists with 6.12.8-200.fc41.x86_64

I've switched from nouveau to nvidia - thats works better with the external/usb-c monitor - with some quirks.

https://forums.developer.nvidia.com/t/bug-kfence-use-after-free-read-in-nv-dma-release-sgt-0x29-0x70-nvidia/304219

Comment 12 Aoife Moloney 2025-04-25 10:26:07 UTC
This message is a reminder that Fedora Linux 40 is nearing its end of life.
Fedora will stop maintaining and issuing updates for Fedora Linux 40 on 2025-05-13.
It is Fedora's policy to close all bug reports from releases that are no longer
maintained. At that time this bug will be closed as EOL if it remains open with a
'version' of '40'.

Package Maintainer: If you wish for this bug to remain open because you
plan to fix it in a currently maintained version, change the 'version' 
to a later Fedora Linux version. Note that the version field may be hidden.
Click the "Show advanced fields" button if you do not see it.

Thank you for reporting this issue and we are sorry that we were not 
able to fix it before Fedora Linux 40 is end of life. If you would still like 
to see this bug fixed and are able to reproduce it against a later version 
of Fedora Linux, you are encouraged to change the 'version' to a later version
prior to this bug being closed.

Comment 13 Aoife Moloney 2025-05-16 08:04:46 UTC
Fedora Linux 40 entered end-of-life (EOL) status on 2025-05-13.

Fedora Linux 40 is no longer maintained, which means that it
will not receive any further security or bug fix updates. As a result we
are closing this bug.

If you can reproduce this bug against a currently maintained version of Fedora Linux
please feel free to reopen this bug against that version. Note that the version
field may be hidden. Click the "Show advanced fields" button if you do not see
the version field.

If you are unable to reopen this bug, please file a new report against an
active release.

Thank you for reporting this bug and we are sorry it could not be fixed.


Note You need to log in before you can comment on or make changes to this bug.