Bug 2252447 - kwin_wayland hangs with nvidia 545 driver desktop unusable
Summary: kwin_wayland hangs with nvidia 545 driver desktop unusable
Keywords:
Status: CLOSED EOL
Alias: None
Product: Fedora
Classification: Fedora
Component: kwin
Version: 39
Hardware: x86_64
OS: Linux
unspecified
high
Target Milestone: ---
Assignee: Rex Dieter
QA Contact: Fedora Extras Quality Assurance
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2023-12-01 18:09 UTC by Barry Scott
Modified: 2024-11-27 22:12 UTC (History)
8 users (show)

Fixed In Version:
Clone Of:
Environment:
Last Closed: 2024-11-27 22:12:07 UTC
Type: ---
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
KDE Software Compilation 478251 0 NOR UNCONFIRMED kwin_wayland with nvidia 545 driver drops user back to login after a few seconds 2023-12-08 11:13:48 UTC
RPM Fusion 6807 0 P1 NEW The nvidia 545 driver crashes kwin_wayland 2023-12-02 10:42:58 UTC

Description Barry Scott 2023-12-01 18:09:21 UTC
This is a reproducible problem with akmod-nvidia-545.29.06-1.fc39.x86_64
My GPU is a RTX3060.

When I login there is a long pause with screen black and a cursor blinking in the top-left corner. Then the KDE plasma desktop appears.
I can start apps and the are working. But after about 10 seconds I get throw back the login screen.
Here are the journal --user logs.

2023-12-01T17:50:34+0000 plasmashell[4703]: qt.qpa.wayland: Wayland does not support QWindow::requestActivate()
2023-12-01T17:50:35+0000 plasmashell[4703]: QString::arg: 2 argument(s) missing in org.barrys-emacs.scm-workbench
2023-12-01T17:50:35+0000 systemd[4184]: Started app-org.barrys\x2demacs.scm\x2dworkbench-bf73e11b6a40477fb99b021e4439cbb9.scope - SCM Workbench.
2023-12-01T17:50:36+0000 kwin_wayland[4567]: kf.service.services: The desktop entry file "/usr/share/applications/qemu.desktop" has Type= "Application" but has no Exec field.
2023-12-01T17:50:36+0000 kwin_wayland[4567]: kf.service.services: The desktop entry file "/usr/share/applications/org.freedesktop.Xwayland.desktop" has Type= "Application" but has no Exec field.
2023-12-01T17:50:46+0000 plasmashell[4703]: org.kde.plasma.pulseaudio: No object for name "alsa_output.pci-0000_00_1f.3.analog-stereo.monitor"
2023-12-01T17:50:46+0000 kwin_wayland[4567]: kwin_wayland_drm: Atomic commit failed! Invalid argument
2023-12-01T17:50:46+0000 kwin_wayland[4567]: kwin_wayland_drm: Presentation failed! Invalid argument
2023-12-01T17:50:46+0000 kwin_wayland[4567]: kwin_core: Applying KScreen config failed!
2023-12-01T17:50:46+0000 plasmashell[4703]: org.kde.plasma.pulseaudio: No object for name "alsa_output.pci-0000_00_1f.3.analog-stereo"
2023-12-01T17:50:46+0000 plasmashell[4703]: org.kde.plasma.pulseaudio: No object for name "alsa_output.pci-0000_00_1f.3.analog-stereo.monitor"
2023-12-01T17:50:46+0000 plasmashell[4703]: org.kde.plasma.pulseaudio: No object for name "@DEFAULT_SINK@"
2023-12-01T17:50:46+0000 plasmashell[4703]: org.kde.plasma.pulseaudio: No object for name "@DEFAULT_SOURCE@"
2023-12-01T17:50:46+0000 plasmashell[4703]: org.kde.plasma.pulseaudio: No object for name "@DEFAULT_SINK@"
2023-12-01T17:50:46+0000 plasmashell[4703]: org.kde.plasma.pulseaudio: No object for name "@DEFAULT_SOURCE@"
2023-12-01T17:50:46+0000 plasmashell[4703]: org.kde.plasma.pulseaudio: No object for name "auto_null.monitor"
2023-12-01T17:50:46+0000 kwin_wayland[4567]: kwin_wayland_drm: Atomic commit failed! Permission denied
2023-12-01T17:50:46+0000 kwin_wayland[4567]: kwin_wayland_drm: Presentation failed! Permission denied
2023-12-01T17:50:46+0000 kwin_wayland[4567]: kwin_core: Applying KScreen config failed!
2023-12-01T17:50:46+0000 kwin_wayland[4567]: kwin_core: Applying KScreen config failed!

Here is the output of dmesg

$ dmesg | grep -i nvidia
[  +0.000000] Command line: BOOT_IMAGE=(hd0,gpt2)/vmlinuz-6.6.2-201.fc39.x86_64 root=UUID=f160dd82-834b-4cfa-8ee7-9c159b2a1b7b ro rootflags=subvol=root rd.luks.uuid=luks-904db66b-db23-4719-bbf6-fb596c23d831 initcall_blacklist=simpledrm_platform_driver_init nvidia-drm.modeset=1 rd.driver.blacklist=nouveau modprobe.blacklist=nouveau
[  +0.000011] Kernel command line: BOOT_IMAGE=(hd0,gpt2)/vmlinuz-6.6.2-201.fc39.x86_64 root=UUID=f160dd82-834b-4cfa-8ee7-9c159b2a1b7b ro rootflags=subvol=root rd.luks.uuid=luks-904db66b-db23-4719-bbf6-fb596c23d831 initcall_blacklist=simpledrm_platform_driver_init nvidia-drm.modeset=1 rd.driver.blacklist=nouveau modprobe.blacklist=nouveau
[  +0.011260] input: HDA NVidia HDMI/DP,pcm=3 as /devices/pci0000:00/0000:00:1c.4/0000:06:00.1/sound/card1/input11
[  +0.000614] input: HDA NVidia HDMI/DP,pcm=7 as /devices/pci0000:00/0000:00:1c.4/0000:06:00.1/sound/card1/input12
[  +0.000584] input: HDA NVidia HDMI/DP,pcm=8 as /devices/pci0000:00/0000:00:1c.4/0000:06:00.1/sound/card1/input13
[  +0.000633] input: HDA NVidia HDMI/DP,pcm=9 as /devices/pci0000:00/0000:00:1c.4/0000:06:00.1/sound/card1/input14
[  +0.079026] nvidia: loading out-of-tree module taints kernel.
[  +0.000535] nvidia: module license 'NVIDIA' taints kernel.
[  +0.000520] nvidia: module verification failed: signature and/or required key missing - tainting kernel
[  +0.000528] nvidia: module license taints kernel.
[  +0.113438] nvidia-nvlink: Nvlink Core is being initialized, major device number 235
[  +0.001374] nvidia 0000:06:00.0: vgaarb: VGA decodes changed: olddecodes=io+mem,decodes=none:owns=none
[  +0.048729] NVRM: loading NVIDIA UNIX x86_64 Kernel Module  545.29.06  Thu Nov 16 01:59:08 UTC 2023
[  +0.066443] nvidia_uvm: module uses symbols nvUvmInterfaceDisableAccessCntr from proprietary module nvidia, inheriting taint.
[  +0.066863] nvidia-uvm: Loaded the UVM driver, major device number 511.
[  +0.037940] nvidia-modeset: Loading NVIDIA Kernel Mode Setting Driver for UNIX platforms  545.29.06  Thu Nov 16 01:47:29 UTC 2023
[  +0.005739] [drm] [nvidia-drm] [GPU ID 0x00000600] Loading driver
[  +1.072910] [drm] Initialized nvidia-drm 0.0.0 20160202 for 0000:06:00.0 on minor 0
[  +0.000021] nvidia 0000:06:00.0: vgaarb: deactivate vga console
[  +0.127493] fbcon: nvidia-drmdrmfb (fb0) is primary device
[  +0.012371] nvidia 0000:06:00.0: [drm] fb0: nvidia-drmdrmfb frame buffer device


Reproducible: Always

Steps to Reproduce:
1. upgrade to akmod-nvidia-545.29.06-1.fc39.x86_64
2. build nvidia drivers
3. reboot
4. login to plasma (wayland)
5. there is pause before desktop appears
6. start show apps, I use konsole and the scm workbench app
7. after 10 seconds you are returned to the login screen
8. new login attempts will not load a desktop

Actual Results:  
kwin_wayland breaks


Expected Results:  
desktop is stable

Comment 1 Barry Scott 2023-12-01 18:10:12 UTC
I raised this against hte rpmfusion driver https://bugzilla.rpmfusion.org/show_bug.cgi?id=6807

Comment 2 Alessandro Astone 2023-12-02 11:03:32 UTC
Can you try booting without `initcall_blacklist=simpledrm_platform_driver_init nvidia-drm.modeset=1` in the kernel cmdline? Beware that it only works with the 545 driver

Comment 3 Barry Scott 2023-12-03 11:27:42 UTC
That looks to make things work.

I'll run with that config and test cold vs warn boot and multiple login attempts.

Comment 4 Barry Scott 2023-12-03 11:37:59 UTC
On a cold boot I see:

2023-12-03T11:29:11+0000 systemd[4162]: Starting plasma-kwin_wayland.service - KDE Window Manager...
2023-12-03T11:29:11+0000 systemd[4162]: Started plasma-kwin_wayland.service - KDE Window Manager.
2023-12-03T11:30:10+0000 kwin_wayland[4488]: kwin_wayland_drm: Atomic commit failed! Permission denied
2023-12-03T11:30:10+0000 kwin_wayland[4488]: kwin_wayland_drm: Presentation failed! Permission denied

After login there is a ~30s delay before seeing the desktop.
I then started konsole and awaited.
Was back at the login screen.

Warm booted from this state the same failure.

2023-12-03T11:36:32+0000 plasmashell[4619]: org.kde.plasma.pulseaudio: No object for name "alsa_output.pci-0000_00_1f.3.analog-stereo.monitor"
2023-12-03T11:36:32+0000 kwin_wayland[4483]: kwin_wayland_drm: Atomic commit failed! Invalid argument
2023-12-03T11:36:32+0000 kwin_wayland[4483]: kwin_wayland_drm: Presentation failed! Invalid argument
2023-12-03T11:36:32+0000 kwin_wayland[4483]: kwin_core: Applying KScreen config failed!
2023-12-03T11:36:32+0000 plasmashell[4619]: org.kde.plasma.pulseaudio: No object for name "alsa_output.pci-0000_00_1f.3.analog-stereo"
2023-12-03T11:36:32+0000 plasmashell[4619]: org.kde.plasma.pulseaudio: No object for name "alsa_output.pci-0000_00_1f.3.analog-stereo.monitor"
2023-12-03T11:36:32+0000 plasmashell[4619]: org.kde.plasma.pulseaudio: No object for name "@DEFAULT_SINK@"
2023-12-03T11:36:32+0000 plasmashell[4619]: org.kde.plasma.pulseaudio: No object for name "@DEFAULT_SOURCE@"
2023-12-03T11:36:32+0000 plasmashell[4619]: org.kde.plasma.pulseaudio: No object for name "@DEFAULT_SINK@"
2023-12-03T11:36:32+0000 plasmashell[4619]: org.kde.plasma.pulseaudio: No object for name "@DEFAULT_SOURCE@"
2023-12-03T11:36:32+0000 plasmashell[4619]: org.kde.plasma.pulseaudio: No object for name "auto_null.monitor"
2023-12-03T11:36:32+0000 kwin_wayland[4483]: kwin_core: Applying KScreen config failed!
2023-12-03T11:36:32+0000 kwin_wayland[4483]: kwin_core: Applying KScreen config failed!
2023-12-03T11:36:32+0000 kwin_wayland[4483]: kwin_core: Applying KScreen config failed!
2023-12-03T11:36:32+0000 kwin_wayland[4483]: kwin_core: Applying KScreen config failed!

Comment 5 Barry Scott 2023-12-04 20:08:50 UTC
I downgraded kwin-wayland:

$ dnf history info 430
Transaction ID : 430
Begin time     : Sun 03 Dec 2023 11:38:26 GMT
Begin rpmdb    : afac498ef5c8a090b1aa4f0f37d630685377726e701c5568ae9e7eab4f68db95
End time       : Sun 03 Dec 2023 11:38:28 GMT (2 seconds)
End rpmdb      : 5f81cef5f534cd7be3ad316d5f31c598b3cc3e8fe51047691dd2bdc74f984dac
User           : root <root>
Return-Code    : Success
Releasever     : 39
Command Line   : downgrade kwin-wayland
Comment        :
Packages Altered:
    Downgrade  kwin-5.27.8-1.fc39.x86_64         @fedora
    Downgraded kwin-5.27.9-3.fc39.x86_64         @@System
    Downgrade  kwin-common-5.27.8-1.fc39.x86_64  @fedora
    Downgraded kwin-common-5.27.9-3.fc39.x86_64  @@System
    Downgrade  kwin-libs-5.27.8-1.fc39.x86_64    @fedora
    Downgraded kwin-libs-5.27.9-3.fc39.x86_64    @@System
    Downgrade  kwin-wayland-5.27.8-1.fc39.x86_64 @fedora
    Downgraded kwin-wayland-5.27.9-3.fc39.x86_64 @@System
    Downgrade  kwin-x11-5.27.8-1.fc39.x86_64     @fedora
    Downgraded kwin-x11-5.27.9-3.fc39.x86_64     @@System

Still have the nvidia 545 driver installed.

Here is kernel cmdline:

$ cat /proc/cmdline
BOOT_IMAGE=(hd0,gpt2)/vmlinuz-6.6.2-201.fc39.x86_64 root=UUID=f160dd82-834b-4cfa-8ee7-9c159b2a1b7b ro rootflags=subvol=root rd.luks.uuid=luks-904db66b-db23-4719-bbf6-fb596c23d831 rd.driver.blacklist=nouveau modprobe.blacklist=nouveau initcall_blacklist=simpledrm_platform_driver_init nvidia-drm.modeset=1

Details of graphics:

$ inxi -G
Graphics:
  Device-1: NVIDIA GA106 [GeForce RTX 3060 Lite Hash Rate] driver: nvidia v: 545.29.06
  Display: server: X.org v: 1.20.14 with: Xwayland v: 23.2.2 driver: X: loaded: nvidia
    unloaded: fbdev,modesetting,nouveau,vesa gpu: nvidia,nvidia-nvswitch tty: 120x36
    resolution: 3840x2160
  API: EGL v: 1.5 drivers: nvidia platforms: gbm
  API: OpenGL v: 4.6.0 vendor: nvidia v: 545.29.06 note: console (EGL sourced) renderer: NVIDIA
    GeForce RTX 3060/PCIe/SSE2
  API: Vulkan v: 1.3.268 drivers: nvidia,llvmpipe surfaces: N/A

From cold boot can login and stay logged in.
desktop appears after ~2s of login.

After warm boot can login and stay logged in.

Comment 6 Barry Scott 2023-12-06 10:51:53 UTC
It seems that from a cold boot I will still see a problem on login.
The desktop does not appear until about 30s passes and then I'll be kicked back to the login.

But a warm reboot from that state will get me to a working desktop and I can stay logged in.
The desktop appears in about 2s from login.

Comment 7 Neal Gompa 2023-12-07 15:14:27 UTC
Can you please file a bug about this upstream with kwin? https://bugs.kde.org/enter_bug.cgi?product=kwin&component=platform-drm

Comment 8 Barry Scott 2023-12-08 09:28:58 UTC
Filed with upstream: https://bugs.kde.org/show_bug.cgi?id=478251

Comment 9 Aoife Moloney 2024-11-13 10:11:26 UTC
This message is a reminder that Fedora Linux 39 is nearing its end of life.
Fedora will stop maintaining and issuing updates for Fedora Linux 39 on 2024-11-26.
It is Fedora's policy to close all bug reports from releases that are no longer
maintained. At that time this bug will be closed as EOL if it remains open with a
'version' of '39'.

Package Maintainer: If you wish for this bug to remain open because you
plan to fix it in a currently maintained version, change the 'version' 
to a later Fedora Linux version. Note that the version field may be hidden.
Click the "Show advanced fields" button if you do not see it.

Thank you for reporting this issue and we are sorry that we were not 
able to fix it before Fedora Linux 39 is end of life. If you would still like 
to see this bug fixed and are able to reproduce it against a later version 
of Fedora Linux, you are encouraged to change the 'version' to a later version
prior to this bug being closed.

Comment 10 Aoife Moloney 2024-11-27 22:12:07 UTC
Fedora Linux 39 entered end-of-life (EOL) status on 2024-11-26.

Fedora Linux 39 is no longer maintained, which means that it
will not receive any further security or bug fix updates. As a result we
are closing this bug.

If you can reproduce this bug against a currently maintained version of Fedora Linux
please feel free to reopen this bug against that version. Note that the version
field may be hidden. Click the "Show advanced fields" button if you do not see
the version field.

If you are unable to reopen this bug, please file a new report against an
active release.

Thank you for reporting this bug and we are sorry it could not be fixed.


Note You need to log in before you can comment on or make changes to this bug.