Bug 2104883

Summary: Kernel >= 5.18.6 sometimes hangs after EFI stub
Product: [Fedora] Fedora Reporter: homann.philipp
Component: kernelAssignee: Kernel Maintainer List <kernel-maint>
Status: CLOSED CURRENTRELEASE QA Contact: Fedora Extras Quality Assurance <extras-qa>
Severity: unspecified Docs Contact:
Priority: unspecified    
Version: 36CC: acaringi, adscvr, airlied, alciregi, bskeggs, hdegoede, hpa, jarodwilson, jglisse, jonathan, josef, kernel-maint, lgoncalv, linville, masami256, mchehab, ptalbert, steved
Target Milestone: ---   
Target Release: ---   
Hardware: x86_64   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: ---
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2022-08-08 12:11:00 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description homann.philipp 2022-07-07 11:56:05 UTC
1. Please describe the problem:
Booting into kernel-5.18.6-200.fc36.x86_64 and higher sometimes hangs after GRUB.
The only thing printed to the screen is: "EFI stub: UEFI Secure Boot is enabled."
This does happen every time, but lets say on 5 of 10 boots.
Also CTRL+ALT+DEL does not trigger a reboot. I have to reset the system (HPE DL360 Gen10) manually.

When disabling secure boot everything works fine.


2. What is the Version-Release number of the kernel:
kernel-5.18.6-200.fc36.x86_64 and higher

3. Did it work previously in Fedora? If so, what kernel version did the issue
   *first* appear?  Old kernels are available for download at
   https://koji.fedoraproject.org/koji/packageinfo?packageID=8 :
kernel-5.17.13-300.fc36.x86_64


4. Can you reproduce this issue? If so, please provide the steps to reproduce
   the issue below:
Update kernel and do some reboots.

5. Does this problem occur with the latest Rawhide kernel? To install the
   Rawhide kernel, run ``sudo dnf install fedora-repos-rawhide`` followed by
   ``sudo dnf update --enablerepo=rawhide kernel``:
kernel-5.19.0-0.rc5.20220706gite35e5b6f695d.42.fc37 is also not working

6. Are you running any modules that not shipped with directly Fedora's kernel?:
No

7. Please attach the kernel logs. You can get the complete kernel log
   for a boot with ``journalctl --no-hostname -k > dmesg.txt``. If the
   issue occurred on a previous boot, use the journalctl ``-b`` flag.
No kernel logs since kernel is not booting

Comment 1 homann.philipp 2022-08-04 12:54:36 UTC
To clarify as another user might have the same problem (2111555): I'm on Intel (Xeon(R) Silver 4210 CPU @ 2.20GHz)
The problem also occurs with Secure Boot disabled.
But instead of the message (UEFI Secure Boot is enabled) there is just a stuck cursor on screen.

Comment 2 homann.philipp 2022-08-05 06:43:12 UTC
Just another side note:
Booting Arch Linux with kernel 5.18.7 works like a charm.
With SecureBoot disabled indeed as Arch does not support it out of the box.

Comment 3 homann.philipp 2022-08-05 11:24:23 UTC
Finally I've got a stacktrace on the serial port.
Its pretty the same as here:
https://lore.kernel.org/lkml/20220721180503.896050-1-toshi.kani@hpe.com/

Comment 4 homann.philipp 2022-08-08 12:11:00 UTC
The issues was fixed with this commit: https://github.com/torvalds/linux/commit/5e2805d5379619c4a2e3ae4994e73b36439f4bad
This was also backported to 5.18 and released in 5.18.15, which is available for Fedora.

I've tested the fix with 5.18.16, did several reboots and the issue seems to be gone.