Bug 2374607
| Summary: | kernel-6.16.0-0.rc3.31.fc43 maybe broke BTRFS root file system in F42 test: error btrfs_replay_log errno=-5 IO failure after 6.16rc3 freeze | ||
|---|---|---|---|
| Product: | [Fedora] Fedora | Reporter: | Christopher Klooz <py0xc3> |
| Component: | kernel | Assignee: | Kernel Maintainer List <kernel-maint> |
| Status: | CLOSED ERRATA | QA Contact: | Fedora Extras Quality Assurance <extras-qa> |
| Severity: | urgent | Docs Contact: | |
| Priority: | unspecified | ||
| Version: | 42 | CC: | acaringi, adscvr, agurenko, airlied, alex, awilliam, hdegoede, hpa, jforbes, josef, kdudka, kernel-maint, linville, masami256, mchehab, ptalbert, py0xc3, sam, steved, suraj.ghimire7 |
| Target Milestone: | --- | ||
| Target Release: | --- | ||
| Hardware: | x86_64 | ||
| OS: | Linux | ||
| Whiteboard: | |||
| Fixed In Version: | kernel-6.15.9-201.fc42 kernel-6.15.9-101.fc41 | Doc Type: | --- |
| Doc Text: | Story Points: | --- | |
| Clone Of: | Environment: | ||
| Last Closed: | 2025-08-06 01:36:16 UTC | Type: | --- |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | Category: | --- | |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | |||
| Attachments: | |||
|
Description
Christopher Klooz
2025-06-24 15:30:36 UTC
Created attachment 2094982 [details]
dmesg-log-after-mount-attempt-in-livesystem.txt
Created attachment 2094983 [details]
Emergency-mode-after-boot-of-6.16rc3-with-external-hdmi-monitor.jpg
Created attachment 2094984 [details]
Emergency-mode-after-boot-of-6.16rc3-without-external-hdmi-monitor.jpg
Created attachment 2094985 [details]
journalctl-emergency-boot-0(overlapping).jpg
Created attachment 2094986 [details]
journalctl-emergency-boot-3(overlapping).jpg
Created attachment 2094987 [details]
journalctl-emergency-boot-2(overlapping).jpg
Created attachment 2094988 [details]
journalctl-emergency-boot-1(overlapping).jpg
Created attachment 2094989 [details]
journalctl-emergency-boot-3_lines-on-right.jpg
There was a btrfs pull yesterday, so that may be resolved with kernel-6.16.0-0.rc3.250624g78f4e737a53e.32.fc43 (building now) I experienced what appears to be the same issue after upgrading to 6.15.4-200.fc42. btrfs check didn't detect anything wrong, but btrfs rescue zero-log fixed it. Ok, I now just got the same issue again with 6.15.6-200.fc42.x86_64 when I experienced an occurrence of AMD #4141 [1]. So the system froze with the errors of AMD #4141 (verified log entries after I could repair the btrfs), and then btrfs was broken. Just like above. So not sure if its just the btrfs that occasionally breaks when the system freezes (force reboot without umount etc), or if there is an error that causes this (which would mean it has been backported to any 6.15 kernel). I also mentioned it at the AMD ticket [2], just in case. Anyway, my tendency would be to assume it's just an fs that occasionally breaks when it regularly experiences system freezes without proper umounts and such (though the likelihood indeed makes me question that myself), but Mario Limonciello of AMD just mentioned in a post [3] that he does not exclude a relation of #4141 to a current btrfs issue. [1] https://gitlab.freedesktop.org/drm/amd/-/issues/4141 [2] https://gitlab.freedesktop.org/drm/amd/-/issues/4141#note_3013701 [3] https://gitlab.freedesktop.org/drm/amd/-/issues/4141#note_3013469 This is likely https://lore.kernel.org/linux-btrfs/fce139db-4458-4788-bb97-c29acf6cb1df@cachyos.org/ , there seem to be various reports of this issue with recent 6.16 and 6.15 kernels. FEDORA-2025-a3bae4266e (kernel-6.15.9-101.fc41) has been submitted as an update to Fedora 41. https://bodhi.fedoraproject.org/updates/FEDORA-2025-a3bae4266e FEDORA-2025-09b1545c7a (kernel-6.15.9-201.fc42) has been submitted as an update to Fedora 42. https://bodhi.fedoraproject.org/updates/FEDORA-2025-09b1545c7a FEDORA-2025-09b1545c7a has been pushed to the Fedora 42 testing repository. Soon you'll be able to install the update with the following command: `sudo dnf upgrade --enablerepo=updates-testing --refresh --advisory=FEDORA-2025-09b1545c7a` You can provide feedback for this update here: https://bodhi.fedoraproject.org/updates/FEDORA-2025-09b1545c7a See also https://fedoraproject.org/wiki/QA:Updates_Testing for more information on how to test updates. FEDORA-2025-a3bae4266e has been pushed to the Fedora 41 testing repository. Soon you'll be able to install the update with the following command: `sudo dnf upgrade --enablerepo=updates-testing --refresh --advisory=FEDORA-2025-a3bae4266e` You can provide feedback for this update here: https://bodhi.fedoraproject.org/updates/FEDORA-2025-a3bae4266e See also https://fedoraproject.org/wiki/QA:Updates_Testing for more information on how to test updates. I added a **careful** "thumbs up" in bodhi for BZ#2374607 with the possibility to reopen the ticket here: I cannot verify if the issue is solved given that it so far only occurred when I intentionally provoked the [AMD issue #4141](https://gitlab.freedesktop.org/drm/amd/-/issues/4141), and I can no longer intentionally provoke the AMD issue #4141. In the AMD ticket, the idea was put forward some time ago that the AMD issue #4141 maybe is just a manifest of the btrfs issue. This assumption is based only on some correlations, but because I have seen something like that before (my wifi driver's discovery function once caused freezes with amdgpu errors), and because I can no longer reproduce the AMD issue with this kernel (though I tried last time to provoke it on 6.15.6 that was still affected), I hope for the best and assume the btrfs issue is solved in this kernel. I re-report in bodhi & re-open the bug ticket if the issue reoccurs. Thanks for taking care :) FEDORA-2025-09b1545c7a (kernel-6.15.9-201.fc42) has been pushed to the Fedora 42 stable repository. If problem still persists, please make note of it in this bug report. FEDORA-2025-a3bae4266e (kernel-6.15.9-101.fc41) has been pushed to the Fedora 41 stable repository. If problem still persists, please make note of it in this bug report. |