(In reply to Frank Sorenson from comment #6)
> closing this for now... if they or someone else hits this in the future,
> we'll know it's not the first, and perhaps there will be more information
If we see this again we will need to include memory management folks on the
cc.
It looks like this could be related to the relatively new page folios
implementation.
Description of problem:

System crashed with BUG due to nonzero pages counter, despite having an empty page tree:

[8004333.840863] kernel BUG at fs/inode.c:518!

PID: 1  TASK: ffff973100c19ec0  CPU: 66  COMMAND: "systemd"
    [exception RIP: clear_inode+0x81]
 #7 [ffffbcd5c0037e20] evict at ffffffff93f36e6b
 #8 [ffffbcd5c0037e40] __dentry_kill at ffffffff93f32975
 #9 [ffffbcd5c0037e60] dentry_kill at ffffffff93f335cd
#10 [ffffbcd5c0037e88] dput at ffffffff93f337e9
#11 [ffffbcd5c0037ea0] __fput at ffffffff93f1a3af
#12 [ffffbcd5c0037ee8] task_work_run at ffffffff93d01eaa
#13 [ffffbcd5c0037f20] exit_to_usermode_loop at ffffffff93c03bbb
#14 [ffffbcd5c0037f38] do_syscall_64 at ffffffff93c04348
#15 [ffffbcd5c0037f50] entry_SYSCALL_64_after_hwframe at ffffffff946000ad

void clear_inode(struct inode *inode)
...
	xa_lock_irq(&inode->i_data.i_pages);
	BUG_ON(inode->i_data.nrpages);    <<<< location of crash

called from evict():

	if (op->evict_inode) {
		op->evict_inode(inode);
	} else {
		truncate_inode_pages_final(&inode->i_data);
		clear_inode(inode);

nrpages is indeed nonzero, however the page tree is empty:

crash> inode.i_data ffff9730011f30b0 -ox
struct inode {
  [ffff9730011f3228] struct address_space i_data;
}
crash> address_space.nrpages ffff9730011f3228
  nrpages = 32,
crash> address_space.i_pages ffff9730011f3228
  i_pages = {
    xa_lock = {
      rlock = {
        raw_lock = {
          val = {
            counter = 1    <<<--------- locked on our codepath
...
    xa_flags = 33,    ---> XA_FLAGS_LOCK_IRQ|XA_FLAGS_ACCOUNT
    xa_head = 0x0,    <<<----------- empty tree !

crash> inode.i_fop,i_sb,i_state ffff9730011f30b0
  i_fop = 0xffffffff94a3c740 <pipefifo_fops>,
  i_sb = 0xffff975eaf23c800,
  i_state = 0x27,    -------------> I_DIRTY_SYNC|I_DIRTY_DATASYNC|I_DIRTY_PAGES|I_FREEING
crash> super_block.s_id 0xffff975eaf23c800
  s_id = "pipefs",

Version-Release number of selected component (if applicable):
kernel-4.18.0-305.57.1.el8_4

How reproducible:
Unknown; crash experienced once thus far

Steps to Reproduce:
unknown

Actual results:
kernel crash

Expected results:
no crash

Additional info:
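
For anyone re-checking the flag annotations in the crash output above (i_state = 0x27 and xa_flags = 33), the decoding can be sketched as a small userspace helper. This is only an illustration, not kernel code: the struct flag_name type and the decode_flags() helper are hypothetical names made up here, and the bit values in both tables are assumptions taken from include/linux/fs.h and include/linux/xarray.h of a 4.18-era tree. They should be verified against the exact kernel source for the vmcore being analyzed.

```c
#include <stddef.h>
#include <stdio.h>

/* One named bit in a flags word (hypothetical helper type). */
struct flag_name {
	unsigned long bit;
	const char *name;
};

/* Assumed inode->i_state bits (include/linux/fs.h, 4.18-era). */
static const struct flag_name i_state_flags[] = {
	{ 1UL << 0, "I_DIRTY_SYNC" },
	{ 1UL << 1, "I_DIRTY_DATASYNC" },
	{ 1UL << 2, "I_DIRTY_PAGES" },
	{ 1UL << 3, "I_NEW" },
	{ 1UL << 4, "I_WILL_FREE" },
	{ 1UL << 5, "I_FREEING" },
	{ 1UL << 6, "I_CLEAR" },
	{ 1UL << 7, "I_SYNC" },
};

/* Assumed xarray xa_flags bits (include/linux/xarray.h). */
static const struct flag_name xa_flag_names[] = {
	{ 1UL << 0, "XA_FLAGS_LOCK_IRQ" },
	{ 1UL << 1, "XA_FLAGS_LOCK_BH" },
	{ 1UL << 2, "XA_FLAGS_TRACK_FREE" },
	{ 1UL << 5, "XA_FLAGS_ACCOUNT" },
};

/*
 * Write the names of all bits set in 'value' into 'out', separated by '|'.
 * 'outlen' must be at least 1. Returns the bits that had no entry in
 * 'table', so the caller can tell whether the decode was complete.
 */
static unsigned long decode_flags(unsigned long value,
				  const struct flag_name *table, size_t n,
				  char *out, size_t outlen)
{
	size_t used = 0;

	out[0] = '\0';
	for (size_t i = 0; i < n; i++) {
		int w;

		if (!(value & table[i].bit))
			continue;
		value &= ~table[i].bit;
		w = snprintf(out + used, outlen - used, "%s%s",
			     used ? "|" : "", table[i].name);
		if (w < 0 || (size_t)w >= outlen - used)
			break;	/* output buffer too small; stop appending */
		used += (size_t)w;
	}
	return value;
}
```

Calling decode_flags() with 0x27 and i_state_flags, or with 33 and xa_flag_names, and a buffer of 128 bytes or so reproduces the two annotations above. A nonzero return value would mean the table is missing a bit for this kernel version.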