RHEL Engineering is moving the tracking of its product development work on RHEL 6 through RHEL 9 to Red Hat Jira (issues.redhat.com). If you're a Red Hat customer, please continue to file support cases via the Red Hat customer portal. If you're not, please head to the "RHEL project" in Red Hat Jira and file new tickets here. Individual Bugzilla bugs in the statuses "NEW", "ASSIGNED", and "POST" are being migrated throughout September 2023. Bugs of Red Hat partners with an assigned Engineering Partner Manager (EPM) are migrated in late September as per pre-agreed dates. Bugs against components "kernel", "kernel-rt", and "kpatch" are only migrated if still in "NEW" or "ASSIGNED". If you cannot log in to RH Jira, please consult article #7032570. That failing, please send an e-mail to the RH Jira admins at rh-issues@redhat.com to troubleshoot your issue as a user management inquiry. The email creates a ServiceNow ticket with Red Hat. Individual Bugzilla bugs that are migrated will be moved to status "CLOSED", resolution "MIGRATED", and set with "MigratedToJIRA" in "Keywords". The link to the successor Jira issue will be found under "Links", have a little "two-footprint" icon next to it, and direct you to the "RHEL project" in Red Hat Jira (issue links are of type "https://issues.redhat.com/browse/RHEL-XXXX", where "X" is a digit). This same link will be available in a blue banner at the top of the page informing you that that bug has been migrated.
Bug 1965191 - kernel panic: RIP: 0010:__list_del_entry_valid.cold+0x31/0x47
Summary: kernel panic: RIP: 0010:__list_del_entry_valid.cold+0x31/0x47
Keywords:
Status: CLOSED WONTFIX
Alias: None
Product: Red Hat Enterprise Linux 9
Classification: Red Hat
Component: kernel-rt
Version: 9.0
Hardware: Unspecified
OS: Unspecified
unspecified
unspecified
Target Milestone: beta
: ---
Assignee: Juri Lelli
QA Contact: Boyang Xue
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2021-05-27 06:33 UTC by Bruno Goncalves
Modified: 2022-11-27 07:27 UTC (History)
5 users (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2022-11-27 07:27:41 UTC
Type: Bug
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)

Description Bruno Goncalves 2021-05-27 06:33:50 UTC
Description of problem:
During CKI test for kernel-rt build for RHEL-9 we hit a kernel panic:

[ 6380.145284] ./checking generic/464 
[ 6381.143639] run fstests generic/464 at 2021-05-26 09:59:30 
[ 6410.078901] restraintd[1021]: *** Current Time: Wed May 26 09:59:59 2021  Localwatchdog at: Wed May 26 15:47:58 2021 
[-- MARK -- Wed May 26 14:00:00 2021] 
[ 6436.184205] list_del corruption. prev->next should be fffff3f9442802c8, but was fffff3f944633608 
[ 6436.184232] ------------[ cut here ]------------ 
[ 6436.184234] kernel BUG at lib/list_debug.c:51! 
[ 6437.184752] invalid opcode: 0000 [#1] PREEMPT_RT SMP PTI 
[ 6437.296230] CPU: 10 PID: 614973 Comm: nfsd Tainted: G               X --------- ---  5.13.0-0.rc3.25.rt3.3.tst.el9.x86_64 #1 
[ 6437.346429] Hardware name: HP ProLiant DL120 Gen9, BIOS P86 07/20/2015 
[ 6437.376473] RIP: 0010:__list_del_entry_valid.cold+0x31/0x47 
[ 6437.402089] Code: 21 37 a6 e8 b8 08 ff ff 0f 0b 48 c7 c7 38 22 37 a6 e8 aa 08 ff ff 0f 0b 48 89 f2 48 89 fe 48 c7 c7 f8 21 37 a6 e8 96 08 ff ff <0f> 0b 48 89 fe 4c 89 c2 48 c7 c7 c0 21 37 a6 e8 82 08 ff ff 0f 0b 
[ 6437.485967] RSP: 0018:ffffbabac18afe00 EFLAGS: 00010086 
[ 6437.509359] RAX: 0000000000000054 RBX: ffff9718cec982d8 RCX: 0000000000000000 
[ 6437.541499] RDX: 0000000000000000 RSI: ffff971c3fc97dd0 RDI: 00000000ffffffff 
[ 6437.574239] RBP: 0000000000000001 R08: ffffffffa6a77cc0 R09: 000000000000000f 
[ 6437.606213] R10: 000000000000000f R11: ffffffffa77d9916 R12: 0000000000000002 
[ 6437.638463] R13: ffff971c3ffd4d80 R14: fffff3f9442802c8 R15: fffff3f9442802c0 
[ 6437.670357] FS:  0000000000000000(0000) GS:ffff971c3fc80000(0000) knlGS:0000000000000000 
[ 6437.706866] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033 
[ 6437.734967] CR2: 000055e728300708 CR3: 000000017180e002 CR4: 00000000001706e0 
[ 6437.770049] Call Trace: 
[ 6437.782515]  __alloc_pages_bulk+0x2ae/0x700 
[ 6437.801746]  svc_recv+0x8a/0x350 [sunrpc] 
[ 6437.819715]  ? svc_xprt_release+0xc9/0x160 [sunrpc] 
[ 6437.841255]  ? nfsd_shutdown_threads+0x90/0x90 [nfsd] 
[ 6437.863887]  nfsd+0xdb/0x150 [nfsd] 
[ 6437.879509]  kthread+0x186/0x1a0 
[ 6437.893950]  ? __kthread_parkme+0xa0/0xa0 
[ 6437.911911]  ret_from_fork+0x22/0x30 
[ 6437.927952] Modules linked in: rpcsec_gss_krb5 nfsv4 dns_resolver nfs fscache netfs nfsd auth_rpcgss nfs_acl lockd grace dm_log_writes dm_flakey dm_mod rfkill sunrpc intel_rapl_msr intel_rapl_common sb_edac x86_pkg_temp_thermal iTCO_wdt intel_powerclamp iTCO_vendor_support coretemp rapl intel_cstate mgag200 intel_uncore pcspkr drm_kms_helper ipmi_ssif syscopyarea sysfillrect sysimgblt fb_sys_fops i2c_i801 hpilo lpc_ich cec hpwdt i2c_smbus acpi_ipmi ipmi_si ipmi_devintf acpi_tad ipmi_msghandler ioatdma ext4 acpi_power_meter mbcache vfat jbd2 fat fuse drm ip_tables xfs libcrc32c crct10dif_pclmul sd_mod crc32_pclmul t10_pi crc32c_intel ghash_clmulni_intel igb ahci i2c_algo_bit libahci dca libata hpsa tg3 scsi_transport_sas wmi 
[ 6438.216491] ---[ end trace 0000000000000002 ]--- 
[ 6439.216692] printk: enabled sync mode 
[ 6439.261347] RIP: 0010:__list_del_entry_valid.cold+0x31/0x47 
[ 6439.286605] Code: 21 37 a6 e8 b8 08 ff ff 0f 0b 48 c7 c7 38 22 37 a6 e8 aa 08 ff ff 0f 0b 48 89 f2 48 89 fe 48 c7 c7 f8 21 37 a6 e8 96 08 ff ff <0f> 0b 48 89 fe 4c 89 c2 48 c7 c7 c0 21 37 a6 e8 82 08 ff ff 0f 0b 
[ 6439.370555] RSP: 0018:ffffbabac18afe00 EFLAGS: 00010086 
[ 6439.393893] RAX: 0000000000000054 RBX: ffff9718cec982d8 RCX: 0000000000000000 
[ 6439.427105] RDX: 0000000000000000 RSI: ffff971c3fc97dd0 RDI: 00000000ffffffff 
[ 6439.459824] RBP: 0000000000000001 R08: ffffffffa6a77cc0 R09: 000000000000000f 
[ 6439.491896] R10: 000000000000000f R11: ffffffffa77d9916 R12: 0000000000000002 
[ 6439.525285] R13: ffff971c3ffd4d80 R14: fffff3f9442802c8 R15: fffff3f9442802c0 
[ 6439.558939] FS:  0000000000000000(0000) GS:ffff971c3fc80000(0000) knlGS:0000000000000000 
[ 6439.597300] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033 
[ 6439.624050] CR2: 000055e728300708 CR3: 000000017180e002 CR4: 00000000001706e0 
[ 6439.656237] Kernel panic - not syncing: 
[ 6439.673415] Fatal exception 
[ 6439.673568] Kernel Offset: 0x24000000 from 0xffffffff81000000 (relocation range: 0xffffffff80000000-0xffffffffbfffffff) 
[ 6439.736455] ---[ end Kernel panic - not syncing: Fatal exception ]--- 

Version-Release number of selected component (if applicable):
kernel 5.13.0-0.rc3.25.rt3.3.tst.el9.x86_64

We hit this on a scratch build, but it is likely this problem to also be on  kernel-rt-5.13.0-0.rc3.25.rt3.3.el9, but we didn't reproduce it there when CKI tested it.

How reproducible:
Not sure

Steps to Reproduce:
1. run xfstests - nfsv4.2 test

Comment 7 RHEL Program Management 2022-11-27 07:27:41 UTC
After evaluating this issue, there are no plans to address it further or fix it in an upcoming release.  Therefore, it is being closed.  If plans change such that this issue will be fixed in an upcoming release, then the bug can be reopened.


Note You need to log in before you can comment on or make changes to this bug.