Bug 2066214 - [ark] divide error: 0000 [#1] PREEMPT SMP NOPTI: RIP: 0010:bfqq_request_over_limit+0x1d9/0x5f0
Summary: [ark] divide error: 0000 [#1] PREEMPT SMP NOPTI: RIP: 0010:bfqq_request_over_...
Keywords:
Status: CLOSED CURRENTRELEASE
Alias: None
Product: Fedora
Classification: Fedora
Component: kernel
Version: rawhide
Hardware: Unspecified
OS: Unspecified
unspecified
unspecified
Target Milestone: ---
Assignee: Kernel Maintainer List
QA Contact: Fedora Extras Quality Assurance
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2022-03-21 09:29 UTC by Bruno Goncalves
Modified: 2022-06-08 14:02 UTC (History)
19 users (show)

Fixed In Version: 5.17.5
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2022-06-08 14:02:04 UTC
Type: Bug
Embargoed:


Attachments (Terms of Use)

Description Bruno Goncalves 2022-03-21 09:29:20 UTC
1. Please describe the problem:
During CKI stress-ng - os test [1] we hit the following issue:

[28574.729271] capability: warning: `stress-ng' uses 32-bit capabilities (legacy support in use) 
[28682.013232] restraintd[861]: *** Current Time: Sat Mar 19 20:35:50 2022  Localwatchdog at: Sat Mar 19 21:34:50 2022 
[28622.197403] sched: DL replenish lagged too much 
[28742.655967] restraintd[861]: *** Current Time: Sat Mar 19 20:36:51 2022  Localwatchdog at: Sat Mar 19 21:34:50 2022 
[28802.009997] restraintd[861]: *** Current Time: Sat Mar 19 20:37:50 2022  Localwatchdog at: Sat Mar 19 21:34:50 2022 
[28770.133641] Attempt to set a LOCK_MAND lock via flock(2). This support has been removed and the request ignored. 
[28862.009849] restraintd[861]: *** Current Time: Sat Mar 19 20:38:50 2022  Localwatchdog at: Sat Mar 19 21:34:50 2022 
[28922.029594] restraintd[861]: *** Current Time: Sat Mar 19 20:39:50 2022  Localwatchdog at: Sat Mar 19 21:34:50 2022 
[-- MARK -- Sun Mar 20 00:40:00 2022] 
[28868.111792] divide error: 0000 [#1] PREEMPT SMP NOPTI 
[28868.140603] CPU: 1 PID: 1778513 Comm: stress-ng Tainted: G           OE    --------- ---  5.17.0-0.rc8.34e047aa16c0.126.test.fc37.x86_64 #1 
[28868.210839] Hardware name: HP ProLiant SL4545 G7/, BIOS A31 11/02/2013 
[28868.245550] RIP: 0010:bfqq_request_over_limit+0x1d9/0x5f0 
[28868.271622] Code: 83 c3 38 48 8b 13 83 c0 01 48 83 c3 30 48 8d 0c ca 39 04 24 7d ed 41 8b 46 50 48 89 ca 48 d1 ea 0f af c5 48 98 48 01 d0 31 d2 <48> f7 f1 89 c5 41 3b 46 48 0f 8f ae 01 00 00 49 8b 44 24 08 48 8b 
[28868.365270] RSP: 0018:ffffb5ef08727aa0 EFLAGS: 00010046 
[28868.393375] RAX: 0000000000000f00 RBX: ffffa00f7de39988 RCX: 0000000000000000 
[28868.427670] RDX: 0000000000000000 RSI: 0000000000000064 RDI: ffffa00f7de39898 
[28868.465239] RBP: 00000000000000c0 R08: 0000000000000002 R09: 000000003312476a 
[28868.499491] R10: 0000000000000001 R11: 0000000000000001 R12: ffffa00ed08d1b00 
[28868.536219] R13: 0000000000000001 R14: ffffa00ed08d1b88 R15: 0000000000000000 
[28868.570578] FS:  00007f63a8a40740(0000) GS:ffffa012ace00000(0000) knlGS:0000000000000000 
[28868.612266] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033 
[28868.641367] CR2: 00007fff57718fa0 CR3: 000000061773a000 CR4: 00000000000406e0 
[28868.680366] Call Trace: 
[28868.693878]  <TASK> 
[28868.705598]  ? lock_is_held_type+0xea/0x140 
[28868.728292]  ? find_held_lock+0x32/0x80 
[28868.749936]  ? sched_clock_cpu+0xb/0xc0 
[28868.769081]  ? lock_release+0x151/0x460 
[28868.788137]  ? _raw_spin_unlock_irqrestore+0x30/0x60 
[28868.815476]  bfq_limit_depth+0xbf/0x2e0 
[28868.835749]  __blk_mq_alloc_requests+0x328/0x390 
[28868.857909]  blk_mq_submit_bio+0x460/0x8d0 
[28868.878903]  submit_bio_noacct+0x1f6/0x2b0 
[28868.899953]  iomap_submit_ioend+0x4b/0x70 
[28868.919092]  xfs_vm_writepages+0x6e/0x90 [xfs] 
[28868.940923]  do_writepages+0xaf/0x1a0 
[28868.961095]  ? lock_release+0x151/0x460 
[28868.979800]  ? _raw_spin_unlock+0x29/0x40 
[28868.998934]  filemap_fdatawrite_wbc+0x5c/0x80 
[28869.021603]  file_write_and_wait_range+0x76/0xd0 
[28869.045042]  xfs_file_fsync+0x75/0x2b0 [xfs] 
[28869.074082]  __x64_sys_fsync+0x37/0x60 
[28869.093348]  do_syscall_64+0x3a/0x80 
[28869.112043]  entry_SYSCALL_64_after_hwframe+0x44/0xae 
[28869.136137] RIP: 0033:0x7f63a88b71b7 
[28869.153408] Code: 64 89 01 48 83 c8 ff c3 66 2e 0f 1f 84 00 00 00 00 00 90 f3 0f 1e fa 64 8b 04 25 18 00 00 00 85 c0 75 10 b8 4a 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 41 c3 48 83 ec 18 89 7c 24 0c e8 43 10 f8 ff 
[28869.257395] RSP: 002b:00007fff5771a868 EFLAGS: 00000246 ORIG_RAX: 000000000000004a 
[28869.297757] RAX: ffffffffffffffda RBX: 00007fff5771ba60 RCX: 00007f63a88b71b7 
[28869.335136] RDX: 0000000000000004 RSI: 00007fff5771a880 RDI: 0000000000000005 
[28869.369215] RBP: 0000000000000005 R08: 0000000000000000 R09: 00000000000000eb 
[28869.406571] R10: 00000000008cec00 R11: 0000000000000246 R12: 00007fff5771a940 
[28869.441043] R13: 00007fff5771a8c0 R14: 0000000000000000 R15: 0000000000000003 
[28869.478620]  </TASK> 
[28869.488953] Modules linked in: binfmt_misc wp512 streebog_generic rmd160 nhpoly1305_sse2 nhpoly1305 michael_mic md4 twofish_generic twofish_avx_x86_64 twofish_x86_64_3way twofish_x86_64 twofish_common sm4_aesni_avx_x86_64 libsm4 serpent_avx_x86_64 serpent_sse2_x86_64 serpent_generic fcrypt des3_ede_x86_64 des_generic libdes cast6_avx_x86_64 cast6_generic cast5_avx_x86_64 cast5_generic cast_common camellia_generic camellia_aesni_avx_x86_64 camellia_x86_64 blowfish_generic blowfish_x86_64 blowfish_common aegis128 aegis128_aesni raid10 raid0 dm_cache_smq dm_cache raid1 dm_raid raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx dm_thin_pool dm_persistent_data dm_bio_prison nvme nvme_core ipmi_watchdog ipmi_poweroff ipmi_ssif ipmi_devintf nls_utf8 cifs cifs_arc4 cifs_md4 nf_conntrack_netlink xt_addrtype xt_nat xt_mark xt_comment nft_chain_nat xt_MASQUERADE nf_nat veth bridge stp llc vsock_loopback vmw_vsock_virtio_transport_common vmw_vsock_vmci_transport vsock vmw_vmci loop 
[28869.489094]  tun af_key crypto_user scsi_transport_iscsi xt_multiport ip_gre ip_tunnel gre bluetooth ecdh_generic overlay xt_CONNSECMARK xt_SECMARK xt_conntrack nft_compat ah6 ah4 nft_objref nft_ct nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 nf_tables nfnetlink ip_tables vfat fat jfs sctp ip6_udp_tunnel udp_tunnel rpcsec_gss_krb5 nfsv4 dns_resolver nfs fscache netfs rpcrdma rdma_cm iw_cm ib_cm ib_core nfsd auth_rpcgss nfs_acl lockd grace dm_log_writes tls rfkill sunrpc amd64_edac edac_mce_amd kvm_amd ccp kvm irqbypass pcspkr k10temp fam15h_power i2c_piix4 acpi_ipmi joydev igb ipmi_si hpilo dca ipmi_msghandler fuse zram amdgpu xfs iommu_v2 gpu_sched ata_generic pata_acpi crct10dif_pclmul radeon crc32_pclmul crc32c_intel hpsa drm_ttm_helper ghash_clmulni_intel ttm serio_raw hpwdt sp5100_tco pata_atiixp scsi_transport_sas [last unloaded: dm_persistent_data] 
[28870.356005] ---[ end trace 0000000000000000 ]--- 
[28870.378204] RIP: 0010:bfqq_request_over_limit+0x1d9/0x5f0 
[28870.405284] Code: 83 c3 38 48 8b 13 83 c0 01 48 83 c3 30 48 8d 0c ca 39 04 24 7d ed 41 8b 46 50 48 89 ca 48 d1 ea 0f af c5 48 98 48 01 d0 31 d2 <48> f7 f1 89 c5 41 3b 46 48 0f 8f ae 01 00 00 49 8b 44 24 08 48 8b 
[28870.501690] RSP: 0018:ffffb5ef08727aa0 EFLAGS: 00010046 
[28870.526811] RAX: 0000000000000f00 RBX: ffffa00f7de39988 RCX: 0000000000000000 
[28870.564403] RDX: 0000000000000000 RSI: 0000000000000064 RDI: ffffa00f7de39898 
[28870.598672] RBP: 00000000000000c0 R08: 0000000000000002 R09: 000000003312476a 
[28870.636068] R10: 0000000000000001 R11: 0000000000000001 R12: ffffa00ed08d1b00 
[28870.670468] R13: 0000000000000001 R14: ffffa00ed08d1b88 R15: 0000000000000000 
[28870.707999] FS:  00007f63a8a40740(0000) GS:ffffa012ace00000(0000) knlGS:0000000000000000 
[28870.746910] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033 
[28870.778692] CR2: 00007fff57718fa0 CR3: 000000061773a000 CR4: 00000000000406e0 
[28870.817673] Kernel panic - not syncing: Fatal exception 
[28871.987346] Shutting down cpus with NMI 
[28872.020323] Kernel Offset: 0x2000000 from 0xffffffff81000000 (relocation range: 0xffffffff80000000-0xffffffffbfffffff) 
[28872.080168] ---[ end Kernel panic - not syncing: Fatal exception ]--- 

2. What is the Version-Release number of the kernel:
kernel-5.17.0-0.rc8.34e047aa16c0.126.test.fc37.x86_64


[1] https://gitlab.com/cki-project/kernel-tests/-/tree/main/stress/stress-ng

Comment 2 Chris Murphy 2022-04-07 16:21:08 UTC
Hopeful this will fix it, and should appear in 5.17.3
https://lore.kernel.org/all/20220407140738.9723-1-jack@suse.cz/


Note You need to log in before you can comment on or make changes to this bug.