Bug 1833860

Summary: BUG: scheduling while atomic: bz722926/3905/0x00000002 regression case with zram
Product: Red Hat Enterprise Linux 7 Reporter: Ping Fang <pifang>
Component: kernel-rt Assignee: Crystal Wood <crwood>
kernel-rt sub component: Locking/lockdep QA Contact: Ping Fang <pifang>
Status: CLOSED ERRATA Docs Contact:
Severity: unspecified    
Priority: unspecified CC: bhu, crwood, jlelli, jreznik, mstowell, trix, williams
Version: 7.9 Keywords: Triaged, ZStream
Target Milestone: rc   
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: 1966760 Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2021-11-23 17:14:02 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Ping Fang 2020-05-11 04:34:38 UTC
Description of problem:
The regression test case for bz722926 triggers the following bug:

[ 1243.107453] BUG: scheduling while atomic: bz722926/14635/0x00000002
[ 1243.107480] Modules linked in: zram rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache sunrpc ipmi_ssif iTCO_wdt iTCO_vendor_support gpio_ich intel_powerclamp sg pcspkr cdc_ether usbnet coretemp mii ioatdma ipmi_si ipmi_devintf ipmi_msghandler i7core_edac i2c_i801 dca lpc_ich ip_tables xfs libcrc32c sd_mod crc_t10dif crct10dif_generic crct10dif_common mgag200 i2c_algo_bit crc32c_intel drm_kms_helper syscopyarea sysfillrect serio_raw sysimgblt fb_sys_fops ttm drm mptsas drm_panel_orientation_quirks scsi_transport_sas mptscsih bnx2 mptbase dm_mirror dm_region_hash dm_log dm_mod
[ 1243.107483] CPU: 3 PID: 14635 Comm: bz722926 Kdump: loaded Not tainted 3.10.0-1139.rt56.1110.el7.x86_64 #1
[ 1243.107483] Hardware name: IBM IBM System x -[7870AC1]-/46M0761     , BIOS -[P9E146CUS-1.08]- 04/26/2010
[ 1243.107484] Call Trace:
[ 1243.107493]  [<ffffffffb3f75f85>] dump_stack+0x19/0x1b
[ 1243.107496]  [<ffffffffb3f70791>] __schedule_bug+0x64/0x72
[ 1243.107497]  [<ffffffffb3f7b53b>] __schedule+0x75b/0x900
[ 1243.107499]  [<ffffffffb3f7b710>] schedule+0x30/0x96
[ 1243.107500]  [<ffffffffb3f7c415>] rt_spin_lock_slowlock_locked+0xf5/0x2d0
[ 1243.107502]  [<ffffffffb3f7c647>] rt_spin_lock_slowlock+0x57/0x90
[ 1243.107503]  [<ffffffffb3f7e055>] rt_spin_lock+0x25/0x30
[ 1243.107507]  [<ffffffffb39c0818>] free_pcppages_bulk+0x38/0x3a0
[ 1243.107509]  [<ffffffffb39c1288>] free_hot_cold_page+0x258/0x2b0
[ 1243.107511]  [<ffffffffb39c13e5>] __free_pages+0x25/0x30
[ 1243.107513]  [<ffffffffb3a35a31>] free_zspage+0x91/0xa0
[ 1243.107514]  [<ffffffffb3a360f8>] zs_free+0x1d8/0x200
[ 1243.107517]  [<ffffffffc0431107>] zram_free_page+0x37/0xb0 [zram]
[ 1243.107519]  [<ffffffffc0432b36>] zram_bvec_rw+0x636/0xbc0 [zram]
[ 1243.107521]  [<ffffffffc0433285>] zram_make_request+0x1c5/0x3d5 [zram]
[ 1243.107524]  [<ffffffffb3b46f9f>] ? generic_make_request_checks+0x25f/0x3f0
[ 1243.107525]  [<ffffffffb3b48737>] generic_make_request+0x197/0x3e0
[ 1243.107526]  [<ffffffffb3b489f0>] submit_bio+0x70/0x150
[ 1243.107529]  [<ffffffffb38ca02b>] ? migrate_enable+0xdb/0x210
[ 1243.107531]  [<ffffffffb3a83bd8>] do_blockdev_direct_IO+0x20e8/0x2340
[ 1243.107533]  [<ffffffffb38ea9e1>] ? cpuacct_charge+0x61/0x70
[ 1243.107535]  [<ffffffffb3a7e200>] ? set_init_blocksize+0x90/0x90
[ 1243.107537]  [<ffffffffb3a83e85>] __blockdev_direct_IO+0x55/0x60
[ 1243.107539]  [<ffffffffb3a7e200>] ? set_init_blocksize+0x90/0x90
[ 1243.107540]  [<ffffffffb3a7e967>] blkdev_direct_IO+0x57/0x60
[ 1243.107541]  [<ffffffffb3a7e200>] ? set_init_blocksize+0x90/0x90
[ 1243.107543]  [<ffffffffb39b99cd>] generic_file_direct_write+0xdd/0x1a0
[ 1243.107544]  [<ffffffffb39b9cc7>] __generic_file_aio_write+0x237/0x400
[ 1243.107545]  [<ffffffffb3a7f316>] blkdev_aio_write+0x56/0xb0
[ 1243.107547]  [<ffffffffb3a3ade3>] do_sync_write+0x93/0xe0
[ 1243.107549]  [<ffffffffb3a3b8e0>] vfs_write+0xc0/0x1f0
[ 1243.107551]  [<ffffffffb3f88155>] ? system_call_after_swapgs+0xa2/0x13a
[ 1243.107553]  [<ffffffffb3a3c6af>] SyS_write+0x7f/0xf0
[ 1243.107554]  [<ffffffffb3f88155>] ? system_call_after_swapgs+0xa2/0x13a
[ 1243.107555]  [<ffffffffb3f88212>] system_call_fastpath+0x25/0x2a
[ 1243.107556]  [<ffffffffb3f88155>] ? system_call_after_swapgs+0xa2/0x13a
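
Triage note: the trailing field of the BUG line is the preempt count at the time of the schedule() call. The sketch below decodes it. It is a minimal illustration, not part of the original report: the helper name decode_bug_line is hypothetical, the message layout (comm/pid/0x<preempt_count>) is taken from the line above, and the bit layout (low 8 bits for preempt-disable depth, next 8 bits for softirq count) is an assumption based on the usual kernel layout that has not been checked against this exact kernel-rt build.

```python
import re

# Assumed message layout, matching the BUG line in this report:
#   BUG: scheduling while atomic: <comm>/<pid>/0x<preempt_count>
BUG_RE = re.compile(
    r"BUG: scheduling while atomic: "
    r"(?P<comm>.+)/(?P<pid>\d+)/0x(?P<count>[0-9a-fA-F]{8})"
)


def decode_bug_line(line):
    """Split a 'scheduling while atomic' line into its fields.

    Returns None when the line does not contain the message.
    The bit layout used here is an assumption (see the note above).
    """
    match = BUG_RE.search(line)
    if match is None:
        return None
    count = int(match.group("count"), 16)
    return {
        "comm": match.group("comm"),
        "pid": int(match.group("pid")),
        "preempt_count": count,
        # Assumed: bits 0-7 hold the preempt-disable nesting depth.
        "preempt_depth": count & 0xFF,
        # Assumed: bits 8-15 hold the softirq count.
        "softirq_count": (count >> 8) & 0xFF,
        # Everything above bit 15 (hardirq/NMI fields) is left undecoded.
        "upper_bits": count >> 16,
    }


if __name__ == "__main__":
    line = "[ 1243.107453] BUG: scheduling while atomic: bz722926/14635/0x00000002"
    print(decode_bug_line(line))
```

Under those assumptions, 0x00000002 reads as a preempt-disable depth of 2 with no softirq or hardirq bits set. That is consistent with the trace: free_pcppages_bulk() calls rt_spin_lock(), which can sleep on kernel-rt, while preemption is still disabled somewhere in the zram/zsmalloc free path.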

Version-Release number of selected component (if applicable):
Present since 7.8 GA (kernel-rt-3.10.0-1127.rt56.1093.el7).
Still present in kernel-rt-3.10.0-1139.rt56.1110.el7.


How reproducible:
100%

Steps to Reproduce:
1. wget http://download.eng.rdu2.redhat.com/qa/rhts/lookaside/stress-1.0.4.tar.gz
2. tar xzf stress-1.0.4.tar.gz
3. cd stress-1.0.4 && ./configure >/dev/null 2>&1 && make >/dev/null 2>&1 && make install >/dev/null 2>&1
4. yum install -y kernel-general-memory-regression-reg-suit
5. cd /mnt/tests/kernel/general/memory/regression/reg-suit/
6. stress -m 4 -t 360 &
7. BZLIST=722926 make run

Actual results:
Panic

Expected results:
Pass

Additional info:
Full vmcore dmesg:
http://ibm-x3250m4-03.rhts.eng.pek2.redhat.com/vmcore/pifang/3.10.0-1139.rt56.1110.el7.x86_64/4246269/ibm-hs22-7870-1.gsslab.rdu2.redhat.com/10.10.176.242-2020-05-11-00:08:43/

Comment 31 errata-xmlrpc 2021-11-23 17:14:02 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Important: kernel-rt security and bug fix update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2021:4779