Bug 1291328
| Summary: | aio_stress kernel BUG at mm/filemap.c:238 | ||
|---|---|---|---|
| Product: | Red Hat Enterprise Linux 7 | Reporter: | Bill Peck <bpeck> |
| Component: | kernel-aarch64 | Assignee: | fs-maint |
| kernel-aarch64 sub component: | XFS | QA Contact: | Eryu Guan <eguan> |
| Status: | CLOSED CURRENTRELEASE | Docs Contact: | |
| Severity: | unspecified | ||
| Priority: | unspecified | CC: | eguan, jburke, jfeeney, jstancek |
| Version: | 7.3 | Keywords: | TestOnly |
| Target Milestone: | rc | ||
| Target Release: | --- | ||
| Hardware: | Unspecified | ||
| OS: | Unspecified | ||
| Whiteboard: | |||
| Fixed In Version: | Doc Type: | Bug Fix | |
| Doc Text: | Story Points: | --- | |
| Clone Of: | Environment: | ||
| Last Closed: | 2016-07-13 03:45:52 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | Category: | --- | |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | |||
|
Description
Bill Peck
2015-12-14 15:04:21 UTC
Should be fixed in 4.4-rc5: http://marc.info/?l=linux-mm&m=144950957622869&w=2 https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git/commit/?id=dfd01f026058a59a513f8a365b439a0681b803af (In reply to Jan Stancek from comment #3) > Should be fixed in 4.4-rc5: > > http://marc.info/?l=linux-mm&m=144950957622869&w=2 > https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git/commit/ > ?id=dfd01f026058a59a513f8a365b439a0681b803af Just for future reference, this patch also fixed BUG_ON I've seen when running xfstests generic/095 against NFSv3 on ppc64 host(though it's different call trace). I also confirmed 4.4-rc5 passed the test. [ 2015.935021] ------------[ cut here ]------------ [ 2015.935034] kernel BUG at mm/page-writeback.c:2654! [ 2015.935039] Oops: Exception in kernel mode, sig: 5 [#1] [ 2015.935042] SMP NR_CPUS=2048 NUMA pSeries [ 2015.935046] Modules linked in: loop(E) dm_mod(E) nfsv3(E) nfs(E) fscache(E) ext4(E) mbcache(E) jbd2(E) sg(E) pseries_rng(E) nfsd(E) auth_rpcgss(E) nfs_acl(E) lockd(E) grace(E) sunrpc(E) ip_tables(E) xfs(E) libcrc32c(E) sd_mod(E) ibmvscsi(E) ibmveth(E) scsi_transport_srp(E) [ 2015.935090] CPU: 2 PID: 5679 Comm: fio Tainted: G E 4.4.0-rc4 #1 [ 2015.935096] task: c00000059b380000 ti: c00000059a17c000 task.ti: c00000059a17c000 [ 2015.935103] NIP: c0000000001f3e24 LR: c0000000001f3e10 CTR: c0000000001f3de0 [ 2015.935109] REGS: c00000059a17f440 TRAP: 0700 Tainted: G E (4.4.0-rc4) [ 2015.935115] MSR: 8000000100029032 <SF,EE,ME,IR,DR,RI> CR: 24004480 XER: 00000001 [ 2015.935129] CFAR: c00000000020a59c SOFTE: 1 GPR00: d00000000bd6e524 c00000059a17f6c0 c000000001325e00 c000000598eb1af8 GPR04: 0000000000000000 0000000000000000 c00000059a17f830 0000000000000000 GPR08: 0000000000000000 0000000000000001 0000000000000000 d00000000bd78bf8 GPR12: 0000000024004480 c00000000e7e1300 00000100080a1b70 ffffffffffffffff GPR16: 0000000000000016 00003fff87fdedd0 000000003b9ac9ff f000000001651e18 GPR20: c000000598eb1b10 0000000000000007 0000000000000000 0000000000000001 GPR24: 0000000000000001 c000000598eb1af8 0000000000000006 c00000059a17f950 GPR28: c00000059a17f9c0 c000000598eb1990 c000000598eb1af8 f000000001651e00 [ 2015.935215] NIP [c0000000001f3e24] .clear_page_dirty_for_io+0x44/0x2a0 [ 2015.935221] LR [c0000000001f3e10] .clear_page_dirty_for_io+0x30/0x2a0 [ 2015.935226] Call Trace: [ 2015.935230] [c00000059a17f6c0] [0000000000000006] 0x6 (unreliable) [ 2015.935255] [c00000059a17f760] [d00000000bd6e524] .nfs_wb_page+0x84/0x210 [nfs] [ 2015.935277] [c00000059a17f850] [d00000000bd59250] .nfs_launder_page+0x70/0xc0 [nfs] [ 2015.935285] [c00000059a17f8d0] [c0000000001f98e8] .invalidate_inode_pages2_range+0x2f8/0x490 [ 2015.935299] [c00000059a17faa0] [d00000000bd65940] .nfs_file_direct_write+0xf0/0x460 [nfs] [ 2015.935313] [c00000059a17fb70] [d00000000bd5a6d4] .nfs_file_write+0xf4/0x230 [nfs] [ 2015.935319] [c00000059a17fc10] [c000000000283278] .__vfs_write+0xf8/0x170 [ 2015.935325] [c00000059a17fcf0] [c000000000283cf0] .vfs_write+0xb0/0x1f0 [ 2015.935330] [c00000059a17fd90] [c000000000284d24] .SyS_pwrite64+0xb4/0xd0 [ 2015.935337] [c00000059a17fe30] [c000000000009204] system_call+0x38/0xb4 [ 2015.935341] Instruction dump: [ 2015.935344] fbc1fff0 fbe1fff8 7c7f1b78 f8010010 91810008 f821ff61 48016765 60000000 [ 2015.935354] e93f0000 7d2948f8 7c7e1b78 792907e0 <0b090000> 2fa30000 419e0038 e8630000 [ 2015.935367] ---[ end trace 23bfe706acc701f1 ]--- [ 2015.936730] [ 2017.936768] Kernel panic - not syncing: Fatal exception Okay so given comments #3 and #4, it looks like this should be fixed in the 4.4-final RHELSA kernel. I am going to move this on to MODIFIED then so it can take a step towards closure. Tested with kernel-4.5.0-0.44.el7.aarch64, all ltp-aiodio tests passed without kernel BUG_ON/hang/panic https://beaker.engineering.redhat.com/jobs/1399681 Set to VERIFIED. |