429168 – GFS2: Kernel BUG at mm/filemap.c:553

Bug 429168 - GFS2: Kernel BUG at mm/filemap.c:553

Summary: GFS2: Kernel BUG at mm/filemap.c:553

Keywords:
Status:	CLOSED ERRATA
Alias:	None
Product:	Red Hat Enterprise Linux 5
Classification:	Red Hat
Component:	kernel
Sub Component:
Version:	5.2
Hardware:	All
OS:	Linux
Priority:	high
Severity:	high
Target Milestone:	rc
Target Release:	---
Assignee:	Don Zickus
QA Contact:	GFS Bugs
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+	depends on / blocked

Reported:	2008-01-17 18:29 UTC by Robert Peterson
Modified:	2008-05-21 15:06 UTC (History)
CC List:	3 users (show)
Fixed In Version:	RHBA-2008-0314
Doc Type:	Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed:	2008-05-21 15:06:58 UTC
Target Upstream Version:
Embargoed:
Dependent Products:

Attachments	(Terms of Use)
Diff between -56 and -57 kernel (32.24 KB, patch) 2008-01-17 19:10 UTC, Robert Peterson	no flags	Details \| Diff
Patch to fix the problem (651 bytes, patch) 2008-01-18 01:17 UTC, Robert Peterson	no flags	Details \| Diff
Try #2 (416 bytes, patch) 2008-01-18 14:58 UTC, Robert Peterson	no flags	Details \| Diff
Same patch, but can be applied before 253990 patch (471 bytes, patch) 2008-01-21 16:50 UTC, Robert Peterson	no flags	Details \| Diff
Show Obsolete (2) View All

Links
System	ID	Private	Priority	Status	Summary	Last Updated
Red Hat Product Errata	RHBA-2008:0314	0	normal	SHIPPED_LIVE	Updated kernel packages for Red Hat Enterprise Linux 5.2	2008-05-20 18:43:34 UTC

Description Robert Peterson 2008-01-17 18:29:43 UTC

Description of problem:
I was running my "hell" GFS2 tests on the latest GFS2 version and got
a kernel panic with "Kernel BUG at mm/filemap.c:553" on "hell6".

This problem was not caused by a recent code change.  The problem
exists in kernels 57 through 68, but the -56 kernel works properly.

Version-Release number of selected component (if applicable):
RHEL 5.2 beta

How reproducible:
Easily

Steps to Reproduce:
The "Hell6" test is as follows:

service cman start
service clvmd start
mkfs.gfs2 -O -t bobs_roth:test_gfs -p lock_dlm -j 3 /dev/roth_vg/roth_lv
mount -tgfs2 /dev/roth_vg/roth_lv /mnt/gfs2
cd /bob_music/
cp -a * /mnt/gfs2/

The test goes on to rm the copied files after the file system is full
on a working system.  However, it is the cp that causes the kernel to
panic on broken versions.

Actual results:
----------- [cut here ] --------- [please bite here ] ---------
Kernel BUG at mm/filemap.c:553
invalid opcode: 0000 [1] SMP 
last sysfs file: /devices/pci0000:00/0000:00:00.0/irq
CPU 1 
Modules linked in: lock_dlm gfs2 dlm configfs autofs4 hidp rfcomm l2cap
bluetooth sunrpc ipv6 dm_multipath video sbs backlight i2c_ec button battery
asus_acpi acpi_memhotplug ac parport_pc lp parport joydev ide_cd shpchp i2c_i801
sg i2c_core cdrom tg3 serio_raw pcspkr dm_snapshot dm_zero dm_mirror dm_mod
qla2xxx scsi_transport_fc ata_piix libata sd_mod scsi_mod ext3 jbd ehci_hcd
ohci_hcd uhci_hcd
Pid: 2946, comm: cp Not tainted 2.6.18-57.el5 #1
RIP: 0010:[<ffffffff800179a4>]  [<ffffffff800179a4>] unlock_page+0xf/0x2f
RSP: 0018:ffff8100525d9bd8  EFLAGS: 00010246
RAX: 0000000000000000 RBX: ffff810000d854d8 RCX: ffff81001f8949e4
RDX: ffff81001e935500 RSI: 00000000ffffffe4 RDI: ffff810000d854d8
RBP: ffff810000d854d8 R08: 00000000ffffffe4 R09: 0000000000020000
R10: ffff8100015e98f8 R11: 00000000fffffffa R12: 000000000035b000
R13: 0000000000001000 R14: 00007fffec4aa000 R15: 0000000000000000
FS:  00002aaaaaabaf20(0000) GS:ffff8100026e57c0(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 000000000e29f1f8 CR3: 0000000056abe000 CR4: 00000000000006e0
Process cp (pid: 2946, threadinfo ffff8100525d8000, task ffff8100644d1860)
Stack:  ffffffffffffffe4 ffffffff8000f9eb 0000000000001000 ffff8100525d9f50
 000000000035b000 0000000000000001 ffff8100525d9dd8 0000000000000001
 0000100000000000 ffff81007be5c580 ffff81001e935610 ffffffff884c7fa0
Call Trace:
 [<ffffffff8000f9eb>] generic_file_buffered_write+0x27a/0x6d8
 [<ffffffff8000bf9a>] do_generic_mapping_read+0x3b6/0x3f8
 [<ffffffff8000cbec>] file_read_actor+0x0/0x154
 [<ffffffff8000ddd9>] current_fs_time+0x3b/0x40
 [<ffffffff80015dc6>] __generic_file_aio_write_nolock+0x36c/0x3b8
 [<ffffffff8000c128>] __generic_file_aio_read+0x14c/0x190
 [<ffffffff800be24c>] __generic_file_write_nolock+0x8f/0xa8
 [<ffffffff80016563>] generic_file_aio_read+0x34/0x39
 [<ffffffff8009b4c4>] autoremove_wake_function+0x0/0x2e
 [<ffffffff8009b4c4>] autoremove_wake_function+0x0/0x2e
 [<ffffffff80061a94>] mutex_lock+0xd/0x1d
 [<ffffffff80043358>] generic_file_write+0x49/0xa7
 [<ffffffff800161d8>] vfs_write+0xce/0x174
 [<ffffffff80016aa5>] sys_write+0x45/0x6e
 [<ffffffff8005b28d>] tracesys+0xd5/0xe0


Code: 0f 0b 68 9f d2 28 80 c2 29 02 48 89 df e8 0e 2a 00 00 48 89 


Expected results:
No kernel panic.  When the device gets full, it should just start
spewing out messages like these:
cp: writing `/mnt/gfs2/Metal/Rhapsody/SF6GZ0~2/10 - Guardiani del destino.mp3':
No space left on device


Additional info:
The problem was introduced between -56 and -57, so between 13 Nov through
21 Nov 2007.  I'll compare the sources to see if I can find the problem.

Comment 1 Robert Peterson 2008-01-17 19:10:09 UTC

Created attachment 292044 [details]
Diff between -56 and -57 kernel

This is a diff between the -56 version that works and the -57 version
that fails hell6.  It's 1152 lines long, so lots of changes.

Comment 2 Robert Peterson 2008-01-17 19:20:58 UTC

The problem happens with data=writeback as well as the default ordered
write.

Comment 3 Robert Peterson 2008-01-18 01:17:37 UTC

Created attachment 292094 [details]
Patch to fix the problem

Solved.  Function gfs2_write_lock_start was unlocking the page
prematurely.  When the code figured out there was no space left on
the device, it returned the -ENOSPC return code.  However, vfs will
try to unlock the page if the return code is not AOP_TRUNCATED_PAGE.
The problem was, we had already unlocked it.

The solution--this patch--is to unlock the page only in cases where
it determines that it's going to return AOP_TRUNCATED_PAGE.

BTW, this problem no longer exists upstream because the upstream code
has advanced beyond the need for returning AOP_TRUNCATED_PAGE.

Comment 4 Robert Peterson 2008-01-18 01:19:51 UTC

Reassigning to myself.

Also, I need some flags set please.  It's important to get this into
RHEL5.2.  We don't want rudimentary errors like out-of-space to cause
a kernel panic.

Comment 5 Steve Whitehouse 2008-01-18 14:40:20 UTC

The patch in comment #3 is wrong. We can't remove the unlock before the glock as
thats the whole point of gfs2_write_lock_start. Instead we'll have to fix it by
checking the error path and getting the page lock again if and only if we are
going to return an error.

Comment 6 Robert Peterson 2008-01-18 14:58:42 UTC

Created attachment 292152 [details]
Try #2

Does this look better?	This one keeps the unlock_page in place,
but relocks the page on error.

Comment 7 Robert Peterson 2008-01-21 16:50:05 UTC

Created attachment 292384 [details]
Same patch, but can be applied before 253990 patch

This is the same patch, but the line numbers are different.  The previous
version was meant to be applied over top of the 253990 (performance) fix.
This one goes directly on top of the preceding i_alloc fix that was
previously posted to rhkernel-list.  I'm planning to post this one.

Comment 8 Robert Peterson 2008-01-21 17:03:59 UTC

The fix was posted to rhkernel-list, so I'm changing status to POST
and rerouting it to Don Zickus.

Comment 9 Don Zickus 2008-01-22 18:52:27 UTC

in 2.6.18-72.el5
You can download this test kernel from http://people.redhat.com/dzickus/el5

Comment 12 errata-xmlrpc 2008-05-21 15:06:58 UTC

An advisory has been issued which should help the problem
described in this bug report. This report is therefore being
closed with a resolution of ERRATA. For more information
on the solution and/or where to find the updated files,
please follow the link below. You may reopen this bug report
if the solution does not work for you.

http://rhn.redhat.com/errata/RHBA-2008-0314.html

Note You need to log in before you can comment on or make changes to this bug.