Bug 209005

Summary: ext3/jbd panic
Product: [Fedora] Fedora Reporter: Dave Jones <davej>
Component: kernelAssignee: Eric Sandeen <esandeen>
Status: CLOSED RAWHIDE QA Contact: Brian Brock <bbrock>
Severity: medium Docs Contact:
Priority: medium    
Version: rawhideCC: esandeen, mishu, pfrields, riel, syeghiay, wtogami
Target Milestone: ---   
Target Release: ---   
Hardware: All   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2006-10-16 23:57:45 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Attachments:
Description Flags
serial console capture none

Description Dave Jones 2006-10-02 21:49:43 UTC
I ran two fsx instances on a local 2 disk RAID0 mount, one on an NFS mount, and
an fs-stress run on the same local RAID0.  After about 10 minutes, it went bang
with the attached oopses/panic.

Comment 1 Dave Jones 2006-10-02 21:51:04 UTC
Created attachment 137603 [details]
serial console capture

Comment 2 Eric Sandeen 2006-10-02 22:33:08 UTC
Dave, first quick thought is this might be a dup of bugs #208404 / #207739

In case you want to re-test w/ that patch...

I'll need to dig up the exact trees that these problems were reported on to make
sure it's the same BUG_ON we're hitting.

Comment 3 Eric Sandeen 2006-10-03 02:32:08 UTC
Ok, it was wishful thinking that this is a dup of those bugs.  Although, it may
still be the root cause.  The patch which "fixed" those bugs is in this oopsing
kernel, but it wound up at pretty much the same place.

Comment 5 Eric Sandeen 2006-10-03 16:42:12 UTC
I'll look into this a bit.

Comment 6 Eric Sandeen 2006-10-05 18:09:26 UTC
Can't seem to reproduce this on anything but Dave's box...  

From the attached oops, we went down the path of the last call to
journal_do_submit_data in journal_submit_data_buffers, I think.

Comment 8 Eric Sandeen 2006-10-10 15:37:01 UTC
At long last, I can hit this one now too.

The key is that it needs a block size < page size to hit it, I think.  Although,
it still took many hours for me to hit it (about 12...)

As we gain understanding of the bug I'll try to whip up a better testcase.

Comment 9 Dave Jones 2006-10-16 23:57:45 UTC
This should be fixed in todays FC6 kernel.