Bug 688817

Summary: [ext4/xfstests 234] kernel BUG at fs/jbd2/transaction.c:1027! and kernel panic
Product: Red Hat Enterprise Linux 6 Reporter: Eryu Guan <eguan>
Component: kernelAssignee: Red Hat Kernel Manager <kernel-mgr>
Status: CLOSED DUPLICATE QA Contact: Red Hat Kernel QE team <kernel-qe>
Severity: high Docs Contact:
Priority: unspecified    
Version: 6.1   
Target Milestone: rc   
Target Release: ---   
Hardware: x86_64   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2011-03-18 06:25:36 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Eryu Guan 2011-03-18 06:19:26 UTC
Description of problem:
Got kernel panic when running xfstests 234 on ext4 with 2048 block size, tests were running on real block device instead of loop device

Version-Release number of selected component (if applicable):
kernel 2.6.32-122.el6.x86_64
quota 3.17-14.el6.x86_64

How reproducible:
reliable

Steps to Reproduce:
1. Prepare two partitions for testing
2. yum install rh-tests-kernel-filesystems-xfs-xfstests -y
   cd /mnt/tests/kernel/filesystems/xfs/xfstests
3. TEST_PARAM_TEST_DEV=/dev/sda5 TEST_PARAM_SCRATCH_DEV=/dev/sda6 TEST_PARAM_RUNTESTS=234 TEST_PARAM_FSTYPE=ext4 TEST_PARAM_BLKSIZE=2048 TEST_PARAM_LOOP=1  make run
   Replace sda5 and sda6 with correct ones
  
Actual results:
------------[ cut here ]------------
kernel BUG at fs/jbd2/transaction.c:1027!
invalid opcode: 0000 [#1] SMP
last sysfs file: /sys/devices/pci0000:00/0000:00:02.0/0000:10:00.3/0000:11:06.0/host0/port-0:0/end_device-0:0/target0:0:0/0:0:0:0/block/sda/dev
CPU 3
Modules linked in: ext4 jbd2 sunrpc cpufreq_ondemand acpi_cpufreq freq_table ipv6 dm_mirror dm_region_hash dm_log ppdev parport_pc parport wmi tg3 microcode serio_raw sg iTCO_wdt iTCO_vendor_support snd_hda_codec_realtek snd_hda_intel snd_hda_codec snd_hwdep snd_seq snd_seq_device snd_pcm snd_timer snd soundcore snd_page_alloc shpchp i5k_amb hwmon i5000_edac edac_core ext3 jbd mbcache sr_mod cdrom firewire_ohci firewire_core crc_itu_t sd_mod crc_t10dif ahci ata_generic pata_acpi ata_piix mptsas mptscsih mptbase scsi_transport_sas nouveau ttm drm_kms_helper drm i2c_algo_bit i2c_core video output dm_mod [last unloaded: jbd2]

Modules linked in: ext4 jbd2 sunrpc cpufreq_ondemand acpi_cpufreq freq_table ipv6 dm_mirror dm_region_hash dm_log ppdev parport_pc parport wmi tg3 microcode serio_raw sg iTCO_wdt iTCO_vendor_support snd_hda_codec_realtek snd_hda_intel snd_hda_codec snd_hwdep snd_seq snd_seq_device snd_pcm snd_timer snd soundcore snd_page_alloc shpchp i5k_amb hwmon i5000_edac edac_core ext3 jbd mbcache sr_mod cdrom firewire_ohci firewire_core crc_itu_t sd_mod crc_t10dif ahci ata_generic pata_acpi ata_piix mptsas mptscsih mptbase scsi_transport_sas nouveau ttm drm_kms_helper drm i2c_algo_bit i2c_core video output dm_mod [last unloaded: jbd2]
Pid: 27444, comm: setquota Not tainted 2.6.32-122.el6.x86_64 #1 HP xw8400 Workstation
RIP: 0010:[<ffffffffa05227ed>]  [<ffffffffa05227ed>] jbd2_journal_dirty_metadata+0x10d/0x150 [jbd2]
RSP: 0018:ffff880109c21c08  EFLAGS: 00010246
RAX: ffff8801268dec80 RBX: ffff8801064696d8 RCX: ffff8801296d9268
RDX: 0000000000000000 RSI: ffff8801296d9268 RDI: 0000000000000000
RBP: ffff880109c21c28 R08: d010000000000000 R09: feb892b7f35c9a02
R10: ffff880126b75c00 R11: 0000000000000000 R12: ffff8801064e9208
R13: ffff8801296d9268 R14: ffff8801273b0800 R15: 0000000000000000
FS:  00007f66726b7700(0000) GS:ffff8800282c0000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 00007f66721b4010 CR3: 0000000127ccb000 CR4: 00000000000006e0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Process setquota (pid: 27444, threadinfo ffff880109c20000, task ffff880102b2c080)
Stack:
 ffff8801064696d8 ffffffffa057f640 ffff8801296d9268 0000000000000000
<0> ffff880109c21c68 ffffffffa056f0eb ffff880109c21c58 ffffffff8108deef
<0> 0000000000000018 ffff8801064696d8 ffff8801296d9268 0000000000000018
Call Trace:
 [<ffffffffa056f0eb>] __ext4_handle_dirty_metadata+0x7b/0x100 [ext4]
 [<ffffffff8108deef>] ? wake_up_bit+0x2f/0x40
 [<ffffffffa05648f3>] ext4_quota_write+0x123/0x250 [ext4]
 [<ffffffff811cf291>] v2_write_file_info+0xa1/0xe0
 [<ffffffff811cb278>] dquot_commit+0x128/0x130
 [<ffffffffa0563a2e>] ext4_write_dquot+0x5e/0x80 [ext4]
 [<ffffffff811cc9f0>] dqput+0x120/0x200
 [<ffffffff811cd8bf>] vfs_set_dqblk+0x1cf/0x350
 [<ffffffff811d134b>] do_quotactl+0x33b/0x4c0
 [<ffffffff8120b0b1>] ? avc_has_perm+0x71/0x90
 [<ffffffff811d1674>] sys_quotactl+0x1a4/0x390
 [<ffffffff8100b172>] system_call_fastpath+0x16/0x1b
Code: c6 9c 03 00 00 4c 89 f7 e8 61 94 fb e0 48 8b 33 ba 01 00 00 00 4c 89 e7 e8 21 ec ff ff 4c 89 f0 66 ff 00 66 66 90 e9 73 ff ff ff <0f> 0b eb fe 0f 0b eb fe 0f 0b 66 0f 1f 84 00 00 00 00 00 eb f5
RIP  [<ffffffffa05227ed>] jbd2_journal_dirty_metadata+0x10d/0x150 [jbd2]
 RSP <ffff880109c21c08>

Expected results:
test pass

Additional info:
Here are two beaker jobs
https://beaker.engineering.redhat.com/recipes/128656
http://beaker-archive.app.eng.bos.redhat.com/beaker-logs/2011/03/627/62750/128656///console.log

https://beaker.engineering.redhat.com/recipes/128657
http://beaker-archive.app.eng.bos.redhat.com/beaker-logs/2011/03/627/62750/128657///console.log

Comment 1 Eryu Guan 2011-03-18 06:21:09 UTC
I'll try to get a vmcore and on RHEL6 GA kernel to see if it's a regression or not

Comment 2 Eryu Guan 2011-03-18 06:25:36 UTC
I just notice it's already been reported as bug 680105 ... Close it as duplicate
Sorry for noise

*** This bug has been marked as a duplicate of bug 680105 ***