Bug 677584 - xfs_check fails crashing when running on a decently large partition, like 30TB
Keywords:
Status: CLOSED WONTFIX
Alias: None
Product: Fedora
Classification: Fedora
Component: xfsprogs
Version: 14
Hardware: x86_64
OS: Linux
Priority: unspecified
Severity: high
Target Milestone: ---
Assignee: Eric Sandeen
QA Contact: Fedora Extras Quality Assurance
URL:
Whiteboard:
Depends On:
Blocks:
 
Reported: 2011-02-15 09:59 UTC by lejeczek
Modified: 2011-02-15 22:07 UTC (History)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed: 2011-02-15 22:07:38 UTC
Type: ---



Description lejeczek 2011-02-15 09:59:44 UTC
Description of problem:

I am sure I have seen this before: there was a certain partition size up to which xfs_check succeeded, though I cannot remember what it was. It feels like something is wrong deep inside XFS itself and/or its tools.

Version-Release number of selected component (if applicable):

xfsprogs-3.1.2-1.fc14.x86_64
2.6.35.11-83.fc14.x86_64

How reproducible:
A PV that spans the whole device, a VG on top of this PV, and an LV that spans the whole VG. The device is a SAS2 JBOD with SATA-3 HDDs combined into a hardware RAID0 volume, formatted with default mkfs.xfs.

I have tried this on different systems with different underlying hardware, and the process gets killed every time for consuming too much memory.

Steps to Reproduce:
1.
2.
3.
  
Actual results:


Expected results:


Additional info:

[ 5765.762625] xfs_db invoked oom-killer: gfp_mask=0x280da, order=0, oom_adj=0
[ 5765.762628] xfs_db cpuset=/ mems_allowed=0
[ 5765.762631] Pid: 3989, comm: xfs_db Not tainted 2.6.35.11-83.fc14.x86_64 #1
[ 5765.762632] Call Trace:
[ 5765.762637]  [<ffffffff8108e2c6>] ? cpuset_print_task_mems_allowed+0x91/0x9d
[ 5765.762640]  [<ffffffff810d5b2e>] dump_header+0x6e/0x1b5
[ 5765.762643]  [<ffffffff81469fcf>] ? _raw_spin_unlock_irqrestore+0x17/0x19
[ 5765.762645]  [<ffffffff8121d073>] ? ___ratelimit+0xe2/0x103
[ 5765.762647]  [<ffffffff810d5cbc>] oom_kill_process+0x47/0x131
[ 5765.762649]  [<ffffffff810d61bd>] __out_of_memory+0x98/0xa7
[ 5765.762651]  [<ffffffff810d6550>] out_of_memory+0x15d/0x195
[ 5765.762653]  [<ffffffff810da3dc>] __alloc_pages_nodemask+0x646/0x776
[ 5765.762656]  [<ffffffff81101f59>] alloc_page_vma+0xce/0xd3
[ 5765.762658]  [<ffffffff810ea992>] do_wp_page+0x2b2/0x58e
[ 5765.762660]  [<ffffffff810e9519>] ? pmd_offset+0x19/0x40
[ 5765.762662]  [<ffffffff810ec4c4>] handle_mm_fault+0x7ef/0x84d
[ 5765.762663]  [<ffffffff8146a40e>] ? common_interrupt+0xe/0x13
[ 5765.762666]  [<ffffffff8146d375>] do_page_fault+0x250/0x265
[ 5765.762668]  [<ffffffff8146a6f5>] page_fault+0x25/0x30
[ 5765.762669] Mem-Info:
[ 5765.762670] Node 0 DMA per-cpu:
[ 5765.762672] CPU    0: hi:    0, btch:   1 usd:   0
[ 5765.762673] CPU    1: hi:    0, btch:   1 usd:   0
[ 5765.762674] CPU    2: hi:    0, btch:   1 usd:   0
[ 5765.762675] CPU    3: hi:    0, btch:   1 usd:   0
[ 5765.762676] CPU    4: hi:    0, btch:   1 usd:   0
[ 5765.762678] CPU    5: hi:    0, btch:   1 usd:   0
[ 5765.762679] Node 0 DMA32 per-cpu:
[ 5765.762682] CPU    0: hi:  186, btch:  31 usd:  60
[ 5765.762684] CPU    1: hi:  186, btch:  31 usd:   0
[ 5765.762685] CPU    2: hi:  186, btch:  31 usd:   0
[ 5765.762686] CPU    3: hi:  186, btch:  31 usd:   0
[ 5765.762687] CPU    4: hi:  186, btch:  31 usd:  31
[ 5765.762688] CPU    5: hi:  186, btch:  31 usd:   0
[ 5765.762689] Node 0 Normal per-cpu:
[ 5765.762690] CPU    0: hi:  186, btch:  31 usd: 175
[ 5765.762691] CPU    1: hi:  186, btch:  31 usd:  17
[ 5765.762692] CPU    2: hi:  186, btch:  31 usd:  30
[ 5765.762693] CPU    3: hi:  186, btch:  31 usd:  21
[ 5765.762694] CPU    4: hi:  186, btch:  31 usd:  57
[ 5765.762695] CPU    5: hi:  186, btch:  31 usd:   0
[ 5765.762698] active_anon:3540806 inactive_anon:421612 isolated_anon:0
[ 5765.762699]  active_file:586 inactive_file:514 isolated_file:0
[ 5765.762699]  unevictable:0 dirty:4 writeback:1 unstable:0
[ 5765.762700]  free:20843 slab_reclaimable:2842 slab_unreclaimable:15812
[ 5765.762701]  mapped:46 shmem:0 pagetables:18172 bounce:0
[ 5765.762702] Node 0 DMA free:15904kB min:12kB low:12kB high:16kB active_anon:0kB inactive_anon:0kB active_file:0kB inactive_file:0kB unevictable:0kB isolated(anon):0kB isolated(file):0kB present:15252kB mlocked:0kB dirty:0kB writeback:0kB mapped:0kB shmem:0kB slab_reclaimable:0kB slab_unreclaimable:0kB kernel_stack:0kB pagetables:0kB unstable:0kB bounce:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? yes
[ 5765.762708] lowmem_reserve[]: 0 3254 16131 16131
[ 5765.762710] Node 0 DMA32 free:54604kB min:3276kB low:4092kB high:4912kB active_anon:2701600kB inactive_anon:540068kB active_file:180kB inactive_file:4kB unevictable:0kB isolated(anon):0kB isolated(file):0kB present:3332192kB mlocked:0kB dirty:8kB writeback:0kB mapped:152kB shmem:0kB slab_reclaimable:1788kB slab_unreclaimable:1316kB kernel_stack:248kB pagetables:10680kB unstable:0kB bounce:0kB writeback_tmp:0kB pages_scanned:722 all_unreclaimable? yes
[ 5765.762716] lowmem_reserve[]: 0 0 12877 12877
[ 5765.762717] Node 0 Normal free:12864kB min:12968kB low:16208kB high:19452kB active_anon:11461624kB inactive_anon:1146380kB active_file:2164kB inactive_file:2052kB unevictable:0kB isolated(anon):0kB isolated(file):0kB present:13186560kB mlocked:0kB dirty:8kB writeback:8kB mapped:32kB shmem:0kB slab_reclaimable:9580kB slab_unreclaimable:61932kB kernel_stack:1712kB pagetables:62008kB unstable:0kB bounce:0kB writeback_tmp:0kB pages_scanned:6694 all_unreclaimable? yes
[ 5765.762723] lowmem_reserve[]: 0 0 0 0
[ 5765.762725] Node 0 DMA: 2*4kB 1*8kB 1*16kB 0*32kB 2*64kB 1*128kB 1*256kB 0*512kB 1*1024kB 1*2048kB 3*4096kB = 15904kB
[ 5765.762730] Node 0 DMA32: 27*4kB 54*8kB 77*16kB 59*32kB 74*64kB 47*128kB 31*256kB 15*512kB 2*1024kB 5*2048kB 3*4096kB = 54604kB
[ 5765.762734] Node 0 Normal: 836*4kB 223*8kB 72*16kB 32*32kB 15*64kB 4*128kB 1*256kB 1*512kB 1*1024kB 1*2048kB 0*4096kB = 12616kB
[ 5765.762738] 6006 total pagecache pages
[ 5765.762739] 4857 pages in swap cache
[ 5765.762740] Swap cache stats: add 10270122, delete 10265265, find 37290/55802
[ 5765.762741] Free swap  = 0kB
[ 5765.762742] Total swap = 18546684kB
[ 5765.816729] 4194303 pages RAM
[ 5765.816730] 77819 pages reserved
[ 5765.816732] 1240 pages shared
[ 5765.816733] 4093065 pages non-shared
[ 5765.816734] Out of memory: kill process 3988 (xfs_check) score 520480 or a child
[ 5765.816784] Killed process 3989 (xfs_db) vsz:66409172kB, anon-rss:15824712kB, file-rss:8kB
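
For anyone reading the numbers in the log above, here is a rough Python sketch that pulls the key figures out of the OOM report and converts them to GiB. The log lines are copied verbatim from this report; the 4 kB page size is an assumption (the usual x86_64 default), and the helper names `kb_to_gib` and `summarize` are made up for illustration.

```python
import re

# Lines copied verbatim from the OOM report in this bug.
LOG = """\
[ 5765.762741] Free swap  = 0kB
[ 5765.762742] Total swap = 18546684kB
[ 5765.816729] 4194303 pages RAM
[ 5765.816734] Out of memory: kill process 3988 (xfs_check) score 520480 or a child
[ 5765.816784] Killed process 3989 (xfs_db) vsz:66409172kB, anon-rss:15824712kB, file-rss:8kB
"""

PAGE_KB = 4  # assumption: 4 kB pages, the usual x86_64 default


def kb_to_gib(kb):
    """Convert kernel 'kB' (1024-byte units) to GiB."""
    return kb / (1024 * 1024)


def summarize(log):
    """Extract the killed process and the memory totals from an OOM report."""
    killed = re.search(
        r"Killed process (\d+) \((\S+)\) vsz:(\d+)kB, anon-rss:(\d+)kB", log
    )
    ram_pages = int(re.search(r"(\d+) pages RAM", log).group(1))
    swap_total = int(re.search(r"Total swap = (\d+)kB", log).group(1))
    swap_free = int(re.search(r"Free swap\s+= (\d+)kB", log).group(1))
    return {
        "pid": int(killed.group(1)),
        "comm": killed.group(2),
        "vsz_gib": kb_to_gib(int(killed.group(3))),
        "anon_rss_gib": kb_to_gib(int(killed.group(4))),
        "ram_gib": kb_to_gib(ram_pages * PAGE_KB),
        "swap_used_gib": kb_to_gib(swap_total - swap_free),
    }


if __name__ == "__main__":
    for key, value in summarize(LOG).items():
        print(key, value)
```

Read this way, the report says xfs_db held roughly 15 GiB resident on a machine with about 16 GiB of RAM, with all of the roughly 17.7 GiB of swap already in use when the process was killed.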

Comment 1 Eric Sandeen 2011-02-15 22:07:38 UTC
xfs_check is known to not scale, and is on its way to being deprecated.

If you want a check-only filesystem verification, please use xfs_repair -n.

Thanks,
-Eric
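
As an illustration of the suggestion above, a minimal Python sketch of wrapping `xfs_repair -n` in a script might look like the following. The device path `/dev/vg0/lv0` and the function names are hypothetical. The exit-status meaning (0 for no corruption detected, 1 for corruption detected when `-n` is given) is taken from the xfs_repair man page. The filesystem is assumed to be unmounted, and the script needs enough privileges to read the device.

```python
import subprocess


def build_check_command(device):
    """Return the argv for a check-only run; -n means no-modify mode."""
    return ["xfs_repair", "-n", device]


def describe_status(returncode):
    """Interpret the exit status of 'xfs_repair -n' per its man page."""
    if returncode == 0:
        return "no corruption detected"
    if returncode == 1:
        return "corruption detected"
    return "unexpected exit status %d" % returncode


def check_filesystem(device):
    """Run the check and return (verdict, combined tool output)."""
    result = subprocess.run(
        build_check_command(device),
        stdout=subprocess.PIPE,
        stderr=subprocess.STDOUT,
        universal_newlines=True,
    )
    return describe_status(result.returncode), result.stdout


if __name__ == "__main__":
    # Hypothetical device path; substitute the real logical volume.
    verdict, output = check_filesystem("/dev/vg0/lv0")
    print(verdict)
    print(output)
```

Because `-n` never writes to the device, this is only a verification pass; any repairs would need a separate run without `-n`.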

