Bug 329451
Summary: | GFS2 Single node: soft lockup detected on CPU#0! during g2 baseline run. |
---|---|
Product: | Red Hat Enterprise Linux 5 |
Component: | kernel |
Version: | 5.1 |
Hardware: | All |
OS: | Linux |
Status: | CLOSED NOTABUG |
Severity: | medium |
Priority: | medium |
Reporter: | Dean Jansa <djansa> |
Assignee: | Steve Whitehouse <swhiteho> |
QA Contact: | GFS Bugs <gfs-bugs> |
CC: | cluster-maint |
Doc Type: | Bug Fix |
Last Closed: | 2008-02-27 23:31:44 UTC |
Description
Dean Jansa
2007-10-12 14:55:49 UTC
Since you appear to have a number of these warnings, I presume that the tests are still making progress, even if it's a bit slow. My thought is that if the write is reasonably large, it is probably (given a log flush or two) taking long enough to trigger the warning, even though it's probably doing the right thing. We may well just need to add a suitable cond_resched() to get rid of the warnings in that case. The slightly more long-winded way to deal with the problem is via bugzilla #329321. I assume that the reason this isn't seen for non-jdata files is that the volume of data for the log is much smaller, so it doesn't reach a size where we stall for so long during log flushes.
Created attachment 245441 [details]
Potential fix
I wonder whether the attached patch might fix this bug. It may at least have
some bearing on it, so it's worth a go.
I hit this again with abhi's kmod-gfs2-1.53-4.1:

GFS2 Overlay (built Nov 16 2007 12:45:12) installed
Red Hat Enterprise Linux Server release 5.1 (Tikanga)
Kernel 2.6.18-53.el5 on an ia64
a1 login: BUG: soft lockup detected on CPU#0!
Call Trace:
 [<a000000100013b20>] show_stack+0x40/0xa0 sp=e000004047e2f8a0 bsp=e000004047e29770
 [<a000000100013bb0>] dump_stack+0x30/0x60 sp=e000004047e2fa70 bsp=e000004047e29758
 [<a0000001000e5fe0>] softlockup_tick+0x240/0x280 sp=e000004047e2fa70 bsp=e000004047e29718
 [<a000000100092df0>] run_local_timers+0x30/0x60 sp=e000004047e2fa80 bsp=e000004047e29700
 [<a000000100092ea0>] update_process_times+0x80/0x100 sp=e000004047e2fa80 bsp=e000004047e296d0
 [<a000000100037220>] timer_interrupt+0x180/0x360 sp=e000004047e2fa80 bsp=e000004047e29690
 [<a0000001000e6650>] handle_IRQ_event+0x90/0x120 sp=e000004047e2fa80 bsp=e000004047e29650
 [<a0000001000e6810>] __do_IRQ+0x130/0x420 sp=e000004047e2fa80 bsp=e000004047e29608
 [<a000000100011630>] ia64_handle_irq+0xf0/0x1a0 sp=e000004047e2fa80 bsp=e000004047e295d0
 [<a00000010000c020>] __ia64_leave_kernel+0x0/0x280 sp=e000004047e2fa80 bsp=e000004047e295d0
 [<a000000202a88ef0>] gfs2_ail1_start_one+0xb0/0x420 [gfs2] sp=e000004047e2fc50 bsp=e000004047e29560
 [<a000000202a893f0>] gfs2_ail1_start+0x190/0x220 [gfs2] sp=e000004047e2fc50 bsp=e000004047e29520
 [<a000000202a8c050>] gfs2_log_reserve+0x1d0/0x380 [gfs2] sp=e000004047e2fc50 bsp=e000004047e294c8
 [<a000000202ab7140>] gfs2_do_trans_begin+0x280/0x340 [gfs2] sp=e000004047e2fc50 bsp=e000004047e29478
 [<a000000202a95a30>] gfs2_prepare_write+0x430/0x700 [gfs2] sp=e000004047e2fc50 bsp=e000004047e29400
 [<a0000001001002a0>] generic_file_buffered_write+0x3c0/0xda0 sp=e000004047e2fc60 bsp=e000004047e29330
 [<a000000100101370>] __generic_file_aio_write_nolock+0x6f0/0x800 sp=e000004047e2fcf0 bsp=e000004047e292b8
 [<a000000100101780>] __generic_file_write_nolock+0x100/0x140 sp=e000004047e2fd30 bsp=e000004047e29280
 [<a0000001001019a0>] generic_file_write+0x80/0x1a0 sp=e000004047e2fe10 bsp=e000004047e29230
 [<a000000100161200>] vfs_write+0x200/0x3a0 sp=e000004047e2fe20 bsp=e000004047e291d8
 [<a000000100161d50>] sys_write+0x70/0xe0 sp=e000004047e2fe20 bsp=e000004047e29160
 [<a00000010000bdb0>] __ia64_trace_syscall+0xd0/0x110 sp=e000004047e2fe30 bsp=e000004047e29160
 [<a000000000010620>] __start_ivt_text+0xffffffff00010620/0x400 sp=e000004047e30000 bsp=e000004047e29160
BUG: soft lockup detected on CPU#0!
Call Trace:
 [<a000000100013b20>] show_stack+0x40/0xa0 sp=e000004050d7f8a0 bsp=e000004050d79770
 [<a000000100013bb0>] dump_stack+0x30/0x60 sp=e000004050d7fa70 bsp=e000004050d79758
 [<a0000001000e5fe0>] softlockup_tick+0x240/0x280 sp=e000004050d7fa70 bsp=e000004050d79718
 [<a000000100092df0>] run_local_timers+0x30/0x60 sp=e000004050d7fa80 bsp=e000004050d79700
 [<a000000100092ea0>] update_process_times+0x80/0x100 sp=e000004050d7fa80 bsp=e000004050d796d0
 [<a000000100037220>] timer_interrupt+0x180/0x360 sp=e000004050d7fa80 bsp=e000004050d79690
 [<a0000001000e6650>] handle_IRQ_event+0x90/0x120 sp=e000004050d7fa80 bsp=e000004050d79650
 [<a0000001000e6810>] __do_IRQ+0x130/0x420 sp=e000004050d7fa80 bsp=e000004050d79608
 [<a000000100011630>] ia64_handle_irq+0xf0/0x1a0 sp=e000004050d7fa80 bsp=e000004050d795d0
 [<a00000010000c020>] __ia64_leave_kernel+0x0/0x280 sp=e000004050d7fa80 bsp=e000004050d795d0
 [<a000000202a88ee0>] gfs2_ail1_start_one+0xa0/0x420 [gfs2] sp=e000004050d7fc50 bsp=e000004050d79560
 [<a000000202a893f0>] gfs2_ail1_start+0x190/0x220 [gfs2] sp=e000004050d7fc50 bsp=e000004050d79520
 [<a000000202a8c050>] gfs2_log_reserve+0x1d0/0x380 [gfs2] sp=e000004050d7fc50 bsp=e000004050d794c8
 [<a000000202ab7140>] gfs2_do_trans_begin+0x280/0x340 [gfs2] sp=e000004050d7fc50 bsp=e000004050d79478
 [<a000000202a95a30>] gfs2_prepare_write+0x430/0x700 [gfs2] sp=e000004050d7fc50 bsp=e000004050d79400
 [<a0000001001002a0>] generic_file_buffered_write+0x3c0/0xda0 sp=e000004050d7fc60 bsp=e000004050d79330
 [<a000000100101370>] __generic_file_aio_write_nolock+0x6f0/0x800 sp=e000004050d7fcf0 bsp=e000004050d792b8
 [<a000000100101780>] __generic_file_write_nolock+0x100/0x140 sp=e000004050d7fd30 bsp=e000004050d79280
 [<a0000001001019a0>] generic_file_write+0x80/0x1a0 sp=e000004050d7fe10 bsp=e000004050d79230
 [<a000000100161200>] vfs_write+0x200/0x3a0 sp=e000004050d7fe20 bsp=e000004050d791d8
 [<a000000100161d50>] sys_write+0x70/0xe0 sp=e000004050d7fe20 bsp=e000004050d79160
 [<a00000010000bdb0>] __ia64_trace_syscall+0xd0/0x110 sp=e000004050d7fe30 bsp=e000004050d79160
 [<a000000000010620>] __start_ivt_text+0xffffffff00010620/0x400 sp=e000004050d80000 bsp=e000004050d79160
BUG: soft lockup detected on CPU#0!
Call Trace:
 [<a000000100013b20>] show_stack+0x40/0xa0 sp=e0000040ba4678a0 bsp=e0000040ba461770
 [<a000000100013bb0>] dump_stack+0x30/0x60 sp=e0000040ba467a70 bsp=e0000040ba461758
 [<a0000001000e5fe0>] softlockup_tick+0x240/0x280 sp=e0000040ba467a70 bsp=e0000040ba461718
 [<a000000100092df0>] run_local_timers+0x30/0x60 sp=e0000040ba467a80 bsp=e0000040ba461700
 [<a000000100092ea0>] update_process_times+0x80/0x100 sp=e0000040ba467a80 bsp=e0000040ba4616d0
 [<a000000100037220>] timer_interrupt+0x180/0x360 sp=e0000040ba467a80 bsp=e0000040ba461690
 [<a0000001000e6650>] handle_IRQ_event+0x90/0x120 sp=e0000040ba467a80 bsp=e0000040ba461650
 [<a0000001000e6810>] __do_IRQ+0x130/0x420 sp=e0000040ba467a80 bsp=e0000040ba461608
 [<a000000100011630>] ia64_handle_irq+0xf0/0x1a0 sp=e0000040ba467a80 bsp=e0000040ba4615d0
 [<a00000010000c020>] __ia64_leave_kernel+0x0/0x280 sp=e0000040ba467a80 bsp=e0000040ba4615d0
 [<a000000202a88ea0>] gfs2_ail1_start_one+0x60/0x420 [gfs2] sp=e0000040ba467c50 bsp=e0000040ba461560
 [<a000000202a89320>] gfs2_ail1_start+0xc0/0x220 [gfs2] sp=e0000040ba467c50 bsp=e0000040ba461520
 [<a000000202a8c050>] gfs2_log_reserve+0x1d0/0x380 [gfs2] sp=e0000040ba467c50 bsp=e0000040ba4614c8
 [<a000000202ab7140>] gfs2_do_trans_begin+0x280/0x340 [gfs2] sp=e0000040ba467c50 bsp=e0000040ba461478
 [<a000000202a95a30>] gfs2_prepare_write+0x430/0x700 [gfs2] sp=e0000040ba467c50 bsp=e0000040ba461400
 [<a0000001001002a0>] generic_file_buffered_write+0x3c0/0xda0 sp=e0000040ba467c60 bsp=e0000040ba461330
 [<a000000100101370>] __generic_file_aio_write_nolock+0x6f0/0x800 sp=e0000040ba467cf0 bsp=e0000040ba4612b8
 [<a000000100101780>] __generic_file_write_nolock+0x100/0x140 sp=e0000040ba467d30 bsp=e0000040ba461280
 [<a0000001001019a0>] generic_file_write+0x80/0x1a0 sp=e0000040ba467e10 bsp=e0000040ba461230
 [<a000000100161200>] vfs_write+0x200/0x3a0 sp=e0000040ba467e20 bsp=e0000040ba4611d8
 [<a000000100161d50>] sys_write+0x70/0xe0 sp=e0000040ba467e20 bsp=e0000040ba461160
 [<a00000010000bdb0>] __ia64_trace_syscall+0xd0/0x110 sp=e0000040ba467e30 bsp=e0000040ba461160
 [<a000000000010620>] __start_ivt_text+0xffffffff00010620/0x400 sp=e0000040ba468000 bsp=e0000040ba461160
BUG: soft lockup detected on CPU#0!
Call Trace:
 [<a000000100013b20>] show_stack+0x40/0xa0 sp=e0000040932ef990 bsp=e0000040932e9718
 [<a000000100013bb0>] dump_stack+0x30/0x60 sp=e0000040932efb60 bsp=e0000040932e9700
 [<a0000001000e5fe0>] softlockup_tick+0x240/0x280 sp=e0000040932efb60 bsp=e0000040932e96c0
 [<a000000100092df0>] run_local_timers+0x30/0x60 sp=e0000040932efb70 bsp=e0000040932e96a8
 [<a000000100092ea0>] update_process_times+0x80/0x100 sp=e0000040932efb70 bsp=e0000040932e9678
 [<a000000100037220>] timer_interrupt+0x180/0x360 sp=e0000040932efb70 bsp=e0000040932e9638
 [<a0000001000e6650>] handle_IRQ_event+0x90/0x120 sp=e0000040932efb70 bsp=e0000040932e95f0
 [<a0000001000e6810>] __do_IRQ+0x130/0x420 sp=e0000040932efb70 bsp=e0000040932e95a8
 [<a000000100011630>] ia64_handle_irq+0xf0/0x1a0 sp=e0000040932efb70 bsp=e0000040932e9578
 [<a00000010000c020>] __ia64_leave_kernel+0x0/0x280 sp=e0000040932efb70 bsp=e0000040932e9578
 [<a000000202a88f60>] gfs2_ail1_start_one+0x120/0x420 [gfs2] sp=e0000040932efd40 bsp=e0000040932e9508
 [<a000000202a893f0>] gfs2_ail1_start+0x190/0x220 [gfs2] sp=e0000040932efd40 bsp=e0000040932e94c8
 [<a000000202a8c050>] gfs2_log_reserve+0x1d0/0x380 [gfs2] sp=e0000040932efd40 bsp=e0000040932e9470
 [<a000000202ab7140>] gfs2_do_trans_begin+0x280/0x340 [gfs2] sp=e0000040932efd40 bsp=e0000040932e9420
 [<a000000202a93760>] gfs2_write_cache_jdata+0x2c0/0x920 [gfs2] sp=e0000040932efd40 bsp=e0000040932e9390
 [<a000000202a950b0>] gfs2_jdata_writepages+0x50/0x100 [gfs2] sp=e0000040932efdd0 bsp=e0000040932e9358
 [<a00000010010d590>] do_writepages+0xb0/0x120 sp=e0000040932efdd0 bsp=e0000040932e9330
 [<a0000001000fefd0>] __filemap_fdatawrite_range+0xb0/0xe0 sp=e0000040932efdd0 bsp=e0000040932e92f8
 [<a0000001000ff0f0>] filemap_fdatawrite_range+0x30/0x60 sp=e0000040932efe10 bsp=e0000040932e92c8
 [<a0000001000ff290>] sync_page_range+0x70/0x140 sp=e0000040932efe10 bsp=e0000040932e9280
 [<a000000100101a70>] generic_file_write+0x150/0x1a0 sp=e0000040932efe10 bsp=e0000040932e9230
 [<a000000100161200>] vfs_write+0x200/0x3a0 sp=e0000040932efe20 bsp=e0000040932e91d8
 [<a000000100161d50>] sys_write+0x70/0xe0 sp=e0000040932efe20 bsp=e0000040932e9160
 [<a00000010000bdb0>] __ia64_trace_syscall+0xd0/0x110 sp=e0000040932efe30 bsp=e0000040932e9160
 [<a000000000010620>] __start_ivt_text+0xffffffff00010620/0x400 sp=e0000040932f0000 bsp=e0000040932e9160
BUG: soft lockup detected on CPU#0!
Call Trace:
 [<a000000100013b20>] show_stack+0x40/0xa0 sp=e0000001f6f2f8a0 bsp=e0000001f6f29770
 [<a000000100013bb0>] dump_stack+0x30/0x60 sp=e0000001f6f2fa70 bsp=e0000001f6f29758
 [<a0000001000e5fe0>] softlockup_tick+0x240/0x280 sp=e0000001f6f2fa70 bsp=e0000001f6f29718
 [<a000000100092df0>] run_local_timers+0x30/0x60 sp=e0000001f6f2fa80 bsp=e0000001f6f29700
 [<a000000100092ea0>] update_process_times+0x80/0x100 sp=e0000001f6f2fa80 bsp=e0000001f6f296d0
 [<a000000100037220>] timer_interrupt+0x180/0x360 sp=e0000001f6f2fa80 bsp=e0000001f6f29690
 [<a0000001000e6650>] handle_IRQ_event+0x90/0x120 sp=e0000001f6f2fa80 bsp=e0000001f6f29650
 [<a0000001000e6810>] __do_IRQ+0x130/0x420 sp=e0000001f6f2fa80 bsp=e0000001f6f29608
 [<a000000100011630>] ia64_handle_irq+0xf0/0x1a0 sp=e0000001f6f2fa80 bsp=e0000001f6f295d0
 [<a00000010000c020>] __ia64_leave_kernel+0x0/0x280 sp=e0000001f6f2fa80 bsp=e0000001f6f295d0
 [<a000000202a88f60>] gfs2_ail1_start_one+0x120/0x420 [gfs2] sp=e0000001f6f2fc50 bsp=e0000001f6f29560
 [<a000000202a893f0>] gfs2_ail1_start+0x190/0x220 [gfs2] sp=e0000001f6f2fc50 bsp=e0000001f6f29520
 [<a000000202a8c050>] gfs2_log_reserve+0x1d0/0x380 [gfs2] sp=e0000001f6f2fc50 bsp=e0000001f6f294c8
 [<a000000202ab7140>] gfs2_do_trans_begin+0x280/0x340 [gfs2] sp=e0000001f6f2fc50 bsp=e0000001f6f29478
 [<a000000202a95a30>] gfs2_prepare_write+0x430/0x700 [gfs2] sp=e0000001f6f2fc50 bsp=e0000001f6f29400
 [<a0000001001002a0>] generic_file_buffered_write+0x3c0/0xda0 sp=e0000001f6f2fc60 bsp=e0000001f6f29330
 [<a000000100101370>] __generic_file_aio_write_nolock+0x6f0/0x800 sp=e0000001f6f2fcf0 bsp=e0000001f6f292b8
 [<a000000100101780>] __generic_file_write_nolock+0x100/0x140 sp=e0000001f6f2fd30 bsp=e0000001f6f29280
 [<a0000001001019a0>] generic_file_write+0x80/0x1a0 sp=e0000001f6f2fe10 bsp=e0000001f6f29230
 [<a000000100161200>] vfs_write+0x200/0x3a0 sp=e0000001f6f2fe20 bsp=e0000001f6f291d8
 [<a000000100161d50>] sys_write+0x70/0xe0 sp=e0000001f6f2fe20 bsp=e0000001f6f29160
 [<a00000010000bdb0>] __ia64_trace_syscall+0xd0/0x110 sp=e0000001f6f2fe30 bsp=e0000001f6f29160
 [<a000000000010620>] __start_ivt_text+0xffffffff00010620/0x400 sp=e0000001f6f30000 bsp=e0000001f6f29160

Created attachment 290856 [details]
simple fix
I haven't looked into this closely yet, but I'm pretty sure it's safe to drop
the sd_log_lock during this loop and re-acquire it when we are rescheduled.
This should stop the soft lockups from happening when using jdata.
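For readers unfamiliar with the idea, here is a minimal userspace sketch of "drop the lock, yield, re-acquire, rescan". It is not the attached patch and not GFS2 code: `struct item`, `struct queue`, `slow_write()` and `flush_all()` are hypothetical names invented for illustration, a pthread mutex stands in for a kernel spinlock such as sd_log_lock, and `sched_yield()` stands in for the kernel's cond_resched().

```c
#include <assert.h>
#include <pthread.h>
#include <sched.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical stand-in for one pending buffer on a log list. */
struct item {
    bool claimed;   /* set under the lock once a flusher owns the item */
    int writes;     /* how many times slow_write() ran on the item */
};

/* Hypothetical stand-in for the list and the lock protecting it. */
struct queue {
    pthread_mutex_t lock;
    struct item *items;
    size_t count;
};

/* Placeholder for slow I/O; called without the lock held. */
static void slow_write(struct item *it)
{
    it->writes++;
}

/*
 * Flush every unclaimed item and return how many were flushed.
 * The lock is held only while scanning. It is dropped around the slow
 * work and the yield, and the scan restarts from the top afterwards
 * because the list may have changed while the lock was released.
 */
size_t flush_all(struct queue *q)
{
    size_t flushed = 0;
    size_t i;

    assert(q != NULL);
    pthread_mutex_lock(&q->lock);
restart:
    for (i = 0; i < q->count; i++) {
        struct item *it = &q->items[i];

        if (it->claimed)
            continue;
        it->claimed = true;             /* claim while still locked */
        pthread_mutex_unlock(&q->lock);
        slow_write(it);                 /* slow part, lock not held */
        flushed++;
        sched_yield();                  /* let other threads run */
        pthread_mutex_lock(&q->lock);
        goto restart;                   /* rescan under the lock */
    }
    pthread_mutex_unlock(&q->lock);
    return flushed;
}
```

Calling `flush_all()` on a queue with two unclaimed items returns 2, and a second call returns 0. In kernel code the details differ: a spinlock must never be held across a call that can sleep or reschedule, so the lock has to be released before yielding.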
We already drop the log lock at this point. If we need to add a cond_resched() here, that's OK, but there is no need for the rest of that patch. In fact, I suspect that with spinlock debugging on, it will be flagged as a bug in its current form.

Hit this on x86 w/ kernel-2.6.18-53.el5 and kmod-gfs2-1.62-1.4 installed. I was running d_io with flocks and tank-04 was running accordion_quick and lock2.

BUG: soft lockup detected on CPU#0!
 [<c044d21c>] softlockup_tick+0x96/0xa4
 [<c042ddb0>] update_process_times+0x39/0x5c
 [<c04196fb>] smp_apic_timer_interrupt+0x5b/0x6c
 [<c04059bf>] apic_timer_interrupt+0x1f/0x24
 [<c042007b>] try_to_wake_up+0x145/0x37b
 [<c0605810>] _spin_lock+0x7/0xf
 [<f8d4f3a7>] gfs2_ail1_empty+0x13/0x7d [gfs2]
 [<f8d44539>] gfs2_logd+0x44/0x13f [gfs2]
 [<f8d444f5>] gfs2_logd+0x0/0x13f [gfs2]
 [<c0435f65>] kthread+0xc0/0xeb
 [<c0435ea5>] kthread+0x0/0xeb
 [<c0405c3b>] kernel_thread_helper+0x7/0x10
=======================
BUG: soft lockup detected on CPU#1!
 [<c044d21c>] softlockup_tick+0x96/0xa4
 [<c042ddb0>] update_process_times+0x39/0x5c
 [<c04196fb>] smp_apic_timer_interrupt+0x5b/0x6c
 [<c04059bf>] apic_timer_interrupt+0x1f/0x24
 [<c0605810>] _spin_lock+0x7/0xf
 [<f8d4f584>] gfs2_ail1_start_one+0x12b/0x15b [gfs2]
 [<f8d4f61c>] gfs2_ail1_start+0x68/0xe6 [gfs2]
 [<f8d504fe>] gfs2_log_reserve+0xc6/0x171 [gfs2]
 [<f8d5f710>] gfs2_do_trans_begin+0xe3/0x119 [gfs2]
 [<f8d53727>] gfs2_prepare_write+0x1c2/0x2d2 [gfs2]
 [<c0455c0a>] generic_file_buffered_write+0x23f/0x5f1
 [<c042a1f1>] current_fs_time+0x4a/0x55
 [<c0456462>] __generic_file_aio_write_nolock+0x4a6/0x52a
 [<f8ab510b>] gdlm_ast+0x0/0x2 [lock_dlm]
 [<c0456616>] __generic_file_write_nolock+0x86/0x9a
 [<c042da86>] lock_timer_base+0x15/0x2f
 [<c0436031>] autoremove_wake_function+0x0/0x2d
 [<c04e5231>] copy_to_user+0x31/0x48
 [<c0604a93>] mutex_lock+0xb/0x19
 [<c0456661>] generic_file_writev+0x37/0x96
 [<c045662a>] generic_file_writev+0x0/0x96
 [<c04566c0>] generic_file_write+0x0/0x94
 [<c047001f>] do_readv_writev+0x166/0x277
 [<f8d4c0dd>] gfs2_holder_uninit+0xb/0x1b [gfs2]
 [<f8d54f05>] gfs2_llseek+0x6f/0x92 [gfs2]
 [<c044abd9>] audit_syscall_entry+0x11c/0x14e
 [<c0470167>] vfs_writev+0x37/0x43
 [<c04705ab>] sys_writev+0x3c/0x96
 [<c0404eff>] syscall_call+0x7/0xb
=======================
BUG: soft lockup detected on CPU#0!
 [<c044d21c>] softlockup_tick+0x96/0xa4
 [<c042ddb0>] update_process_times+0x39/0x5c
 [<c04196fb>] smp_apic_timer_interrupt+0x5b/0x6c
 [<c04059bf>] apic_timer_interrupt+0x1f/0x24
 [<c042007b>] try_to_wake_up+0x145/0x37b
 [<c0605813>] _spin_lock+0xa/0xf
 [<f8d4f3a7>] gfs2_ail1_empty+0x13/0x7d [gfs2]
 [<f8d44539>] gfs2_logd+0x44/0x13f [gfs2]
 [<f8d444f5>] gfs2_logd+0x0/0x13f [gfs2]
 [<c0435f65>] kthread+0xc0/0xeb
 [<c0435ea5>] kthread+0x0/0xeb
 [<c0405c3b>] kernel_thread_helper+0x7/0x10
=======================
BUG: soft lockup detected on CPU#1!
 [<c044d21c>] softlockup_tick+0x96/0xa4
 [<c042ddb0>] update_process_times+0x39/0x5c
 [<c04196fb>] smp_apic_timer_interrupt+0x5b/0x6c
 [<c04059bf>] apic_timer_interrupt+0x1f/0x24
 [<c0605810>] _spin_lock+0x7/0xf
 [<f8d4f584>] gfs2_ail1_start_one+0x12b/0x15b [gfs2]
 [<f8d4f61c>] gfs2_ail1_start+0x68/0xe6 [gfs2]
 [<f8d504fe>] gfs2_log_reserve+0xc6/0x171 [gfs2]
 [<f8d5f710>] gfs2_do_trans_begin+0xe3/0x119 [gfs2]
 [<f8d53727>] gfs2_prepare_write+0x1c2/0x2d2 [gfs2]
 [<c0455c0a>] generic_file_buffered_write+0x23f/0x5f1
 [<c042a1f1>] current_fs_time+0x4a/0x55
 [<c0456462>] __generic_file_aio_write_nolock+0x4a6/0x52a
 [<f8ab510b>] gdlm_ast+0x0/0x2 [lock_dlm]
 [<c0456616>] __generic_file_write_nolock+0x86/0x9a
 [<c042da86>] lock_timer_base+0x15/0x2f
 [<c0436031>] autoremove_wake_function+0x0/0x2d
 [<c04e5231>] copy_to_user+0x31/0x48
 [<c0604a93>] mutex_lock+0xb/0x19
 [<c0456661>] generic_file_writev+0x37/0x96
 [<c045662a>] generic_file_writev+0x0/0x96
 [<c04566c0>] generic_file_write+0x0/0x94
 [<c047001f>] do_readv_writev+0x166/0x277
 [<f8d4c0dd>] gfs2_holder_uninit+0xb/0x1b [gfs2]
 [<f8d54f05>] gfs2_llseek+0x6f/0x92 [gfs2]
 [<c044abd9>] audit_syscall_entry+0x11c/0x14e
 [<c0470167>] vfs_writev+0x37/0x43
 [<c04705ab>] sys_writev+0x3c/0x96
 [<c0404eff>] syscall_call+0x7/0xb
=======================
BUG: soft lockup detected on CPU#0!
 [<c044d21c>] softlockup_tick+0x96/0xa4
 [<c042ddb0>] update_process_times+0x39/0x5c
 [<c04196fb>] smp_apic_timer_interrupt+0x5b/0x6c
 [<c04059bf>] apic_timer_interrupt+0x1f/0x24
 [<c042007b>] try_to_wake_up+0x145/0x37b
 [<c0605813>] _spin_lock+0xa/0xf
 [<f8d4f3a7>] gfs2_ail1_empty+0x13/0x7d [gfs2]
 [<f8d44539>] gfs2_logd+0x44/0x13f [gfs2]
 [<f8d444f5>] gfs2_logd+0x0/0x13f [gfs2]
 [<c0435f65>] kthread+0xc0/0xeb
 [<c0435ea5>] kthread+0x0/0xeb
 [<c0405c3b>] kernel_thread_helper+0x7/0x10
=======================
BUG: soft lockup detected on CPU#1!
 [<c044d21c>] softlockup_tick+0x96/0xa4
 [<c042ddb0>] update_process_times+0x39/0x5c
 [<c04196fb>] smp_apic_timer_interrupt+0x5b/0x6c
 [<c04059bf>] apic_timer_interrupt+0x1f/0x24
 [<c0605813>] _spin_lock+0xa/0xf
 [<f8d4f584>] gfs2_ail1_start_one+0x12b/0x15b [gfs2]
 [<f8d4f61c>] gfs2_ail1_start+0x68/0xe6 [gfs2]
 [<f8d504fe>] gfs2_log_reserve+0xc6/0x171 [gfs2]
 [<f8d5f710>] gfs2_do_trans_begin+0xe3/0x119 [gfs2]
 [<f8d53727>] gfs2_prepare_write+0x1c2/0x2d2 [gfs2]
 [<c0455c0a>] generic_file_buffered_write+0x23f/0x5f1
 [<c042a1f1>] current_fs_time+0x4a/0x55
 [<c0456462>] __generic_file_aio_write_nolock+0x4a6/0x52a
 [<f8ab510b>] gdlm_ast+0x0/0x2 [lock_dlm]
 [<c0456616>] __generic_file_write_nolock+0x86/0x9a
 [<c042da86>] lock_timer_base+0x15/0x2f
 [<c0436031>] autoremove_wake_function+0x0/0x2d
 [<c04e5231>] copy_to_user+0x31/0x48
 [<c0604a93>] mutex_lock+0xb/0x19
 [<c0456661>] generic_file_writev+0x37/0x96
 [<c045662a>] generic_file_writev+0x0/0x96
 [<c04566c0>] generic_file_write+0x0/0x94
 [<c047001f>] do_readv_writev+0x166/0x277
 [<f8d4c0dd>] gfs2_holder_uninit+0xb/0x1b [gfs2]
 [<f8d54f05>] gfs2_llseek+0x6f/0x92 [gfs2]
 [<c044abd9>] audit_syscall_entry+0x11c/0x14e
 [<c0470167>] vfs_writev+0x37/0x43
 [<c04705ab>] sys_writev+0x3c/0x96
 [<c0404eff>] syscall_call+0x7/0xb
 [<c04566c0>] generic_file_write+0x0/0x94
 [<c047001f>] do_readv_writev+0x166/0x277
 [<f8d4c0dd>] gfs2_holder_uninit+0xb/0x1b [gfs2]
 [<f8d54f05>] gfs2_llseek+0x6f/0x92 [gfs2]
 [<c044abd9>] audit_syscall_entry+0x11c/0x14e
 [<c0470167>] vfs_writev+0x37/0x43
 [<c04705ab>] sys_writev+0x3c/0x96
 [<c0404eff>] syscall_call+0x7/0xb

I have been running Dean's g2 tests on IA64, along with Nate's tests, but haven't hit this lockup yet. I'm moving this to NEEDINFO. If anybody sees this again, please post your test scenario.