Bug 472221

Summary: configfs lockdep warning when starting dlm/cman
Product: [Fedora] Fedora Reporter: Jeff Layton <jlayton>
Component: kernelAssignee: Kernel Maintainer List <kernel-maint>
Status: CLOSED UPSTREAM QA Contact: Fedora Extras Quality Assurance <extras-qa>
Severity: medium Docs Contact:
Priority: medium    
Version: rawhideCC: kernel-maint, quintela, steved, teigland
Target Milestone: ---   
Target Release: ---   
Hardware: All   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2008-11-19 21:22:14 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Jeff Layton 2008-11-19 13:53:42 UTC
I configured my rawhide xen guest as a single-node cluster and got the following messages in the ring buffer. This may actually be a problem in configfs:

Bridge firewalling registered
virbr0: Dropping NETIF_F_UFO since no NETIF_F_HW_CSUM feature.
virbr0: starting userspace STP failed, starting kernel STP
SELinux: initialized (dev proc, type proc), uses genfs_contexts
Not cloning cgroup for unused subsystem ns
virbr0: no IPv6 routers present
SELinux: initialized (dev configfs, type configfs), uses genfs_contexts
DLM (built Nov 17 2008 03:44:49) installed
GFS2 (built Nov 17 2008 03:45:53) installed
Lock_DLM (built Nov 17 2008 03:46:04) installed

=============================================
[ INFO: possible recursive locking detected ]
2.6.27.5-113.fc10.x86_64.debug #1
---------------------------------------------
dlm_controld/2883 is trying to acquire lock:
 (&sb->s_type->i_mutex_key#11/2){--..}, at: [<ffffffffa00d1c4e>] configfs_attach_group+0x4a/0x183 [configfs]

but task is already holding lock:
 (&sb->s_type->i_mutex_key#11/2){--..}, at: [<ffffffffa00d1c4e>] configfs_attach_group+0x4a/0x183 [configfs]

other info that might help us debug this:
2 locks held by dlm_controld/2883:
 #0:  (&sb->s_type->i_mutex_key#10/1){--..}, at: [<ffffffff810d1525>] lookup_create+0x26/0x94
 #1:  (&sb->s_type->i_mutex_key#11/2){--..}, at: [<ffffffffa00d1c4e>] configfs_attach_group+0x4a/0x183 [configfs]

stack backtrace:
Pid: 2883, comm: dlm_controld Not tainted 2.6.27.5-113.fc10.x86_64.debug #1

Call Trace:
 [<ffffffff81064759>] __lock_acquire+0x82c/0xc06
 [<ffffffff81064bc0>] lock_acquire+0x8d/0xba
 [<ffffffffa00d1c4e>] ? configfs_attach_group+0x4a/0x183 [configfs]
 [<ffffffff8134ce3a>] __mutex_lock_common+0xed/0x35d
 [<ffffffffa00d1c4e>] ? configfs_attach_group+0x4a/0x183 [configfs]
 [<ffffffff8114762e>] ? security_d_instantiate+0x1f/0x21
 [<ffffffffa00d1c4e>] ? configfs_attach_group+0x4a/0x183 [configfs]
 [<ffffffff8134d153>] mutex_lock_nested+0x35/0x3a
 [<ffffffffa00d1c4e>] configfs_attach_group+0x4a/0x183 [configfs]
 [<ffffffff8134e5d3>] ? _spin_unlock+0x26/0x2a
 [<ffffffffa00d1cfe>] configfs_attach_group+0xfa/0x183 [configfs]
 [<ffffffffa00d1fbc>] configfs_mkdir+0x235/0x320 [configfs]
 [<ffffffff810d180e>] vfs_mkdir+0x72/0xc6
 [<ffffffff810d41de>] sys_mkdirat+0xa2/0xf5
 [<ffffffff8134e140>] ? trace_hardirqs_on_thunk+0x3a/0x3f
 [<ffffffff81063796>] ? trace_hardirqs_on_caller+0x10f/0x133
 [<ffffffff810d4244>] sys_mkdir+0x13/0x15
 [<ffffffff8101022a>] system_call_fastpath+0x16/0x1b

Comment 1 Dave Jones 2008-11-19 20:37:11 UTC
You're probably best off taking this straight to upstream given we don't touch this code at all in the Fedora kernel.

Comment 2 Jeff Layton 2008-11-19 21:14:57 UTC
Thanks Dave. I'll fire off an email to LKML with the trace...

Comment 3 Jeff Layton 2008-11-19 21:22:14 UTC
Ahh, spoke with Joel Becker on IRC and he said that this is a known bug that he's planning to fix. I'll close this BZ...