Bug 1309838
Summary: | GFS2 lockdep deadlock warning (sd_log_flush_lock vs gl->gl_work.work) | ||
---|---|---|---|
Product: | [Fedora] Fedora | Reporter: | Andrew Price <anprice> |
Component: | kernel | Assignee: | gfs2-maint |
Status: | CLOSED WORKSFORME | QA Contact: | Fedora Extras Quality Assurance <extras-qa> |
Severity: | unspecified | Docs Contact: | |
Priority: | unspecified | ||
Version: | rawhide | CC: | gansalmon, itamar, jonathan, kernel-maint, madhu.chinakonda, mchehab, rpeterso |
Target Milestone: | --- | ||
Target Release: | --- | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | Bug Fix | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2019-10-16 16:21:46 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Description
Andrew Price
2016-02-18 19:46:01 UTC
Hi Andy, I noticed in your call trace above that the shrinker is calling evict. Excerpt: [ 373.840558] [<ffffffff812b0308>] evict+0xb8/0x180 [ 373.841356] [<ffffffff812b0414>] dispose_list+0x44/0x70 [ 373.842216] [<ffffffff812b164a>] prune_icache_sb+0x5a/0x80 [ 373.843125] [<ffffffff812937be>] super_cache_scan+0x14e/0x1a0 [ 373.844051] [<ffffffff81205bf6>] shrink_slab.part.42+0x216/0x540 [ 373.845004] [<ffffffff8120b255>] shrink_zone+0x2f5/0x300 The following comment and associated patch might be relevant to this problem: https://bugzilla.redhat.com/show_bug.cgi?id=1255872#c30 Perhaps you can try that patch and see if the problem still recreates? I managed to reproduce the bug with a fresh 4.5-rc7 kernel, first time (took approx. 100,000 files) so I tried again with the patch from https://bugzilla.redhat.com/show_bug.cgi?id=1255872#c30 and I haven't seen the lockdep splat yet. However, at around file 300,000 I saw a couple of hung task warnings for gfs2_quotad: [ 2340.901808] INFO: task gfs2_quotad:742 blocked for more than 90 seconds. [ 2340.902919] Tainted: G W 4.5.0-rc7-00230-g20698c9-dirty #56 [ 2340.903860] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. [ 2340.905698] gfs2_quotad D ffff880027093b30 0 742 2 0x00000000 [ 2340.906640] ffff880027093b30 ffff88003ff7b6b0 ffff88003db00000 ffff880039610000 [ 2340.907628] ffff880027094000 ffff88003ff7b6b0 ffff880027093bc8 ffffffff81808b00 [ 2340.908595] ffff880027093bb0 ffff880027093b48 ffffffff8180819c 0000000000000002 [ 2340.909558] Call Trace: [ 2340.909897] [<ffffffff81808b00>] ? out_of_line_wait_on_atomic_t+0xf0/0xf0 [ 2340.910728] [<ffffffff8180819c>] schedule+0x3c/0x90 [ 2340.911362] [<ffffffff81808b11>] bit_wait+0x11/0x60 [ 2340.912005] [<ffffffff8180870d>] __wait_on_bit+0x5d/0x90 [ 2340.912666] [<ffffffff810b4e7a>] ? finish_task_switch+0x6a/0x210 [ 2340.913407] [<ffffffff81808b00>] ? out_of_line_wait_on_atomic_t+0xf0/0xf0 [ 2340.914276] [<ffffffff81808872>] out_of_line_wait_on_bit+0x82/0xb0 [ 2340.916050] [<ffffffff810d4eb0>] ? autoremove_wake_function+0x40/0x40 [ 2340.917510] [<ffffffff8140b9f5>] gfs2_glock_dq_wait+0x65/0x70 [ 2340.918251] [<ffffffff81426c61>] gfs2_evict_inode+0x111/0x470 [ 2340.919936] [<ffffffff8180dc67>] ? _raw_spin_unlock+0x27/0x40 [ 2340.921679] [<ffffffff81252368>] evict+0xb8/0x180 [ 2340.923073] [<ffffffff8125246b>] dispose_list+0x3b/0x70 [ 2340.924652] [<ffffffff8125281a>] prune_icache_sb+0x5a/0x80 [ 2340.926292] [<ffffffff81236caf>] super_cache_scan+0x14f/0x1a0 [ 2340.928031] [<ffffffff81420283>] gfs2_quotad+0x113/0x420 [ 2340.929662] [<ffffffff810d4e70>] ? wake_atomic_t_function+0x70/0x70 [ 2340.931265] [<ffffffff81420170>] ? gfs2_wake_up_statfs+0x40/0x40 [ 2340.932724] [<ffffffff810abe1e>] kthread+0xfe/0x120 [ 2340.933691] [<ffffffff810abd20>] ? __kthread_parkme+0x90/0x90 [ 2340.934443] [<ffffffff8180e89f>] ret_from_fork+0x3f/0x70 [ 2340.935127] [<ffffffff810abd20>] ? __kthread_parkme+0x90/0x90 [ 2340.935828] 1 lock held by gfs2_quotad/742: [ 2340.936340] #0: (&type->s_umount_key#36){.+.+..}, at: [<ffffffff81236b2b>] trylock_super+0x1b/0x50 3 years later and this bug isn't reproducing any more so I'm closing this one. |