Bug 570323

Summary: gajim.py blocked for more than 120 seconds
Product: [Fedora] Fedora Reporter: Laurent Le Grandois <llg>
Component: gajimAssignee: Debarshi Ray <debarshir>
Status: CLOSED DUPLICATE QA Contact: Fedora Extras Quality Assurance <extras-qa>
Severity: high Docs Contact:
Priority: low    
Version: 12CC: debarshir, fedora, mschmidt
Target Milestone: ---   
Target Release: ---   
Hardware: x86_64   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2010-03-04 07:55:35 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Laurent Le Grandois 2010-03-03 21:27:44 UTC
Hi, 

 with gajim-0.13.2-1.fc12 (from updates-testing), I got this message :

Mar  3 22:06:53 osc-llg kernel: gajim.py      D 00000000ffffffff     0  4540   2288 0x00000080
Mar  3 22:06:53 osc-llg kernel: ffff88008d02bb08 0000000000000086 ffff88008d02bad0 ffff88008d02bacc
Mar  3 22:06:53 osc-llg kernel: 0000000000000000 0000000000000400 ffff88008d02bfd8 ffff88008d02bfd8
Mar  3 22:06:53 osc-llg kernel: ffff88008fef6158 000000000000f980 0000000000015740 ffff88008fef6158
Mar  3 22:06:53 osc-llg kernel: Call Trace:
Mar  3 22:06:53 osc-llg kernel: [<ffffffff81454c42>] __mutex_lock_common+0x11a/0x181
Mar  3 22:06:53 osc-llg kernel: [<ffffffff81454cc2>] __mutex_lock_slowpath+0x19/0x1b
Mar  3 22:06:53 osc-llg kernel: [<ffffffff81454de2>] mutex_lock+0x36/0x50
Mar  3 22:06:53 osc-llg kernel: [<ffffffff8142be5c>] unix_dgram_recvmsg+0x5b/0x1ef
Mar  3 22:06:53 osc-llg kernel: [<ffffffff813a36b2>] __sock_recvmsg+0x71/0x7e
Mar  3 22:06:53 osc-llg kernel: [<ffffffff813a3c6e>] sock_recvmsg+0xcf/0xe8
Mar  3 22:06:53 osc-llg kernel: [<ffffffff810748a3>] ? autoremove_wake_function+0x0/0x39
Mar  3 22:06:53 osc-llg kernel: [<ffffffff8111469e>] ? __mem_cgroup_try_charge+0x4d/0x1c3
Mar  3 22:06:53 osc-llg kernel: [<ffffffff810dfad0>] ? page_evictable+0x17/0x81
Mar  3 22:06:53 osc-llg kernel: [<ffffffff810de9a5>] ? lru_cache_add_lru+0x27/0x29
Mar  3 22:06:53 osc-llg kernel: [<ffffffff810f960e>] ? page_add_new_anon_rmap+0x3f/0x4e
Mar  3 22:06:53 osc-llg kernel: [<ffffffff813a36f3>] ? sockfd_lookup_light+0x20/0x58
Mar  3 22:06:53 osc-llg kernel: [<ffffffff813a5037>] sys_recvfrom+0xd7/0x141
Mar  3 22:06:53 osc-llg kernel: [<ffffffff8122b14d>] ? __up_read+0x76/0x81
Mar  3 22:06:53 osc-llg kernel: [<ffffffff81077eb6>] ? up_read+0xe/0x10
Mar  3 22:06:53 osc-llg kernel: [<ffffffff81458133>] ? do_page_fault+0x270/0x2a0
Mar  3 22:06:53 osc-llg kernel: [<ffffffff810a8fe5>] ? audit_syscall_entry+0x11e/0x14a
Mar  3 22:06:53 osc-llg kernel: [<ffffffff81011d32>] system_call_fastpath+0x16/0x1b
Mar  3 22:06:53 osc-llg kernel: INFO: task gajim.py:4541 blocked for more than 120 seconds.

and load average of my dual-core is always greater than 4.

With gajim-0.12.5-1.fc12, load average is 0.06


Kernel 2.6.31.12-174.2.22.fc12.x86_64

Thanks

llg

Comment 1 Michal Schmidt 2010-03-04 07:55:35 UTC
It's a kernel bug in which causes both symptoms (tasks stuck in uninterruptible sleep are accounted into the load average and when they stay in this state for too long, they will trigger the warning).

*** This bug has been marked as a duplicate of bug 529202 ***

Comment 2 Michal Schmidt 2010-03-08 19:24:00 UTC
Since this would be perceived as a regression, I made a workaround in gajim-0.13.3-1.fc12.1 (disabled libasyncns) to avoid triggering the kernel bug.

The new package is on the way to updates-testing:
https://admin.fedoraproject.org/updates/gajim-0.13.3-1.fc12.1

Please let me know how it works.

Comment 3 Laurent Le Grandois 2010-03-08 21:57:14 UTC
Hi Michal, 

  I downloaded your version (gajim-0.13.3-1.fc12.1), and it seems to be ok : no more errors in /var/log/messages and load average stays very low.

I will have more feedback tomorrow after a complete working day.

Thanks

Laurent