Bug 345301
Summary: | NFS client, NLM, java 1.4.2_08, readlock hang on futex | ||
---|---|---|---|
Product: | Red Hat Enterprise Linux 4 | Reporter: | Guy Streeter <streeter> |
Component: | kernel | Assignee: | Michal Schmidt <mschmidt> |
Status: | CLOSED WONTFIX | QA Contact: | Martin Jenner <mjenner> |
Severity: | medium | Docs Contact: | |
Priority: | medium | ||
Version: | 4.5 | CC: | andy.fletcher, ivecera, nobody, peterm, tao |
Target Milestone: | --- | ||
Target Release: | --- | ||
Hardware: | i686 | ||
OS: | Linux | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | Bug Fix | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2008-03-13 20:36:04 UTC | Type: | --- |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: | |||
Bug Depends On: | 166292 | ||
Bug Blocks: | 422551, 430698 |
Description
Guy Streeter
2007-10-22 14:46:21 UTC
Several different issues are mixed together in this BZ and related ITs. The original report was about file locking over NFS (using NLM). In some cases it takes tens of seconds to obtain the lock. It could be caused by a race condition where NLM_GRANTED_MSG from the server arrives before the client prepares for it. That would be bug 432855. A patched kernel for testing is available from Jeff Layton's page: http://people.redhat.com/jlayton/ Could someone with the long NFS locking delays test it? Then we have some futex-related reports here, e.g. the hangs with LDM. This could be explained by bug 217067 which was closed in November. Could an LDM user retest the futex problem with a current RHEL4 kernel? I've been working on the NFS locking bug before bug 432855 came to my attention today. During that, I wrote a testcase for NLM which shows another bug. When a client waiting for NLM_GRANTED_MSG is interrupted by a signal, the file stays locked forever. The bug is present in RHEL5 and current upstream. I suspect this code in do_setlk() is harmful: if (status == -EINTR || status == -ERESTARTSYS) { do_vfs_lock(filp, fl); } The patch for bug 432855 is now present in current RHEL4 development builds: http://people.redhat.com/vgoyal/rhel4/ It should fix the 30 seconds NFS locking delays. Please test. The futex hang is something else completely and should have been in a separate BZ. Possibly fixed already in kernel-2.6.9-67.EL. I'm discussing the problem of files remaining locked forever on NFS on upstream mailing list. This will be another BZ. Devel NAK: this bug is either a duplicate of 432855 or 217067 (then it doesn't really matter) or it's not (then I won't be able to provide a fix in 4.7). Development Management has reviewed and declined this request. You may appeal this decision by reopening this request. |