| Summary: | apparent deadlock/hang after nfs4_reclaim_open_state messages | ||
|---|---|---|---|
| Product: | Red Hat Enterprise Linux 5 | Reporter: | Kevan Carstensen <kacarstensen> |
| Component: | kernel | Assignee: | nfs-maint |
| Status: | CLOSED NOTABUG | QA Contact: | Red Hat Kernel QE team <kernel-qe> |
| Severity: | urgent | Docs Contact: | |
| Priority: | medium | ||
| Version: | 5.6 | CC: | bfields, dhowells, harshula, jlayton, rdassen, rwheeler, sprabhu, steved |
| Target Milestone: | rc | ||
| Target Release: | --- | ||
| Hardware: | x86_64 | ||
| OS: | Linux | ||
| Whiteboard: | |||
| Fixed In Version: | Doc Type: | Bug Fix | |
| Doc Text: | Story Points: | --- | |
| Clone Of: | Environment: | ||
| Last Closed: | 2012-01-18 19:50:37 UTC | Type: | --- |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | Category: | --- | |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
|
Description
Kevan Carstensen
2011-05-05 22:13:51 UTC
Those errors are ESTALE and NFS4ERR_BAD_SEQID, for what it's worth.
Unlike RHEL5, upstream and RHEL6 handle ESTALE in nfs4_reclaim_open_state; In wonder whether we need something like this?:
diff --git a/fs/nfs/nfs4state.c b/fs/nfs/nfs4state.c
index f664bb7..6fbfec7 100644
--- a/fs/nfs/nfs4state.c
+++ b/fs/nfs/nfs4state.c
@@ -733,6 +733,7 @@ static int nfs4_reclaim_open_state(struct nfs4_state_recovery
case -ENOENT:
case -NFS4ERR_RECLAIM_BAD:
case -NFS4ERR_RECLAIM_CONFLICT:
+ case -ESTALE:
/*
* Open state on this file cannot be recovered
* All we can do is revert to using the zero stat
(Whoops, jlayton points out all that does is skip a printk!) If it was easy it might help to try kernels between -194 and -238 to see if we can pin down when the regression happened. Also, when you say "The kernel logs something *like* the following".... Is that a literal excerpt, is there anything interesting before it, and how does it continue? (In particular, do those final "unhandled error -10026" warnings continue until logging fails completely, or are there only a few more?) Finally, if you could contact support they could help with the debugging. That's a literal excerpt. There's nothing interesting before it, and the last statement in the excerpt is the last log message before the logging fails completely. Sorry for the ambiguity on that. We've opened a support ticket; thanks for the suggestion. |