Bug 37572

Summary: zombie lockd processes after upgrade to nfs-utils-0.3.1-0.6.x.1
Product: [Retired] Red Hat Linux Reporter: Pawel Salek <pawsa>
Component: nfs-utils Assignee: Pete Zaitcev <zaitcev>
Status: CLOSED CURRENTRELEASE QA Contact: David Lawrence <dkl>
Severity: low Docs Contact:
Priority: medium    
Version: 6.2 CC: ajb
Target Milestone: ---   
Target Release: ---   
Hardware: i386   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Last Closed: 2003-06-04 06:31:24 UTC Type: ---

Description Pawel Salek 2001-04-25 08:43:34 UTC
From Bugzilla Helper:
User-Agent: Mozilla/5.0 (X11; U; Linux 2.2.19-6.2.1 i686; en-US; Galeon) Gecko/20010216


After upgrading to nfs-utils-0.3.1-0.6.x.1 on our network (Linux RH 6.2/7.0
and Tru64), zombie lockd processes started to appear on the Linux NFS server.

Reproducible: Sometimes
Steps to Reproduce:
1. Use nfs-utils-0.3.1-0.6.x.1
2. Wait 1-2 days. Reboot some clients in the meantime.

	

Actual Results:  The ps afx output shows:

root     26178  0.0  0.0     0    0 ?        SW   Apr24   0:08 [nfsd]
root     26186  0.0  0.0     0    0 ?        SW   Apr24   0:00  \_ [lockd]
root     26187  0.0  0.0     0    0 ?        SW   Apr24   0:00      \_ [rpciod]
root     28739  0.0  0.0     0    0 ?        Z    Apr24   0:00      \_ [lockd <defunct>]


Expected Results:
root      1112  0.0  0.0     0    0 pts/0    SW   10:41   0:00 [nfsd]
root      1120  0.0  0.0     0    0 pts/0    SW   10:41   0:00  \_ [lockd]
root      1121  0.0  0.0     0    0 pts/0    SW   10:41   0:00      \_ [rpciod]


This is not a big deal but having too many zombies hanging around can be
slightly scary and I don't feel like upgrading to 2.4.2 on our main server yet.

Comment 1 Pawel Salek 2001-04-27 07:36:59 UTC
For the record: I rebooted some Linux clients and haven't got any new zombie
processes since the report date (two days ago). Simple disk remounting on Tru64
boxes does not create zombies either. It is strange in a way, because before I
reported the problem, I had to restart nfs services twice to get rid of zombies.
I will keep you informed.

Comment 2 Pawel Salek 2001-04-30 07:39:02 UTC
I should add that my attempts to reproduce this behavior failed.

Comment 3 Bob Matthews 2001-04-30 14:12:42 UTC
Okay, good to know that the zombie lockd's are not reappearing.

If you see this behaviour in the future, please reopen this bug report.

Comment 4 Pawel Salek 2001-05-17 12:03:45 UTC
I have actually observed it again:

root      1112  0.0  0.0     0     SW   Apr25   4:01 [nfsd]
root      1120  0.0  0.0     0     SW   Apr25   0:01  \_ [lockd]
root      1121  0.0  0.0     0     SW   Apr25   0:00      \_ [rpciod]
root     21020  0.0  0.0     0     Z    May08   0:00      \_ [lockd <defunct>]
root      1113  0.0  0.0     0     SW   Apr25   4:03 [nfsd]

I left it hanging around, so if you need more information (or want to correlate
it with some info from log files, for instance), let me know.


Comment 5 Pawel Salek 2001-05-21 08:07:11 UTC
FYI: yet another zombie process appeared a couple of days ago:
root      1112  0.0  0.0     0    SW   Apr25   4:21 [nfsd]
root      1120  0.0  0.0     0    SW   Apr25   0:01  \_ [lockd]
root      1121  0.0  0.0     0    SW   Apr25   0:00      \_ [rpciod]
root     21020  0.0  0.0     0    Z    May08   0:00      \_ [lockd <defunct>]
root      6089  0.0  0.0     0    Z    May18   0:00      \_ [lockd <defunct>]
Is it lockd that does not wait for its children? I guess I could look at the
source if I knew where to start from.
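
The general mechanism behind the question above can be shown with a minimal user-space sketch. It is only an illustration of how a zombie arises when a parent does not wait for an exited child; it is not the lockd code, which runs as a kernel thread in the listings above. The helper names `proc_state` and `demo` are hypothetical, and the sketch assumes a Linux system with /proc mounted.

```python
import os
import time


def proc_state(pid):
    """Return the one-letter state of a process from /proc/<pid>/stat (Linux only)."""
    with open(f"/proc/{pid}/stat") as f:
        stat = f.read()
    # The state field is the first field after the parenthesised command name.
    return stat.rsplit(")", 1)[1].split()[0]


def demo():
    """Fork a child that exits at once, observe it as a zombie, then reap it."""
    pid = os.fork()
    if pid == 0:
        os._exit(0)  # child: exit immediately

    # Parent: deliberately do not wait yet; poll until the child shows up as 'Z'.
    state = ""
    for _ in range(50):
        state = proc_state(pid)
        if state == "Z":
            break
        time.sleep(0.1)

    # Reaping the child with waitpid() removes the zombie entry.
    os.waitpid(pid, 0)
    reaped = not os.path.exists(f"/proc/{pid}")
    return state, reaped
```

Calling `demo()` returns the state seen before reaping and whether the process entry disappeared after `waitpid()`. A parent that never makes that call leaves the `<defunct>` entry in place until the parent itself exits.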

Comment 6 Pawel Salek 2001-07-13 15:57:17 UTC
I have noticed a correlation: the zombie process is created when one of the
client boxes is rebooted. When it tries to mount the NFS again (or connect to
the remote lockd; I do not have sufficient time resolution to tell which), a
zombie process is created.
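
One way to get better time resolution for this correlation is to poll the process table and log when a new defunct lockd first appears. The following is a rough sketch, not a tested tool: the function names `find_zombies` and `watch` are hypothetical, and it assumes a procps-style `ps` that accepts `-eo pid=,stat=,args=` (headerless PID, state and command columns).

```python
import subprocess
import time


def find_zombies(ps_text, name="lockd"):
    """Return PIDs of zombie processes whose command contains `name`.

    Expects lines of the form "<pid> <stat> <command...>".
    """
    zombies = []
    for line in ps_text.splitlines():
        fields = line.split(None, 2)
        if len(fields) < 3:
            continue
        pid, stat, args = fields
        if pid.isdigit() and stat.startswith("Z") and name in args:
            zombies.append(int(pid))
    return zombies


def watch(interval=5.0):
    """Poll ps forever and print a timestamp when a new zombie is first seen."""
    seen = set()
    while True:
        out = subprocess.run(
            ["ps", "-eo", "pid=,stat=,args="],
            capture_output=True, text=True, check=True,
        ).stdout
        for pid in find_zombies(out):
            if pid not in seen:
                seen.add(pid)
                print(time.strftime("%Y-%m-%d %H:%M:%S"), "new zombie", pid, flush=True)
        time.sleep(interval)
```

Running `watch()` on the server and comparing its timestamps with the client reboot times and the server logs would show whether the zombie appears at remount or at the lockd reconnect.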

Comment 7 Andreas J. Bathe 2001-08-22 21:16:36 UTC
The same problem occurs on our main fileserver:
  557 ?        SW    11:54 [nfsd]
  565 ?        SW     0:00  \_ [lockd]
  566 ?        SW     0:00      \_ [rpciod]
12799 ?        Z      0:00      \_ [lockd <defunct>]
 9876 ?        Z      0:00      \_ [lockd <defunct>]
Before the update the zombies didn't appear; will it be fixed someday?

Comment 8 Bob Matthews 2001-08-22 21:33:53 UTC
> will it be fixed someday?

There is an experimental patch for 2.4.7 kernels at
http://www.fys.uio.no/~trondmy/src/2.4.7/linux-2.4.7-reclaim.dif
but Linus has not yet accepted the patch into the mainstream.

If/when Linus accepts the patch, a backport to the 2.2 kernel series is
possible, in which case we would issue it as a kernel errata.  Unfortunately, I
can't offer a firm timeframe.

Comment 9 Andreas J. Bathe 2001-09-24 22:33:39 UTC
I think the patch made it into 2.4.10 finally...

Comment 10 Pete Zaitcev 2003-06-04 03:52:14 UTC
Does it work ok on RHL9?


Comment 11 Pawel Salek 2003-06-04 06:00:58 UTC
I cannot tell about RH9, but RH7.3 with current updates is OK (i.e. problem
resolved). Thanks! (Should the resolution be "current release" or "errata"?)

Comment 12 Pete Zaitcev 2003-06-04 06:31:24 UTC
"Currentrelease" sounds good. Closing.