User-Agent: Mozilla/5.0 (Macintosh; U; PPC Mac OS X 10_4_11; en) AppleWebKit/531.9 (KHTML, like Gecko) Version/4.0.3 Safari/531.9
I upgraded my machine from Fedora 10 to 11 (clean install), put the files back in place, then one day, I noticed that my LDAP server had stopped responding to clients' queries. After looking into it, I found that it had run out of available file descriptors. The reason for this is that it seems to keep opening this library, /lib64/libnspr4.so, and then not close it.
I'm running this on a virtual machine with this kernel version:
Steps to Reproduce:
1. Start slapd (normal options: "-u ldap -h ldap:///")
slapd will eventually run out of file descriptors, which then causes it to stop accepting connections from clients (presumably the accept(2) system call fails). I restart the process (service ldap restart), and its reset and fine for a little while again.
It shouldn't stop responding to clients.
None really. I have a pretty standard configuration file. I'll include it as an attachment for reference as soon as I figure out how (this is the first bug report I've filed).
Created attachment 361192 [details]
Configuration file for slapd, in case its relevant.
I've only removed access controls, default user information, and comments. Everything else should be intact.
I tried to reproduce the issue with no success. Could you please provide me straces of slapd threads using following command?
strace -Fffxvto slapd.strace slapd -u ldap -h ldap:///
Just keep the strace running for enough time to see where and how are those files open (you can watch lsof -u ldap and after several new file descriptors to libnspr4.so are open you can end the tracing). Hopefully those straces will give us an idea where those descriptors are open.
I've attached strace to an already-running slapd process with several of these files already open. If you'd like me to start a fresh process for the strace, just say so, and I'll do that instead.
Attaching strace to running process is fine, the important thing is the presence of at least one call opening the library in the strace. If it is present in your current strace, you can attach the result to this bug.
Created attachment 364972 [details]
strace of one of the threads
This is the strace of one of the threads for the process. Note that there were 4 threads, but only two of them opened the libnspr4.so library. I'll be attaching the other strace momentarily.
Created attachment 364973 [details]
The other thread's strace
Both this and the strace attachment immediately before it were attached for about 1.5-2 days. During that time the number of open file descriptors for that library went up by about 75. If you need any other help from me or would like me to look into anything else, just let me know. I looked at the strace's but don't have too much experience looking at them.
I'd also like to apologize for how long it took to get those to you... I ran into some problems, but I think I've gotten everything now.
Thank you for valuable info. I'm working on it, but it may take some time, because I have some other work as well. I'll keep you posted.
Ok, thanks for your help. If you need or would like me to help in any way, just let me know.
I need another set of straces. They should be the same as before, but this time could you please attach relevant part of syslog as well? Maybe not syslog (I'm not sure which log file it is), but I will need log which contains records like this:
<167>Oct 13 02:12:14 slapd .....
<167>Oct 13 02:12:15 slapd .....
Hopefully this will provide me some more valuable information. I might need debug output as well, but for now this should be enough, I will contact you in case more data is needed.
Created attachment 378615 [details]
slapd traces and messages
This is the output files for the strace. I used the same flags you gave me a while back. I've included all the files, and I ran a "tail -F /var/log/messages" when I started the strace and stopped it once I stopped the strace. I took a look though and there aren't any messages from slapd. We may need to turn debugging on, as you suggested....
Yes. I figured those log messages in strace are sent to systog daemon, but perhaps not ...
Try tu run the debug mode, I think -d 1 should do the trick. I need something which sould give me a link between the strace and function entry/exit points, so I can see where in the program the library is created. Also, I found -s option which could be fiven to strace, so longer string could be printed. It might be even more useful than debug mode. Check it out.
I believe this is a bug in crypt() which I've just reported as bug #552917.
I'll keep an eye on that one to see what progress is made. Thanks for your help!
This message is a reminder that Fedora 11 is nearing its end of life.
Approximately 30 (thirty) days from now Fedora will stop maintaining
and issuing updates for Fedora 11. It is Fedora's policy to close all
bug reports from releases that are no longer maintained. At that time
this bug will be closed as WONTFIX if it remains open with a Fedora
'version' of '11'.
Package Maintainer: If you wish for this bug to remain open because you
plan to fix it in a currently maintained version, simply change the 'version'
to a later Fedora version prior to Fedora 11's end of life.
Bug Reporter: Thank you for reporting this issue and we are sorry that
we may not be able to fix it before Fedora 11 is end of life. If you
would still like to see this bug fixed and are able to reproduce it
against a later version of Fedora please change the 'version' of this
bug to the applicable version. If you are unable to change the version,
please add a comment here and someone will do it for you.
Although we aim to fix as many bugs as possible during every release's
lifetime, sometimes those efforts are overtaken by events. Often a
more recent Fedora release includes newer upstream software that fixes
bugs or makes them obsolete.
The process we are following is described here:
Can you confirm this bug on F12 or F13? If yes, please change the Fedora version. Otherwise I'm going to close it.
I can confirm this to be solved in F12
Perfect, thanks. I'm closing this bug then.