Bug 38392
Summary: | kernel: nfs: task 264936 can't get a request slot - after upgrading to 2.2.19 / nfs-utils-0.3.1-0.6.x.1 | ||
---|---|---|---|
Product: | [Retired] Red Hat Linux | Reporter: | Bruce Garlock <bruce> |
Component: | nfs-utils | Assignee: | Pete Zaitcev <zaitcev> |
Status: | CLOSED WORKSFORME | QA Contact: | David Lawrence <dkl> |
Severity: | medium | Docs Contact: | |
Priority: | medium | ||
Version: | 6.2 | CC: | bruce, magnus.moren, oabundes |
Target Milestone: | --- | ||
Target Release: | --- | ||
Hardware: | i386 | ||
OS: | Linux | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | Bug Fix | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2002-11-12 20:47:57 UTC | Type: | --- |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Description
Bruce Garlock
2001-04-30 13:40:39 UTC
FYI, the "can't get a request slot" error is usually the result of a network congestion/misconfiguration or a badly behaving network driver. What make/model of network card do you use? I'd suggest trying a different one to see if the problem goes away. The network card is built into the server (IBM Netfinity-5000). It uses the pcnet32 module. Since the only thing I did was upgrade the kernel, and any other dependencies, I would suspect the pcnet32 module, since my network has not changed in months, and was working fine before the kernel upgrade. I guess I could test this next weekend by booting back into 2.2.16, and see if it clears up this issue. Any other ideas? Another thing to consider... Are you mounting NFS v2 or v3? Whatever you're currently using, try the other one, provided the SCO server supports v3. And, for the record, since the linux machine is acting as the 'client' in this case, nfs-utils is most likely not involved (since it is primarily the 'server' nfs component), though it does handle client-side file locking... which you could try turning off to see if it helps: /etc/rc.d/init.d/nfslock stop I turned off nfslock, and still got the errors during my overnight backups. I'm waiting to hear a response I posted to comp.unix.sco.misc about which version of NFS server is running on SCO Openserver 5.0.4. Maybe someone from that newsgroup will be able to help with this problem. Another question: should I change any mount options? Right now they all look like this: Server:/usr/covalent/users/miked /opt/work/users/miked/covalent nfs rsize=8192,wsize=8192,hard,intr 0 0 Maybe the linux client does not like this anymore? Just FYI: as of 2.2.19, the kernel starts lockd automatically as needed, so this is probably not related to your problem. For the time being, I am excluding all NFS mounts from my backup. 'sar' on the SCO box reports heavy system usage during the time of the backup (much more than normal usage during the same time before the 2.2.19 upgrade), and the last thing I need is a page at 0300. I'll write up a quick script that tar/bzip's the directories I need, and ftp it over to the linux box. Hopefully I'll get a chance to boot back in to 2.2.16 this weekend and see if the problem persists, but I really think this is related to the 2.2.19 upgrade. I've had similar problems with Red Hat 7.1 running the 2.4.2-2 kernel. When I mount a remote system from a Solaris 2.8 box onto my Linux laptop, I'm able to create files and I'm to read them, but if I try to copy them from the remote fs to the local fs the process freezes and never completes; as a matter of fact then the mounted fs has to be unmount and remouted to get access again. I hope they fix this problems soon, it'd be nice if they did more testing before releasing a new version of the OS. I'm getting the same problem with a RH 7.3, kernel 2.4.18-3smp, NFS client of a Solaris 8 2/02 server: ..... Nov 11 09:35:14 hcidaldelws70 kernel: nfs: task 43977 can't get a request slot ..... Possible culprits I've come up with from researching the problem: 1. Network congestion.- However, our network load has not changed. The RH machine is just another NFS client in our landscape doing the same thing as all the other ones (all Solaris though). 2. Busy servers. We have two and they're not. Once clients automount data directories, NFS traffic is pretty low. 3. Bad NIC. It's a possibility but this is a brand new machine. I've already eliminated the switch and cabling to the wiring closet by connecting it to the same port where a Solaris 8 client was happily working. Someone suggested to get the latest kernel available via up2date. Does anyone know if the latest kernel has NFS fixes? The bad news is that the way errors are propagated by sunrpc stack, "cannot get request slot" error may be caused by a thousand of different reasons. The 2.4.18-17 can fix the problem, or perhaps not. I think it is worth to download and try, it's a better NFS than it was. Also, victims of early Intel e100's get helped by newer driver (it has nothing to do with NFS by itself, but NFS is more sensitive than, say, FTP). Oh, and BTW, I just noticed we are raping a 6.2 bug report. Please file new bugs for kernel 2.4 based clients, it's a little different code base. Yes, I realized this bug referred to RH 6.2 but I decided to comment because I saw no resolution posted. My apologies. I'll file a report for 7.3. Thank you. BTW, I will load the latest kernel and test to see if the problem goes away. Oscar says 2.4.18-17 works for him. I am closing this one, because we only do security fixes for 6.2 by now. Don't hesitate to file new bugs against contemporary releases. |