Bug 847084
Summary: | nfsidmap failing to show id of some users when using LXDM greeter user list | ||||||
---|---|---|---|---|---|---|---|
Product: | [Fedora] Fedora | Reporter: | Josh Trutwin <jtrutwin> | ||||
Component: | nfs-utils | Assignee: | Steve Dickson <steved> | ||||
Status: | CLOSED DUPLICATE | QA Contact: | Fedora Extras Quality Assurance <extras-qa> | ||||
Severity: | high | Docs Contact: | |||||
Priority: | unspecified | ||||||
Version: | 17 | CC: | bfields, jlayton, luca.giuzzi, paolini, steved | ||||
Target Milestone: | --- | ||||||
Target Release: | --- | ||||||
Hardware: | x86_64 | ||||||
OS: | Linux | ||||||
Whiteboard: | |||||||
Fixed In Version: | Doc Type: | Bug Fix | |||||
Doc Text: | Story Points: | --- | |||||
Clone Of: | Environment: | ||||||
Last Closed: | 2012-11-08 16:17:25 UTC | Type: | Bug | ||||
Regression: | --- | Mount Type: | --- | ||||
Documentation: | --- | CRM: | |||||
Verified Versions: | Category: | --- | |||||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||
Cloudforms Team: | --- | Target Upstream Version: | |||||
Embargoed: | |||||||
Attachments: |
|
Description
Josh Trutwin
2012-08-09 16:14:26 UTC
Created attachment 603302 [details]
sos report
SOS Report for Fedora 17 client.
Forgot to mention that the existing RHEL 6.3 NFS server is showing no idmap issues with our 80+ Fedora 14 clients. It's only the Fedora 17 system that is having problems right now. Ok wow so I just figured out what the culprit is here. It's lxdm. I switched from GDM to LXDM so I could have XFCE be the default window manager (another separate Fedora bug if you ask me) and when LXDM displays the username chooser on boot up it runs a large number of nfsidmap lookups - but apparently not enough. By disabling the userlist in /etc/lxdm/lxdm.conf and rebooting all my nfsidmap lookups are working normally. Still a bug somewhere - but easier to replicate hopefully: 1. setup an nfs4 / nis environment 2. install lxdm 3. set lxdm as displaymanager vi /etc/sysconfig/desktop or other means 4. reboot When lxdm displays the username chooser for users who have previously logged into system, check for bad nfsidmaps in home directories, etc. Change in /etc/lxdm/lxdm.conf: [userlist] ## if disable the user list control at greeter disable=1 Reboot nfsidmap lookups should behave normally. Thanks, Josh Huh. I wonder what lxdm does that's triggering the problem. Would it be possible to get a network trace showing the nfs traffic while this is going on? So, at step 4, start a tcpdump: tcpdump -s0 -wtmp.pcap 'host myclient && host myserver' Then kill tcpdump after the problem reproduces. You can attach the resulting tmp.pcap and/or look at it yourself in wireshark. What we're looking for is replies to GETATTR calls which request the OWNER or OWNER_GROUP attributes. They should all look like name@domain. If they all look right, then that confirms the problem is with the client-side idmapping. You should also be able to add the "-v" (or "-vvvv" if necessary") option to the nfsidmap commandline in /etc/request-key.conf to get some more debugging. Sorry for delayed reply - been working on this image for a while and haven't been able to get to your request. I have a question though - the problem happens during boot up when the initial LXDM screen is shown - I assume I'd need to put this tcpdump command into an RC script somewhere? Alas, I had a user report that he was seeing these funky uid/gid's on the system on some files despite the fix I put in so there may be more going on here... Thanks, Josh (In reply to comment #5) > Sorry for delayed reply - been working on this image for a while and haven't > been able to get to your request. > > I have a question though - the problem happens during boot up when the > initial LXDM screen is shown - I assume I'd need to put this tcpdump command > into an RC script somewhere? Hm, could be, I'm not sure where exactly to suggest. Note that will produce huge amounts of data (it writes all network traffic to tmp.pcap), so it's not something you want running all the time. Best would be if you can start tcpdump and then LXDM by hand, and then stop tcpdump as soon as you've seen the problem reproduced. > Alas, I had a user report that he was seeing these funky uid/gid's on the > system on some files despite the fix I put in so there may be more going on > here... > > Thanks, > > Josh We have a similar problem with a fedora 17 client (nfs-utils-1.2.6-3) and a fedora 16 nfs server. It seems that the problem appears for a few user when a large number of nfs id lookups is requested in a short time, e.g. when on the client I # ls -l /home/misc/dmfmail/ This path contains a nfs mounted folder with the inbox of all users (approx 200). For a few users an uid of 4294967294 is shown. Apparently for *exactly* those users there isn't an entry in /proc/keys, whereas for all the others I see an entry in /proc/keys I hope this helps. I also tried the nfs-utils-1.2.6-5 patch in the updates-testing repository which did not solve the problem. There are two facts that seem strange to me: 1. A typical entry of /proc/keys in a Fedora 16 looks like: 1e43f4d4 I--Q--- 1 8m 3f010000 0 0 id_legacy uid:user.unicatt.it: 4 with an expiry that is less than 10m (consistent with the default 600 seconds expiration time) whereas on a Fedora 17 I see: 3fad2d7c I--Q--- 1 perm 3f010000 0 0 id_resolv uid:user.unicatt.it: 4 which seems to be *permanent* 2. cat /proc/key-users gives $ cat /proc/key-users 0: 204 203/203 199/200 6571/20000 [...] which seems to imply that there is a maximum of 200 keys that can be allocates. It might be possible to try and increase the number of available keys: echo 10000 > /proc/sys/kernel/keys/maxkeys echo 10000 > /proc/sys/kernel/keys/root_maxkeys Does this solve anything? This solves my problem! Indeed we had just about 200 users which clearly caused the creation of a number of keys larger than the allowed maximum. However I think that this should only be considered a workaround, and that this kind of keys should *not* contribute to the quota. Moreover they perhaps should not be "permanent", since it is possible that the mapping will change on the nfs server when users are removed/created. Why *not a bug*? It seems to me that there *is* something wrong, if not with the software, then with some default setting... How is a system administrator supposed to realize that this strange behaviour of nfs is due to some too small kernel quota somewhere? I agree with Maurizio... At a minimum something should be logged about the key limit being exceeded. (In reply to comment #12) > Why *not a bug*? It seems to me that there *is* something wrong, if not with > the software, then with some default setting... How is a system administrator > supposed to realize that this strange behaviour of nfs is due to some too > small kernel quota somewhere? I closed it as not a bug in the actual id mapping code. I agree the maxkeys/ root_maxkeys are set too small and will deal with it in bz876705 *** This bug has been marked as a duplicate of bug 876705 *** |