Bug 1581502
| Summary: | PID file never created or disappears | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Product: | Red Hat Enterprise Linux 7 | Reporter: | Ian Kent <ikent> | ||||||
| Component: | autofs | Assignee: | Ian Kent <ikent> | ||||||
| Status: | CLOSED ERRATA | QA Contact: | Kun Wang <kunwan> | ||||||
| Severity: | medium | Docs Contact: | |||||||
| Priority: | unspecified | ||||||||
| Version: | 7.5 | CC: | extras-qa, ikent, trailtotale, wattersm, xifeng, xzhou, zbyszek | ||||||
| Target Milestone: | rc | ||||||||
| Target Release: | --- | ||||||||
| Hardware: | All | ||||||||
| OS: | Linux | ||||||||
| Whiteboard: | |||||||||
| Fixed In Version: | autofs-5.0.7-92.el7 | Doc Type: | If docs needed, set a value | ||||||
| Doc Text: | Story Points: | --- | |||||||
| Clone Of: | 1552216 | Environment: | |||||||
| Last Closed: | 2018-10-30 11:41:29 UTC | Type: | Bug | ||||||
| Regression: | --- | Mount Type: | --- | ||||||
| Documentation: | --- | CRM: | |||||||
| Verified Versions: | Category: | --- | |||||||
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||||
| Cloudforms Team: | --- | Target Upstream Version: | |||||||
| Embargoed: | |||||||||
| Bug Depends On: | 1552216 | ||||||||
| Bug Blocks: | |||||||||
| Attachments: |
|
||||||||
|
Description
Ian Kent
2018-05-22 23:59:09 UTC
As with the Fedora hosts, some CentOS hosts sometimes work and other times don't. I had one particular host that was very stubborn, so I started investigating more deeply there. It feels very much like the exact same conditions (automount running, but PID file isn't present) so I took a wild stab and added 'rpc-statd.service rpcbind.service' to the After= of autofs.service on that host and upon rebooting, it worked the first try. Not rock solid proof of a resolution, but given the similarities I've already seen, I suspect the same fix applies here equally. I can continue trying the same trick if others get stubborn. (In reply to John Florian from comment #2) > As with the Fedora hosts, some CentOS hosts sometimes work and other times > don't. I had one particular host that was very stubborn, so I started > investigating more deeply there. It feels very much like the exact same > conditions (automount running, but PID file isn't present) so I took a wild > stab and added 'rpc-statd.service rpcbind.service' to the After= of > autofs.service on that host and upon rebooting, it worked the first try. > Not rock solid proof of a resolution, but given the similarities I've > already seen, I suspect the same fix applies here equally. I can continue > trying the same trick if others get stubborn. Yes, that's what I thought from our investigation. I did the systemd change as well because that was recommended as what should be done from a systemd POV. The unit change should be a straight forward workaround while waiting for an update. I'm pretty sure the restart gets triggered because the NFS client itself starts a service (using systemctl) on first mount if it isn't already running. The unit change just ensures the service is already running. That doesn't help with the race that was described which the systemd change is meant to handle, the more general solution. Ian Created attachment 1441025 [details]
Path - add units After line to include statd service
Created attachment 1441026 [details]
Patch - use systemd sd_notify() at startup
I had some difficulty back porting this to autofs-5.0.7. Could you check the build resolves the problem please. The build can be found at: http://people.redhat.com/~ikent/autofs-5.0.7-92.el7/ Ian, sorry for the slow response. I needed to wait for the dust to settle around my upgrades to Fedora 28. I installed your autofs-5.0.7-92.el7.x86_64 onto 11 different CentOS hosts, 9 of which were VMs while the remaining 2 were physical. I rebooted all 11 and not one showed the misbehavior and automounting is working on all w/o any manual intervention. I'd say it looks good! (In reply to John Florian from comment #7) > Ian, sorry for the slow response. I needed to wait for the dust to settle > around my upgrades to Fedora 28. > > I installed your autofs-5.0.7-92.el7.x86_64 onto 11 different CentOS hosts, > 9 of which were VMs while the remaining 2 were physical. I rebooted all 11 > and not one showed the misbehavior and automounting is working on all w/o > any manual intervention. I'd say it looks good! Thanks for that. Ian Since this patch changed uses '--foreground --dont-check-daemon' instead of original '--pid-file /run/autofs.pid' as the option. autofs.pid witch in a condition might trigger this bug no longer exists. And the original issue is hard to reproduce here, so I decided to check the new option and the appearance of '/run/autofs.pid' in our test case. Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2018:3283 |