| Summary: | Default NFS settings kernel cause high load and low throughput on random write loads |
|---|---|
| Product: | [Fedora] Fedora |
| Reporter: | D.S. Ljungmark <spider> |
| Component: | kernel |
| Assignee: | nfs-maint |
| Status: | CLOSED INSUFFICIENT_DATA |
| QA Contact: | Fedora Extras Quality Assurance <extras-qa> |
| Severity: | medium |
| Priority: | unspecified |
| Version: | 23 |
| CC: | bfields, gansalmon, itamar, jonathan, kernel-maint, madhu.chinakonda, mchehab, spider |
| Flags: | spider: needinfo- |
| Hardware: | x86_64 |
| OS: | Linux |
| Doc Type: | Bug Fix |
| Last Closed: | 2016-10-26 16:51:15 UTC |
| Type: | Bug |

Description
D.S. Ljungmark
2016-04-21 08:10:11 UTC
Perhaps the NFS maintainers will have more insight, but I suspect the answer will be "it's tunable from userspace so tune it to your workload because no default will ever be 100% correct for everyone."

Josh Boyer said:
> I suspect the answer will be "it's tunable from userspace so tune it to your
> workload because no default will ever be 100% correct for everyone."

I wonder why rsize/wsize makes a difference at all in this case, since the client's likely not able to make >4k writes anyway. I'm more inclined to accept this as a bug. Given that it's easily reproducible on a variety of versions, I bet it's been noticed before, but I don't recall any discussion off the top of my head.

> "Reproduced on CentOS 7 host, Fedora 22 host (4.4.5 kernel)
> Fedora 23 clients, Fedora 22 clients, CentOS6 and 7 clients"

Sorry, by "host" here, do you mean "NFS server"?

Yes, sorry about the confusion of terms there. I'm attaching some logs and results from two CentOS machines (sadly, I don't have the results from the Fedora ones on hand), which will probably help. In the logs, Acorn is the client and Omap is the server.

Created attachment 1149609 [details]
fio result, 8k wsize
Created attachment 1149610 [details]
fio result, 1M wsize
Created attachment 1149611 [details]
nfs server dstat, 8k wsize
Created attachment 1149612 [details]
nfs server dstat, 1M wsize
Created attachment 1149613 [details]
nfs client dstat, 8k wsize
Created attachment 1149616 [details]
nfs client dstat, 1M wsize
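
For anyone repeating the comparison in these attachments, it may help to confirm which rsize and wsize a client mount is actually using, since the values in effect can differ from the ones requested. The following is a minimal sketch, assuming a Linux client where `/proc/mounts` lists NFS mounts with `rsize=` and `wsize=` options. The function name and the sample line (server name, export path, mount point) are hypothetical and are not taken from this bug.

```python
def nfs_transfer_sizes(mounts_text):
    """Return {mountpoint: (rsize, wsize)} for NFS mounts.

    Expects text in /proc/mounts format:
    device mountpoint fstype options dump pass
    """
    sizes = {}
    for line in mounts_text.splitlines():
        fields = line.split()
        # Skip malformed lines and anything that is not an NFS mount.
        if len(fields) < 4 or fields[2] not in ("nfs", "nfs4"):
            continue
        # Keep only key=value options, e.g. "rsize=8192".
        opts = dict(
            opt.split("=", 1) for opt in fields[3].split(",") if "=" in opt
        )
        if "rsize" in opts and "wsize" in opts:
            sizes[fields[1]] = (int(opts["rsize"]), int(opts["wsize"]))
    return sizes


# Hypothetical sample input; on a real client, read /proc/mounts instead.
SAMPLE_MOUNTS = (
    "/dev/sda1 / ext4 rw,relatime 0 0\n"
    "server:/export /mnt/nfs nfs4 "
    "rw,relatime,vers=4.0,rsize=8192,wsize=8192,hard 0 0\n"
)

print(nfs_transfer_sizes(SAMPLE_MOUNTS))
```

On a real client, passing the contents of `/proc/mounts` in place of `SAMPLE_MOUNTS` should show whether the 8k and 1M runs were really mounted with the intended sizes.
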
Oh, both wsize and rsize were the same; sorry for not mentioning that in the attachments, my bad.

I don't really have time to troubleshoot this right now; I'm just trying to think of things you might try. It might be interesting to look at /proc/self/mountstats (see also the "mountstats" command) and compare the RPC statistics for the "good" (8K w/rsize) vs "bad" (1M w/rsize) cases.

*********** MASS BUG UPDATE **************

We apologize for the inconvenience. There is a large number of bugs to go through and several of them have gone stale. Due to this, we are doing a mass bug update across all of the Fedora 23 kernel bugs.

Fedora 23 has now been rebased to 4.7.4-100.fc23. Please test this kernel update (or newer) and let us know if your issue has been resolved or if it is still present with the newer kernel.

If you have moved on to Fedora 24 or 25, and are still experiencing this issue, please change the version to Fedora 24 or 25.

If you experience different issues, please open a new bug report for those.

*********** MASS BUG UPDATE **************

This bug is being closed with INSUFFICIENT_DATA as there has not been a response in 4 weeks. If you are still experiencing this issue, please reopen and attach the relevant data from the latest kernel you are running and any data that might have been requested previously.
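
The suggestion above to compare the RPC statistics in /proc/self/mountstats for the 8K and 1M cases could be sketched roughly as follows. This is only an illustration, not something posted in the bug. It assumes the per-op counters appear in the order used by the nfs-utils `mountstats` tool (operations, transmissions, major timeouts, bytes sent, bytes received, cumulative queue time, cumulative round-trip time, cumulative execute time, with times in milliseconds). The function names are hypothetical, and the sample numbers are invented for the example and are not measurements from this report.

```python
def parse_per_op_stats(mountstats_text):
    """Return {mountpoint: {op_name: [counters...]}} for NFS mounts."""
    stats = {}
    mountpoint = None
    in_per_op = False
    for line in mountstats_text.splitlines():
        words = line.split()
        if not words:
            continue
        if words[0] == "device":
            # "device <dev> mounted on <mountpoint> with fstype <type> ..."
            is_nfs = (
                "fstype" in words[:-1]
                and words[words.index("fstype") + 1].startswith("nfs")
            )
            mountpoint = words[4] if is_nfs else None
            in_per_op = False
        elif mountpoint and line.strip() == "per-op statistics":
            in_per_op = True
            stats[mountpoint] = {}
        elif mountpoint and in_per_op and words[0].endswith(":"):
            # e.g. "WRITE: 200 200 0 1664000 27200 400 1000 1600"
            stats[mountpoint][words[0].rstrip(":")] = [int(w) for w in words[1:]]
    return stats


def summarize(counters):
    """Per-operation averages from one per-op line, or None if no ops ran."""
    # Assumed field order; newer kernels may append extra fields, ignored here.
    ops, _trans, _timeouts, sent, _recv, _queue_ms, rtt_ms, exe_ms = counters[:8]
    if ops == 0:
        return None
    return {
        "ops": ops,
        "avg_bytes_sent": sent / ops,
        "avg_rtt_ms": rtt_ms / ops,
        "avg_exe_ms": exe_ms / ops,
    }


# Invented sample in the mountstats layout; on a real client, read
# /proc/self/mountstats once per test run and compare the WRITE summaries.
SAMPLE = """\
device server:/export mounted on /mnt/nfs with fstype nfs4 statvers=1.1
\tper-op statistics
\t        NULL: 0 0 0 0 0 0 0 0
\t       WRITE: 200 200 0 1664000 27200 400 1000 1600
"""

write_counters = parse_per_op_stats(SAMPLE)["/mnt/nfs"]["WRITE"]
print(summarize(write_counters))
```

Running this against a snapshot taken after each fio run and comparing the WRITE averages (bytes per request, round-trip time, execute time) would be one way to see where the two configurations diverge.
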