Bug 1583839
| Summary: | bluestore osd_memory_target not holding | ||
|---|---|---|---|
| Product: | [Red Hat Storage] Red Hat Ceph Storage | Reporter: | Ben England <bengland> |
| Component: | Distribution | Assignee: | Andrew Schoen <aschoen> |
| Status: | CLOSED CURRENTRELEASE | QA Contact: | Tejas <tchandra> |
| Severity: | high | Docs Contact: | |
| Priority: | high | ||
| Version: | 3.2 | CC: | acalhoun, adeza, aschoen, assingh, ceph-eng-bugs, flucifre, gfidente, gsitlani, jdurgin, johfulto, kbader, kdreyer, mnelson, nthomas, pasik, wredtex |
| Target Milestone: | z2 | Keywords: | FutureFeature, Tracking |
| Target Release: | 3.3 | ||
| Hardware: | Unspecified | ||
| OS: | Unspecified | ||
| Whiteboard: | |||
| Fixed In Version: | Doc Type: | If docs needed, set a value | |
| Doc Text: | Story Points: | --- | |
| Clone Of: | Environment: | ||
| Last Closed: | 2019-10-01 16:38:15 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | Category: | --- | |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | |||
| Bug Depends On: | |||
| Bug Blocks: | 1578730 | ||
|
Description
Ben England
2018-05-29 21:13:27 UTC
Alex Calhoun has reproduced user problems with getting bluestore to outperform Filestore on all-flash config. -- I cc'ed him. He is currently working on reproducing the Micron results using smallest possible subset of Mark's tuning. NVMe SSD results: https://mojo.redhat.com/docs/DOC-1189708 HDD results: TBD observations: -Bluestore auto-tuning --NVMe SSD resident memory usage ranged between 15% to 45% of totally memory usage, or 4.5GB to 14.5GB per OSD (8 osds per host), When manually setting the memory to 50% of total available memory or 16GB per OSD, performance had a ~1.7% gain. Based on these results it seemed unnecessary to modify the auto-tuning parameters in order to achieve better results. --HDDs resident memory usage ranged between 56% to 65% of totally memory usage or 7GB to 6GB per OSD (24 osds per host). With both configurations memory appeared to fluctuate in response to the load although in the case of HDDs (24 OSD per host) variability seemed to be low, fluctuating between 1-2 GB where as with SSDs, with few OSDs per host(8), fluctuation was between 7-14 GB --on filestore it appears as if resident memory usage ranged from ~500MB to ~2GB per OSD on HDDs and ~200MB to ~900MB on SSD. --We believe that the osd_memory_target was set to 4GB, if osd_memory_target is to be treated as a ceiling, it is not functioning accordingly based on these results. An investigation targeted specifically on the osd_memory_target and actual osd memory usage maybe need as a follow-on experiment -RAM/OSD? --SSD ranged from ~4.5GB to ~14.5GB --HDD rhanged from ~6GB to ~7GB -RocksDB partition size? --testing utilized a osd scenario of LVM, sizing was automatic and seemed to work with Rocksdb sizing. --rocksdb was partitioned on both NVMe SSD’s equally. Had 24 HDDs and 12 “ceph-block-dbs” were partitioned on each SSD. -Micron tunings --NVMe SSD testing shows that Micron tunings, and subset configurations, only provided a performances gain of ~3%. With the amount of parameters added and the minimal performance impact, its suggested that these specific tunables are not necessary. I believe RocksDB spillover issue is resolved in case where user specifies the simplest scenario (just osd_scenario: lvm + a list of devices). osd_memory_target must be a firm limit under all conditions. Without this, we can't set reasonable limits for containerized OSD memory size in a HCI cluster, and we cannot easily co-locate OSDs with other Ceph daemons or applications. So I'm changing the bz name to be more specific. Once this issue is resolved, we can close the bz. Updating the QA Contact to a Hemant. Hemant will be rerouting them to the appropriate QE Associate. Regards, Giri Updating the QA Contact to a Hemant. Hemant will be rerouting them to the appropriate QE Associate. Regards, Giri close this bug please, it is stale, have not heard of reported problems with osd_memory_target of late in Nautilus or even later versions of Luminous, and there is another bz 1637153 that covers the problem of extreme random write workloads with Luminous. Am not concerned about Filestore at this point. |