Bug 1293634
Summary: | [georep+tiering]: Geo-Rep sync is poor if master volume is tiered | ||
---|---|---|---|
Product: | [Red Hat Storage] Red Hat Gluster Storage | Reporter: | Rahul Hinduja <rhinduja> |
Component: | geo-replication | Assignee: | Bug Updates Notification Mailing List <rhs-bugs> |
Status: | CLOSED WONTFIX | QA Contact: | storage-qa-internal <storage-qa-internal> |
Severity: | urgent | Docs Contact: | |
Priority: | unspecified | ||
Version: | rhgs-3.1 | CC: | avishwan, csaba, mzywusko, rcyriac, rhs-bugs, rreddy, sankarshan |
Target Milestone: | --- | Keywords: | ZStream |
Target Release: | --- | ||
Hardware: | x86_64 | ||
OS: | Linux | ||
Whiteboard: | tier-interops | ||
Fixed In Version: | Doc Type: | Known Issue | |
Doc Text: |
Sync performance for geo-replicated storage is reduced when the master volume is tiered, resulting in slower geo-rep performance on tiered volumes.
|
Story Points: | --- |
Clone Of: | Environment: | ||
Last Closed: | 2018-04-16 15:55:39 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: | |||
Bug Depends On: | |||
Bug Blocks: | 1268895 |
Description
Rahul Hinduja
2015-12-22 13:40:58 UTC
Performance issues from Geo-rep side: Entry operations handled by Cold Workers(worker belongs to cold bricks) only. Data synced from all the workers. Cold workers will get overloaded and Data sync may fail from Hot workers since entry will be created from Cold workers. Possible fix: Handle Rsync errors and retries effectively (Patch sent to upstream for the same http://review.gluster.org/#/c/12856/) Synchronization between Hot workers and Cold Workers, do not sync files from Hot before entry created on Slave from Cold workers.(Just an idea, design pending) Performance issues due to tiering: Geo-rep uses rsync to sync data from Master Volume to Slave Volume. For a given list of files Rsync will sync data from Master volume mount to Slave Volume mount. Read performance of Tiering may have affected the Sync performance. Hi Laura, You could add an additional sentence "As a consequence, geo-rep performance on tiered volumes is slower than with non tiered volumes". Those basic sentences seem to capture what the customer needs to know. My reading of Aravinda's summary from comment #4, is that incorporating additional low level engineering details into the release notes would not help the customer. You could consider gathering some more information from the geo-rep team, etc. 1. how much slower is it? Does the degradation get worse depending on the hot/cold volume type or number of sub volumes? 2. does the degradation ever become significant enough to make geo-rep unusable? Per discussion with Milind, changing component to geo-rep. This should be tested with the latest patches in the release-3.8 branch. (see below). >> I spoke with Aravinda regarding tiering + georep performance issues. >> He said that some patches have been merged upstream to mitigate the >> performance drop seen for tiered volumes. He insisted on getting the >> performance benchmarked *before* any additional enhancements are >> attempted. >> >> Having said this, he still has one recommendation: to synchronize hot >> and cold tier georep worker processes w.r.t. entry creation by cold >> tier worker followed by data sync by hot tier worker. This could be >> attempted if the latest performance numbers seem unacceptable to QE. > > Should this be moved to the geo-rep group? yes, you could move this to the geo-rep group with a comment to test the performance with the latest patches on release-3.8 branch |