Bug 1428936
Summary: | [GSS]Remove-brick operation is slow in a distribute-replicate volume in RHGS 3.1.3 | ||
---|---|---|---|
Product: | [Red Hat Storage] Red Hat Gluster Storage | Reporter: | Cal Calhoun <ccalhoun> |
Component: | distribute | Assignee: | Susant Kumar Palai <spalai> |
Status: | CLOSED ERRATA | QA Contact: | Prasad Desala <tdesala> |
Severity: | medium | Docs Contact: | |
Priority: | unspecified | ||
Version: | rhgs-3.1 | CC: | amukherj, asrivast, bkunal, ccalhoun, nbalacha, olim, omasek, pousley, ravishankar, rcyriac, rhinduja, rhs-bugs, rnalakka, spalai, storage-qa-internal |
Target Milestone: | --- | ||
Target Release: | RHGS 3.3.0 | ||
Hardware: | Unspecified | ||
OS: | Linux | ||
Whiteboard: | dht-rebalance | ||
Fixed In Version: | glusterfs-3.8.4-27 | Doc Type: | If docs needed, set a value |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2017-09-21 04:33:25 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: | |||
Bug Depends On: | |||
Bug Blocks: | 1417145 |
Description
Cal Calhoun
2017-03-03 16:17:04 UTC
Could not reproduce the issue on my test machine. Created an 8*2 volume and created data (dirs and files). Did multiple remove-bricks, but found no errors. Will give it a few more tries.

@ Susant, Do you need any additional information from Cal or the customer which you think might be useful for reproducing?

@ Cal, Can you also try to reproduce the issue with a miniature version of the customer environment?

(In reply to Bipin Kunal from comment #7)
> @ Susant, Do you need any additional information from Cal or customer, which
> you think might be useful for reproducing ?
>
> @ Cal, Can you even try to to reproduce issue with miniature version of
> customer environment.

The problem at hand points to a memory overrun. I went through the rebalance code and could not find any evidence of such a problem. Whether it was caused by some other translator, e.g. AFR, cannot be confirmed from the logs, as they do not point to the translator that caused it. A reproducer would be very helpful here. Some more information would also help:
1- Xattr information on the directories and files.
2- What kind of operations were running in parallel?
-Susant

@ Bipin: I'll try to set up a simplified reproducer tomorrow.
@ Susant: I'll ask the customer to supply the additional information.
-Cal

@ Susant: Can you supply a command that will return the Xattr information you need? I'm not sure exactly what you're looking for. I've asked the customer about what else might have been running in parallel.

Does the customer have hardlinks to his files?

Verified this BZ on glusterfs version 3.8.4-33.el7rhgs.x86_64. Followed the same steps as in Comment 107; the script didn't throw any errors and all the files on the bricks migrated successfully, as expected. Moving this BZ to Verified.

*** Bug 1467495 has been marked as a duplicate of this bug. ***

Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA.
For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report.
https://access.redhat.com/errata/RHBA-2017:2774
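
On the question in the thread about a command that returns the xattr information: running `getfattr -d -m . -e hex <path>` as root against the file or directory on each brick is the usual way to dump extended attributes. The same data, together with the hard link count asked about later in the thread, could also be gathered with a short script. The following is only a minimal sketch and was not posted in this bug. The function names `format_xattr`, `collect_xattrs`, and `report` are hypothetical, the script is Linux-only, and it assumes it is run as root on the brick path, since attributes in the `trusted.` namespace are not readable by unprivileged users.

```python
import os
import sys


def format_xattr(name, value):
    """Render one xattr as name=0x<hex>, similar to hex-encoded getfattr output."""
    return f"{name}=0x{value.hex()}"


def collect_xattrs(path):
    """Return {name: bytes} for every xattr readable on path (Linux only)."""
    result = {}
    try:
        names = os.listxattr(path, follow_symlinks=False)
    except OSError:
        return result  # filesystem without xattr support, or path vanished
    for name in names:
        try:
            result[name] = os.getxattr(path, name, follow_symlinks=False)
        except OSError:
            continue  # attribute removed meanwhile, or not readable by this user
    return result


def report(path):
    """Return printable lines: the path with its hard link count, then its xattrs."""
    st = os.lstat(path)
    lines = [f"# file: {path} (nlink={st.st_nlink})"]
    for name, value in sorted(collect_xattrs(path).items()):
        lines.append(format_xattr(name, value))
    return lines


if __name__ == "__main__":
    # Usage (assumed): python3 xattr_report.py <path-on-brick> [<path-on-brick> ...]
    for p in sys.argv[1:]:
        print("\n".join(report(p)))
```

A link count above 1 in the first line of each report only means the file has more than one name on that brick. Which of those names are user-created hard links would still need to be checked by hand.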