Bug 1331764
Summary: | OSDs are not selected properly while reweight-by-utilization | ||
---|---|---|---|
Product: | [Red Hat Storage] Red Hat Ceph Storage | Reporter: | Tanay Ganguly <tganguly> |
Component: | RADOS | Assignee: | Sage Weil <sweil> |
Status: | CLOSED ERRATA | QA Contact: | shylesh <shmohan> |
Severity: | high | Docs Contact: | Bara Ancincova <bancinco> |
Priority: | unspecified | ||
Version: | 1.3.2 | CC: | ceph-eng-bugs, dzafman, flucifre, hnallurv, kchai, kdreyer, kurs, nlevine, sjust, sweil |
Target Milestone: | rc | ||
Target Release: | 1.3.3 | ||
Hardware: | x86_64 | ||
OS: | Linux | ||
Fixed In Version: | RHEL: ceph-0.94.7-5.el7cp Ubuntu: ceph_0.94.7-3redhat1trusty | Doc Type: | Bug Fix |
Doc Text: |
.OSDs are now selected properly during "reweight-by-utilization"
During the `reweight-by-utilization` process, some of the OSD nodes that met the criteria for reweighting were not selected. The underlying algorithm has been modified, and OSDs are now selected properly during `reweight-by-utilization`.
|
Last Closed: | 2016-09-29 12:58:06 UTC | Type: | Bug |
Bug Blocks: | 1372735 |
Description
Tanay Ganguly
2016-04-29 13:09:51 UTC
This probably should not hold up 1.3.2 -- an advisory to the user would be the right thing.

Sam, what advisory will be given to the user in this case? Please share the details.

I think Sage would be the right person to ask. Maybe that the user should make sure to use the test_ option first and verify that the behavior is OK?

Hi Sage,
Please let us know what advisory will be given to the user in this case.
Regards,
Harish

Right. The user should

    ceph osd test-reweight-by-utilization ...

or

    ceph osd test-reweight-by-pg ...

prior to doing the non-test- variant to confirm that nothing drastic will happen.

They should also use small max_weight values. E.g.,

    ceph osd test-reweight-by-utilization 120 .05 10

to update at most 10 osds with at most a change of .05.

Later, when we have backported this fix, the low-weight osds can be weighted up. If they can wait for that, they should, but if not, it's no big deal--just a bit more data movement.

(In reply to Sage Weil from comment #7)
> Right. The user should
>
> ceph osd test-reweight-by-utilization ...
> or
> ceph osd test-reweight-by-pg ...
>
> prior to doing the non-test- variant to confirm that nothing drastic will
> happen.
>
> They should also use small max_weight values. E.g.,
>
> ceph osd test-reweight-by-utilization 120 .05 10
>
> to update at most 10 osds with at most a change of .05.
>
> Later, when we have backported this fix,

Does "this fix" here refer to the fix for BZ 1331764, BZ 1331784, or both? Can you please confirm?

> the low-weight osds can be weighted
> up. If they can wait for that, they should, but if not, it's no big
> deal--just a bit more data movement.

The fix is the same for both BZs.

Thanks!

https://github.com/ceph/ceph/pull/9416 was merged to hammer after v0.94.7 was tagged, so this bug is fixed in v0.94.8 upstream.

With the introduction of the new algorithm, OSDs are chosen based on their distance from the average utilization, i.e., the greater the distance from the average, the greater the chance of being selected.
Hence marking this as verified.

Verified on 0.94.9-1.el7cp.x86_64

Looks good to me. Thanks, Bara!

Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA.

For information on the advisory, and where to find the updated files, follow the link below.

If the solution does not work for you, open a new bug report.

https://rhn.redhat.com/errata/RHSA-2016-1972.html
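
For readers who want a concrete picture of the selection behavior described in the verification comment (OSDs further from the average utilization are picked first), here is a minimal Python sketch. It is not the Ceph implementation: the `Osd` class, the `plan_reweight` function, and the exact weight formula are hypothetical and assumed for illustration only. The three parameters mirror the `120 .05 10` arguments from the advisory (overload threshold in percent, maximum weight change, maximum number of OSDs), and only overloaded OSDs are considered to keep the sketch short.

```python
from dataclasses import dataclass


@dataclass
class Osd:
    osd_id: int
    utilization: float  # fraction of capacity used, e.g. 0.72
    weight: float       # current reweight value, between 0.0 and 1.0


def plan_reweight(osds, oload=120, max_change=0.05, max_osds=10):
    """Return {osd_id: new_weight} for the overloaded OSDs that would change.

    Illustrative assumption, not Ceph code: an OSD is a candidate when its
    utilization exceeds `oload` percent of the cluster average, and
    candidates are handled in order of distance from that average.
    """
    if not osds:
        return {}
    average = sum(o.utilization for o in osds) / len(osds)
    if average == 0:
        return {}
    threshold = average * oload / 100.0

    # Keep only overloaded OSDs, furthest above the average first.
    candidates = [o for o in osds if o.utilization > threshold]
    candidates.sort(key=lambda o: o.utilization - average, reverse=True)

    plan = {}
    for osd in candidates[:max_osds]:
        # Scale the weight toward the average, but never lower it by more
        # than max_change in a single pass.
        target = osd.weight * average / osd.utilization
        plan[osd.osd_id] = round(max(target, osd.weight - max_change), 4)
    return plan


if __name__ == "__main__":
    cluster = [Osd(0, 0.5, 1.0), Osd(1, 0.5, 1.0), Osd(2, 0.5, 1.0), Osd(3, 0.9, 1.0)]
    print(plan_reweight(cluster))
```

Capping each adjustment at `max_change` and limiting the pass to `max_osds` reflects the advice in the thread to keep changes small; on a real cluster the `test-` variants of the commands remain the way to preview what will actually happen.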