Bug 1443057
| Summary: | Metricd can lose coordination group and lose capacity | |||
|---|---|---|---|---|
| Product: | Red Hat OpenStack | Reporter: | Alex Krzos <akrzos> | |
| Component: | openstack-gnocchi | Assignee: | Mehdi ABAAKOUK <mabaakou> | |
| Status: | CLOSED ERRATA | QA Contact: | Sasha Smolyak <ssmolyak> | |
| Severity: | unspecified | Docs Contact: | ||
| Priority: | unspecified | |||
| Version: | 10.0 (Newton) | CC: | apevec, jdanjou, jschluet, lhh, mabaakou, nshetty, pkilambi, vaggarwa | |
| Target Milestone: | --- | Keywords: | Triaged, ZStream | |
| Target Release: | 10.0 (Newton) | |||
| Hardware: | Unspecified | |||
| OS: | Unspecified | |||
| Whiteboard: | scale_lab | |||
| Fixed In Version: | openstack-gnocchi-3.0.8-1.el7ost | Doc Type: | If docs needed, set a value | |
| Doc Text: | Story Points: | --- | ||
| Clone Of: | ||||
| : | 1454842 (view as bug list) | Environment: | ||
| Last Closed: | 2017-07-12 14:07:53 UTC | Type: | Bug | |
| Regression: | --- | Mount Type: | --- | |
| Documentation: | --- | CRM: | ||
| Verified Versions: | Category: | --- | ||
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | ||
| Cloudforms Team: | --- | Target Upstream Version: | ||
| Embargoed: | ||||
| Bug Depends On: | ||||
| Bug Blocks: | 1454842 | |||
|
Description
Alex Krzos
2017-04-18 12:01:38 UTC
(In reply to Alex Krzos from comment #0) > Description of problem: > Gnocchi Metricd can "lose"(perhaps it attempts to its coordination group *Meant* Gnocchi Metricd can "lose"(perhaps it attempts to "re-coordinate" regularly?) its coordination group and when it does so, two controllers out of three will have Gnocchi schedulers sharing the same block. Fortunately, the error message "Error getting block to work on, defaulting to first" shallow the root cause. Also if such case we didn't retry. I have proposed https://review.openstack.org/457702 to unshallow the root error and to retry later to get correct blocks in such case. This has been merged and released as part or Gnocchi 3.0.8. tooz is fixed. Get exception from redis when it's killed, as expected. Then it reconnects. Verifying it for now Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2017:1748 |