Bug 1236997
| Summary: | [New] - Volume status is shown incorrect when glusterd is down on one of the node. | ||
|---|---|---|---|
| Product: | [Red Hat Storage] Red Hat Gluster Storage | Reporter: | RamaKasturi <knarra> |
| Component: | nagios-server-addons | Assignee: | Ramesh N <rnachimu> |
| Status: | CLOSED WONTFIX | QA Contact: | RHS-C QE <rhsc-qe-bugs> |
| Severity: | medium | Docs Contact: | |
| Priority: | medium | ||
| Version: | rhgs-3.1 | CC: | asriram, knarra, mlawrenc, rnachimu, sabose, sankarshan, shtripat |
| Target Milestone: | --- | Keywords: | ZStream |
| Target Release: | --- | ||
| Hardware: | Unspecified | ||
| OS: | Unspecified | ||
| Whiteboard: | |||
| Fixed In Version: | Doc Type: | Known Issue | |
| Doc Text: |
Bricks with an 'UNKNOWN' status are not considered as DOWN when volume status is calculated. When the glusterd service is down in one node, brick status changes to 'UNKNOWN' while the volume status remains 'OK'. You may think the volume is up and running when bricks may not be running. You are not able to detect the correct status.
Workaround:
You are notified when glusterd is down and when bricks are in an 'UNKNOWN' state.
|
Story Points: | --- |
| Clone Of: | Environment: | ||
| Last Closed: | 2016-04-13 06:30:11 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | Category: | --- | |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | |||
| Bug Depends On: | |||
| Bug Blocks: | 1216951 | ||
|
Description
RamaKasturi
2015-06-30 07:06:10 UTC
I think this is expected behaviour. Even if glusterd is down on one of the nodes, the bricks are still online and accessible. Till the time, the bricks are marked down, the volume is not marked CRITICAL. Is this a regression, because I don't see any change to this plugin behaviour. Removing devel_ack till confirmed. Brick status is marked as UNKNOWN in the nagios UI when glusterd in that node goes down. IMO, volume status should also be changed. Doc text is edited. Please sign off to be included in Known Issues. doc text looks good. Nagios monitors brick status and glusterd status separately and sends notifications if these service are down. For this particular case, volume status cannot be correctly determined - hence even a change in volume status could be interpreted incorrectly. Closing this - please re-open if you can suggest the volume status that it needs to move to. |