Bug 1498504
| Summary: | 'Ceph OSD Information' - bad number in 'OSDs UP' panel | ||||||
|---|---|---|---|---|---|---|---|
| Product: | [Red Hat Storage] Red Hat Ceph Storage | Reporter: | Lubos Trilety <ltrilety> | ||||
| Component: | Ceph-Metrics | Assignee: | Paul Cuzner <pcuzner> | ||||
| Status: | CLOSED ERRATA | QA Contact: | Lubos Trilety <ltrilety> | ||||
| Severity: | medium | Docs Contact: | |||||
| Priority: | high | ||||||
| Version: | 3.0 | CC: | anharris, branto, ceph-eng-bugs, kdreyer, ltrilety, pcuzner, zcerza | ||||
| Target Milestone: | rc | ||||||
| Target Release: | 3.0 | ||||||
| Hardware: | Unspecified | ||||||
| OS: | Unspecified | ||||||
| Whiteboard: | |||||||
| Fixed In Version: | cephmetrics-1.0-4.el7cp | Doc Type: | If docs needed, set a value | ||||
| Doc Text: | Story Points: | --- | |||||
| Clone Of: | Environment: | ||||||
| Last Closed: | 2017-12-05 23:46:42 UTC | Type: | Bug | ||||
| Regression: | --- | Mount Type: | --- | ||||
| Documentation: | --- | CRM: | |||||
| Verified Versions: | Category: | --- | |||||
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||
| Cloudforms Team: | --- | Target Upstream Version: | |||||
| Embargoed: | |||||||
| Attachments: |
|
||||||
The OSDs UP panel is using the default time interval of 1hr - which is why the number is not tracking. Changing the relative time to 60s (instead of 60m) shows the value correctly. PR submitted upstream, pending review and merge https://github.com/ceph/cephmetrics/pull/142 Fix is merged Tested with: cephmetrics-1.0-6.el7cp.x86_64 After 1 minute data were always correct. Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2017:3387 |
Created attachment 1334245 [details] OSD machine not running Description of problem: When an OSD machine with 4 OSD daemons was switched off 'Ceph - At A Glance' shows correctly that only 12 OSDs are up. However 'Ceph OSD Information' dashboard shows still 16 OSDs as up whilst 'OSDs DOWN' panel shows correct information. So the sum of 'OSDs UP' and 'OSDs DOWN' is bigger than the number in 'OSDs' panel. Moreover when the machine was started again instead of displaying 16 OSDs up there's just 15 up displayed on the dashboard. Version-Release number of selected component (if applicable): cephmetrics-grafana-plugins-0.1-372_g098fb9a.el7.centos.x86_64 cephmetrics-0.1-372_g098fb9a.el7.centos.x86_64 How reproducible: 100% Steps to Reproduce: 1. Switch off one OSD machine 2. Look at 'Ceph OSD Information' dashboard 3. Actual results: There's a bad number in 'OSDs UP' panel Expected results: Correct number is displayed Additional info: