Note: This bug is displayed in read-only format because the product is no longer active in Red Hat Bugzilla.
This project is now read‑only. Starting Monday, February 2, please use https://ibm-ceph.atlassian.net/ for all bug tracking management.

Bug 1498504

Summary: 'Ceph OSD Information' - bad number in 'OSDs UP' panel
Product: [Red Hat Storage] Red Hat Ceph Storage Reporter: Lubos Trilety <ltrilety>
Component: Ceph-MetricsAssignee: Paul Cuzner <pcuzner>
Status: CLOSED ERRATA QA Contact: Lubos Trilety <ltrilety>
Severity: medium Docs Contact:
Priority: high    
Version: 3.0CC: anharris, branto, ceph-eng-bugs, kdreyer, ltrilety, pcuzner, zcerza
Target Milestone: rc   
Target Release: 3.0   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: cephmetrics-1.0-4.el7cp Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2017-12-05 23:46:42 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Attachments:
Description Flags
OSD machine not running none

Description Lubos Trilety 2017-10-04 13:29:08 UTC
Created attachment 1334245 [details]
OSD machine not running

Description of problem:
When an OSD machine with 4 OSD daemons was switched off 'Ceph - At A Glance' shows correctly that only 12 OSDs are up. However 'Ceph OSD Information' dashboard shows still 16 OSDs as up whilst 'OSDs DOWN' panel shows correct information. So the sum of 'OSDs UP' and 'OSDs DOWN' is bigger than the number in 'OSDs' panel.

Moreover when the machine was started again instead of displaying 16 OSDs up there's just 15 up displayed on the dashboard.


Version-Release number of selected component (if applicable):
cephmetrics-grafana-plugins-0.1-372_g098fb9a.el7.centos.x86_64
cephmetrics-0.1-372_g098fb9a.el7.centos.x86_64

How reproducible:
100%

Steps to Reproduce:
1. Switch off one OSD machine
2. Look at 'Ceph OSD Information' dashboard
3.

Actual results:
There's a bad number in 'OSDs UP' panel

Expected results:
Correct number is displayed

Additional info:

Comment 1 Paul Cuzner 2017-10-05 00:25:45 UTC
The OSDs UP panel is using the default time interval of 1hr - which is why the number is not tracking. Changing the relative time to 60s (instead of 60m) shows the value correctly.

Comment 2 Paul Cuzner 2017-10-11 04:19:44 UTC
PR submitted upstream, pending review and merge
https://github.com/ceph/cephmetrics/pull/142

Comment 3 Zack Cerza 2017-10-11 16:31:30 UTC
Fix is merged

Comment 6 Lubos Trilety 2017-10-25 10:50:20 UTC
Tested with:
cephmetrics-1.0-6.el7cp.x86_64

After 1 minute data were always correct.

Comment 9 errata-xmlrpc 2017-12-05 23:46:42 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2017:3387