Bug 1890113 - [5.0] Ceph-Dashboard - Device health status is not getting listed under hosts section in 5.0 dashboard
Summary: [5.0] Ceph-Dashboard - Device health status is not getting listed under hosts...
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Ceph Storage
Classification: Red Hat Storage
Component: Ceph-Dashboard
Version: 5.0
Hardware: x86_64
OS: Linux
medium
low
Target Milestone: ---
: 5.1
Assignee: Alfonso Martínez
QA Contact: Sunil Angadi
URL:
Whiteboard:
: 2025699 (view as bug list)
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2020-10-21 12:21 UTC by Preethi
Modified: 2022-04-04 10:20 UTC (History)
12 users (show)

Fixed In Version: ceph-16.2.7-27.el8cp
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2022-04-04 10:19:53 UTC
Embargoed:


Attachments (Terms of Use)
Snippet of the device list (199.31 KB, image/png)
2020-10-21 12:21 UTC, Preethi
no flags Details
unknown state (213.75 KB, image/png)
2020-10-21 12:22 UTC, Preethi
no flags Details


Links
System ID Private Priority Status Summary Last Updated
Ceph Project Bug Tracker 49354 0 None None None 2021-04-05 18:34:24 UTC
Github ceph ceph pull 40494 0 None open pacific: mgr/dashboard: Device health status is not getting listed under hosts section 2021-04-05 18:34:24 UTC
Github ceph ceph pull 44597 0 None Merged pacific: mgr/dashboard: fix: get SMART data from single-daemon device 2022-01-17 10:17:00 UTC
Red Hat Issue Tracker RHCSDASH-242 0 None None None 2021-11-10 06:17:05 UTC
Red Hat Knowledge Base (Solution) 6533111 0 None None None 2022-03-10 07:44:47 UTC
Red Hat Product Errata RHSA-2022:1174 0 None None None 2022-04-04 10:20:24 UTC

Description Preethi 2020-10-21 12:21:15 UTC
Created attachment 1723179 [details]
Snippet of the device list

Description of problem:[5.0] Ceph-Dashboard - Device health status is not getting listed under hosts section in 5.0 dashboard 


Version-Release number of selected component (if applicable):

[root@magna094 ubuntu]# ./cephadm version
Using recent ceph image registry-proxy.engineering.redhat.com/rh-osbs/rhceph:ceph-5.0-rhel-8-containers-candidate-96803-20201013192445
ceph version 16.0.0-6275.el8cp (d1e0606106224ac333f1c245150d7484cb626841) pacific (dev)

How reproducible:


Steps to Reproduce:
1.Install 5.0 cluster 
2. Launch dashboard URL
3. Go to hosts section and click on specific host and check for its attributes like device, device health, daemons, inventory and others
4. Observe that device and device state of health is showing "unknown" and device health displayes " failed to retrieve SMART DATA"

Cephadm cli is showing up details correctly with ceph orch device ls command
Actual results:
Device state of health is showing "unknown" and device health displayes " failed to retrieve SMART DATA"

Expected results:
Values should reflect as per cli output in dashboard also

Additional info:


https://magna094.ceph.redhat.com:8443/#/login  -Dashboard URL

admin/ admin123

Bootstrap node: magna094
root/q

Comment 1 Preethi 2020-10-21 12:22:30 UTC
Created attachment 1723180 [details]
unknown state

Comment 4 Preethi 2020-12-17 12:21:24 UTC
@Ernesto, the ask was to confirm from dahsboard team if they using the below to get information from devices. There is no action item from QE i guess. I checked with Juan also on the same.
Is the dashboard using:

"ceph orch device ls --format json"

Comment 6 Juan Miguel Olmo 2020-12-22 09:46:07 UTC
I think we need to use the same source of information independently of the tool (dashboard, cli) which is providing the output.

In this case , what we want is to get the list of devices in the cluster hosts and what is the state of this devices:

This is done in the orchestrator using the command:

```
# ceph orch device ls
```

Which is using in the background the command:

```
# ceph-volume inventory
```

and in the dashboard ( copying what Preethi says:)

"""
3. Go to hosts section and click on specific host and check for its attributes like device, device health, daemons, inventory and others
4. Observe that device and device state of health is showing "unknown" and device health displayes " failed to retrieve SMART DATA"
"""


As you said Ernesto, in the dashboard we are using:

```
# ceph daemon osd.<id> smart <devid>"
```

So the first thing to do is to decide what is the best tool to provide information about storage devices in the cluster hosts, and use the same in the dashboard and in the orchestrator CLI

I think that probably we should rely in ceph-volume to get this information, because if you use "ceph daemon osd.<id>"  you cannot retrieve the information of the device if you do not have an OSD created using the device.
So i think that the better solution is to change the command used in the dashboard to "ceph orch device ls --format json"

Comment 7 Juan Miguel Olmo 2021-02-02 15:11:41 UTC
I continue thinking the same.. the information about devices form the dashvoard and from the CLI must be the same. And must have the same source. 
In my view the right source is to use always the command:

# ceph orch device ls --json

Comment 8 Aashish sharma 2021-02-11 10:37:24 UTC
@JuanMi, Dashboard is currently using this command for retrieving the smart data - 'ceph daemon <svc_type>.<svc_id> smart <device_id>'. This command runs fine for svc_type="osd" but is failing for svc_type="mon". Moreover "ceph orch device ls --json" is not providing relevant smart data attributes. Maybe there is some other way to fetch smart data from ceph-volume.

Comment 11 Ken Dreyer (Red Hat) 2021-04-05 21:26:15 UTC
https://github.com/ceph/ceph/pull/40494 missed the v16.2.0 release upstream. Since this is low severity, I recommend we re-target this to RHCS 5.1.

Comment 24 Sunil Angadi 2022-03-10 07:44:48 UTC
*** Bug 2025699 has been marked as a duplicate of this bug. ***

Comment 26 errata-xmlrpc 2022-04-04 10:19:53 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Moderate: Red Hat Ceph Storage 5.1 Security, Enhancement, and Bug Fix update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2022:1174


Note You need to log in before you can comment on or make changes to this bug.