Bug 1810396

Summary: [RFE][gss][RADOS] "ceph device query-daemon-health-metrics osd.x" command is not working on MegaRaid disks.
Product: [Red Hat Storage] Red Hat Ceph Storage Reporter: Geo Jose <gjose>
Component: RADOSAssignee: Neha Ojha <nojha>
Status: CLOSED NOTABUG QA Contact: Manohar Murthy <mmurthy>
Severity: medium Docs Contact: Aron Gunn <agunn>
Priority: medium    
Version: 4.0CC: agunn, bniver, ceph-eng-bugs, dzafman, gsitlani, hfukumot, kchai, mhackett, mmuench, nojha, system-engineering, yhatuka
Target Milestone: rcKeywords: FutureFeature
Target Release: 5.*   
Hardware: x86_64   
OS: All   
Whiteboard:
Fixed In Version: Doc Type: Known Issue
Doc Text:
.The `ceph device` command does not work when querying MegaRaid devices Currently, the `ceph device query-daemon-health-metrics` command does not support querying the health metrics of disks attached to MegaRaid devices. This command displays an error similar to the following: ---- smartctl returned invalid JSON ---- The disk failure prediction module for MegaRaid devices is unusable at this time. Currently, there is no workaround for this issue. See the link:{hardware-guide}#avoid-using-raid-solutions_hw[_{storage-product} Hardware Guide_] for more information on using RAID solutions with {storage-product}.
Story Points: ---
Clone Of: Environment:
Last Closed: 2020-09-14 19:18:27 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 1816167    

Description Geo Jose 2020-03-05 07:02:38 UTC
Description of problem:
ceph can monitor the health metrics for all types of devices(such as SATA, SAS, NVMe) using SMART. But "ceph device query-daemon-health-metrics osd.X" is not working on MegaRaid OSD device.

Version-Release number of selected component (if applicable):


How reproducible:


Steps to Reproduce:
1. Configure OSDs on MegaRaid devices(Device driver: megaraid_sas).
2. Run the command "ceph device query-daemon-health-metrics osd.X".


Actual results:
Getting the error "smartctl returned invalid JSON"

Expected results:


Additional info:
It seems that ceph uses the command "/usr/sbin/smartctl -a --json=o /dev/sdX". For MegaRaid devices, this command will not work until we pass the option "megaraid,N"(N  denotes which disk on the controller is monitored) with smartctl command.

Comment 3 Yaarit Hatuka 2020-03-09 18:21:14 UTC
Hi Geo,

Unfortunately, we currently do not support querying health metrics of devices behind HW RAID.

Thanks,
Yaarit