Bug 2176088

Summary: [GSS] Document the common issues and troubleshooting steps for MDS/CephFS
Product: [Red Hat Storage] Red Hat Ceph Storage Reporter: Karun Josy <kjosy>
Component: DocumentationAssignee: Akash Raj <akraj>
Documentation sub component: File System Guide QA Contact: Aditya Ramteke <aramteke>
Status: NEW --- Docs Contact:
Severity: high    
Priority: unspecified CC: akraj, asriram, ksachdev, vshankar
Version: 5.3Flags: kjosy: needinfo? (vshankar)
Target Milestone: ---   
Target Release: Backlog   
Hardware: All   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Karun Josy 2023-03-07 11:27:13 UTC
* Describe the issue:

In RHCS 5, we do not have a troubleshooting section for MDS.
In the Filesystem guide[1] we have a section : "Appendix A. Health messages for the Ceph File System"

But this just lists out and defines various Health errors/warnings for MDS.
It does not explain the steps that support/customers should take to address individual issues. Nor do we think it is comprehensive enough to understand the issue. 


[1] https://access.redhat.com/documentation/en-us/red_hat_ceph_storage/5/html/file_system_guide/health-messages-for-the-ceph-file-system_fs


* Suggestions for improvement:

We are looking for a comprehensive document that
- explains all the common issues with MDS/CephFS
- Logs that need to be collected(at what level) for further review
- Steps needed to fix the issue or apply any workaround



* Document URL:

We can either add this as a separate section in 
Filesystem Guide or Troubleshooting Guide

Comment 1 Karun Josy 2023-03-07 11:27:13 UTC
Additional Information :

For example, we have a Ceph-ISCSI section that is well documented with the support of ISCSI developers:  https://bugzilla.redhat.com/show_bug.cgi?id=1713514