Note: This bug is displayed in read-only format because the product is no longer active in Red Hat Bugzilla.
This project is now read‑only. Starting Monday, February 2, please use https://ibm-ceph.atlassian.net/ for all bug tracking management.

Bug 1339393

Summary: Documentation: make it clear how to resolve degraded or non-active pgs on a 1/2 osd cluster
Product: [Red Hat Storage] Red Hat Ceph Storage Reporter: Alexander Chuzhoy <sasha>
Component: DocumentationAssignee: John Wilkins <jowilkin>
Status: CLOSED NOTABUG QA Contact: Ramakrishnan Periyasamy <rperiyas>
Severity: unspecified Docs Contact:
Priority: medium    
Version: 2.0CC: asriram, hnallurv, kdreyer, khartsoe, sjust, srevivo, uboppana
Target Milestone: rc   
Target Release: 2.1   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2016-11-02 15:59:09 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Alexander Chuzhoy 2016-05-24 21:11:24 UTC
Documentation: Need to document the confusion and workaround for the degraded state of ceph when deploying OSP with director using only a single OSD.


What happens is that if a deployment with director was done with a single OSD,
then the status returns:
     health HEALTH_WARN
and there are pgs in degraded,unclean,undersized state upon creation/load of data.


The workaround for it would be:

1) For every created pool to run:
ceph osd pool set <poolname> size 1

2) To set in /etc/ceph/ceph.conf
osd_pool_default_size = 1
And restart the ceph service.

Comment 2 Samuel Just 2016-05-24 21:21:06 UTC
There seems to be a lot of confusion in cases where a 1 osd test/poc cluster comes up with degraded/non-active pgs due to the default size=3/min_size=2 values.

http://docs.ceph.com/docs/master/rados/troubleshooting/troubleshooting-pg/

at the least should have examples of clusters with both degraded and non-active pgs.  It should also be clear how to resolve it (both by setting the config value for new pools and by changing existing pools).