Note: This bug is displayed in read-only format because the product is no longer active in Red Hat Bugzilla.

Bug 1489120

Summary: Ceph pg_num and pgp_num default is too low for any user
Product: Red Hat OpenStack Reporter: jomurphy
Component: openstack-tripleo-heat-templatesAssignee: Giulio Fidente <gfidente>
Status: CLOSED ERRATA QA Contact: Yogev Rabl <yrabl>
Severity: high Docs Contact:
Priority: high    
Version: 11.0 (Ocata)CC: dbecker, dcadzow, gfidente, jomurphy, mburns, mlopes, morazi, rhel-osp-director-maint, yrabl
Target Milestone: z4Keywords: Triaged, ZStream
Target Release: 11.0 (Ocata)   
Hardware: x86_64   
OS: Linux   
Whiteboard:
Fixed In Version: openstack-tripleo-heat-templates-6.2.4-2.el7ost Doc Type: Bug Fix
Doc Text:
Prior to this update, the default `pg_num` and `pgp_num` values set for the Ceph pools (32) was too low. For most production environments, using only 32 PGs for a Ceph pool could have resulted in performance issues, as it limited the scalability of the writes on multiple OSDs, and increased the time necessary to recover from faults. With this update, the new `pg_num` and `pgp_num` default for the Ceph pools is set to 128. As a result, the new default will result in improved Ceph cluster performance when there are 5-10 OSDs, but it is always best to calculate and optimize the PGs allocated to each pool for the different use cases. See the PGs chapter in the Ceph product documentation for information on how to calculate an optimal value: https://access.redhat.com/documentation/en-us/red_hat_ceph_storage/2/html/storage_strategies_guide/placement_groups_pgs.
Story Points: ---
Clone Of: 1481821 Environment:
Last Closed: 2018-02-13 16:31:14 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On: 1481821, 1489121    
Bug Blocks:    

Description jomurphy 2017-09-06 17:58:10 UTC
+++ This bug was initially created as a clone of Bug #1481821 +++

Description of problem:
OSP director defaults set Ceph's pg_num and pgp_num to 32 which is too low. This will cause performance issues for any costumer that will use it.

Version-Release number of selected component (if applicable):


How reproducible:
100%

Steps to Reproduce:
1.
2.
3.

Actual results:
The pg_num and pgp_num for all the pools is 32

Expected results:
pg_num and pgp_num default should be 128, which is the right number for the minimum deployment of 3 OSDs in a cluster

Additional info:

--- Additional comment from Jeff Brown on 2017-08-29 14:11:34 EDT ---

We need to set this to 128 by default. It will potentially overwritten by derived parameters or customers that  read the documentation.

Comment 3 Yogev Rabl 2018-01-25 20:59:34 UTC
failed. 

The code has not been merged yet in the latest puddle

Comment 7 Yogev Rabl 2018-01-31 18:01:30 UTC
Verified, the pg_num is set by default to 128

Comment 10 errata-xmlrpc 2018-02-13 16:31:14 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2018:0310