Bug 1680171

Summary: containerized radosgw requires higher --cpu-quota as default
Product: [Red Hat Storage] Red Hat Ceph Storage Reporter: John Harrigan <jharriga>
Component: Ceph-AnsibleAssignee: Dimitri Savineau <dsavinea>
Status: CLOSED ERRATA QA Contact: Vasishta <vashastr>
Severity: high Docs Contact: Bara Ancincova <bancinco>
Priority: high    
Version: 3.2CC: anharris, aschoen, assingh, bengland, cbodley, ceph-eng-bugs, dfuller, dsavinea, edonnell, evelu, frival, gabrioux, gmeno, jbrier, kbader, mbenjamin, nthomas, sankarshan, sweil, tchandra, tserlin, vumrao
Target Milestone: z2   
Target Release: 3.2   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: RHEL: ceph-ansible-3.2.12-1.el7cp Ubuntu: ceph-ansible_3.2.12-2redhat1 Doc Type: Bug Fix
Doc Text:
.Increased CPU CGroup limit for containerized Ceph Object Gateway The default CPU CGroup limit for containerized Ceph Object Gateway (RGW) was very low and has been increased with this update to be more reasonable for typical Hard Disk Drive (HDD) production environments. However, consider evaluating what limit to set for the site's configuration and workload. To customize the limit, adjust the `ceph_rgw_docker_cpu_limit` parameter in the Ansible `group_vars/rgws.yml` file.
Story Points: ---
Clone Of: Environment:
Last Closed: 2019-04-30 15:57:06 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 1629656    

Comment 8 Ben England 2019-02-26 19:55:19 UTC
you're right, sloppy on my part.  It does not remove the CGroup limit but it sets it so high that it is as if we had removed the limit - there is no way for the container to get that big.  I have recommended that the memory CGroup limit in ceph-ansible be removed entirely, see discussion in https://github.com/ceph/ceph-ansible/issues/3617

Comment 15 John Harrigan 2019-04-01 14:32:29 UTC
I think Ben's response in C#14 addresses the needinfo request.

Comment 21 Vasishta 2019-04-22 03:24:41 UTC
Observed changes from ceph-ansible's perspective.
Looks intact as per the requirements.
Moving to VERIFIED state.

Comment 24 Ben England 2019-04-28 21:46:05 UTC
Am having trouble reading the doc text in the preceding post here, but got it in the e-mail.   It said "The default CPU quota for containerized Ceph Object Gateway was significantly lower than for bare-metal Ceph Object Gateway. With this update, the default value for the CPU quota (`--cpu-quota`) for Ceph Object Gateways deployed in containers has been increased."

This is incorrect.  There is no CPU quota for bare-metal Ceph Rados (not Object) Gateway.   You could say is that "the default CPU CGroup limit for containerized RGW was very low and has been increased in this update to be more reasonable for typical HDD production environments - however, the sysadmin may want to evaluate what limit should be set for the site's configuration and workload."  Make sense?

Comment 26 Ben England 2019-04-29 19:01:55 UTC
Object Gateway is fine, I don't care which one you call it as long as people are used to that name.   My main concern was that there is no default CPU quota for bare metal configuration, and that problem has been corrected.  I talked with John Brier about that on IRC.  Thx -ben

Comment 28 errata-xmlrpc 2019-04-30 15:57:06 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2019:0911