Hide Forgot
The Elasticsearch v2.x sizing guidelines [1] state that less than 8 GB ends up with too many small instances, with 64 GB being the sweet spot, but 32 GB and 16 GB being common sizes. Let's update the default ES pod size to 16 GB (8 GB Java HEAP and 8 GB reserved for buffer cache) to stay in line with what is considered common. [1] https://www.elastic.co/guide/en/elasticsearch/guide/current/hardware.html#_memory
That would be here: https://github.com/openshift/openshift-ansible/blob/master/roles/openshift_logging/defaults/main.yml#L102 and here: https://github.com/openshift/openshift-ansible/blob/master/roles/openshift_logging/defaults/main.yml#L139 ?
(In reply to Rich Megginson from comment #1) > https://github.com/openshift/openshift-ansible/blob/master/roles/ > openshift_logging/defaults/main.yml#L102 > https://github.com/openshift/openshift-ansible/blob/master/roles/ > openshift_logging/defaults/main.yml#L139 Yes, thanks!
https://github.com/openshift/openshift-ansible/pull/7985
Deploy logging firstly and change fluentd nodeSelector to non-default value, logging-infra-test-fluentd=true # oc get ds NAME DESIRED CURRENT READY UP-TO-DATE AVAILABLE NODE SELECTOR AGE logging-fluentd 2 2 2 2 2 logging-infra-test-fluentd=true 10m Update logging with the same inventory, fluentd nodeSelector would use the default nodeSelector logging-infra-fluentd=true, not get the existing nodeSelector from environment # oc get ds NAME DESIRED CURRENT READY UP-TO-DATE AVAILABLE NODE SELECTOR AGE logging-fluentd 2 2 2 2 2 logging-infra-fluentd=true 15m # rpm -qa | grep openshift-ansible openshift-ansible-roles-3.9.28-1.git.0.4fc2ce4.el7.noarch openshift-ansible-docs-3.9.28-1.git.0.4fc2ce4.el7.noarch openshift-ansible-playbooks-3.9.28-1.git.0.4fc2ce4.el7.noarch openshift-ansible-3.9.28-1.git.0.4fc2ce4.el7.noarch
The reported BZ is specific to memory and cpu settings. I am of the opinion that it should not block this test. We should consider opening a separate BZ to resolve fluent related issues.
Tested, ES memory limits would get from existing deployment instead of using the defaults. Polarion test case OCP-18917 # rpm -qa | grep openshift-ansible openshift-ansible-roles-3.9.27-1.git.0.52e35b5.el7.noarch openshift-ansible-docs-3.9.27-1.git.0.52e35b5.el7.noarch openshift-ansible-playbooks-3.9.27-1.git.0.52e35b5.el7.noarch openshift-ansible-3.9.27-1.git.0.52e35b5.el7.noarch
Issue in Comment 15 is reported in bug 1575901
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2018:1566