Bug 1552977 - Use multiple primary shards for the .operations indices by default matching the number of data nodes in the cluster
Status: CLOSED ERRATA
Product: OpenShift Container Platform
Classification: Red Hat
Component: Logging
Version: 3.7.0
Hardware: All
OS: Linux
Priority: high
Severity: high
Target Milestone: ---
Target Release: 3.7.z
Assigned To: Jeff Cantrill
QA Contact: Qiaoling Tang
Keywords: NeedsTestCase
Duplicates: 1582225
Depends On:
Blocks: 1553257
Reported: 2018-03-07 19:55 EST by Peter Portante
Modified: 2018-08-09 18:15 EDT
CC List: 4 users

See Also:
Fixed In Version:
Doc Type: Enhancement
Doc Text:
Feature: Allow the number of index shards and replicas to be configured using environment variables. Reason: The logs collected for infra services consume a large portion of the available disk space. Spreading the data across available nodes by modifying the replica and shard settings allows Elasticsearch to better support these large amounts of data. Result: Improved performance in Elasticsearch when there are large amounts of data from infra services.
Story Points: ---
Clone Of:
Clones: 1553257
Environment:
Last Closed: 2018-08-09 18:14:04 EDT
Type: Bug
Regression: ---
Mount Type: ---
Documentation: ---
CRM:
Verified Versions:
Category: ---
oVirt Team: ---
RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: ---


Attachments: none


External Trackers
Tracker ID Priority Status Summary Last Updated
Github openshift/origin-aggregated-logging/pull/1019 None None None 2018-03-15 08:31 EDT
Red Hat Product Errata RHBA-2018:2337 None None None 2018-08-09 18:15 EDT

Description Peter Portante 2018-03-07 19:55:59 EST
Sharding the .operations indices (and .orphaned indices) using a primary shard count that matches the number of nodes in the cluster will provide an even distribution of disk usage and load across cluster members, helping with high operational logging rates.

This has little effect in low logging rate situations.
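As a rough illustration of the idea (a sketch only, not the exact template shipped in the image; the order value and index pattern here are assumptions), an index template pinning the .operations indices to three primaries on a three-data-node cluster could look like:

```json
{
  "template": ".operations.*",
  "order": 10,
  "settings": {
    "index.number_of_shards": 3,
    "index.number_of_replicas": 0
  }
}
```

With three primaries and three data nodes, Elasticsearch can place one primary on each node, spreading disk usage and indexing load evenly.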
Comment 1 Peter Portante 2018-03-07 20:07:30 EST
See the following gist for an example: https://gist.github.com/portante/f8cfecad1c6b69cdc1736ce464501d6f
Comment 5 Anping Li 2018-04-20 06:02:01 EDT
Jeff, the fix is in logging-elasticsearch/images/v3.7.44-3. The default shard count is set to 3.

However, the shard and replica counts cannot be updated via the environment variables. The changes are written to the template file, but they are not loaded into Elasticsearch.

1. oc set env -c elasticsearch dc/logging-es-data-master-e1rbo8ai PRIMARY_SHARDS=1
2. Check the template JSON file:
sh-4.2$ cat common.settings.operations.orphaned.json
{
  "order": 5,
  "settings": {
    "index.refresh_interval": "5s",
    "index.number_of_replicas": 0,
    "index.number_of_shards": 1,
    "index.translog.flush_threshold_size": "256mb",
    "index.unassigned.node_left.delayed_timeout": "2m"
  },
  "template": ".orphaned*"
}

3. Check the template in ES
sh-4.2$ curl -s -XGET --cacert /etc/elasticsearch/secret/admin-ca --cert /etc/elasticsearch/secret/admin-cert --key /etc/elasticsearch/secret/admin-key https://localhost:9200/_template/common.settings.operations.orphaned.json?pretty
{
  "common.settings.operations.orphaned.json" : {
    "order" : 5,
    "template" : ".orphaned*",
    "settings" : {
      "index" : {
        "refresh_interval" : "5s",
        "unassigned" : {
          "node_left" : {
            "delayed_timeout" : "2m"
          }
        },
        "number_of_shards" : "3",
        "translog" : {
          "flush_threshold_size" : "256mb"
        },
        "number_of_replicas" : "0"
      }
    },
    "mappings" : { },
    "aliases" : { }
  }
}
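The mismatch above can be checked mechanically. Here is a minimal sketch in plain POSIX shell, run against trimmed copies of the two outputs above (the sed patterns assume exactly the formatting shown): the on-disk file stores settings as flat "index.*" keys, while the ES API reports them nested under "index" with string values.

```shell
# Shard count as written in the on-disk template file (flat "index.*" key).
file_shards=$(sed -n 's/.*"index.number_of_shards": \([0-9]*\).*/\1/p' <<'EOF'
{
  "settings": {
    "index.number_of_shards": 1
  }
}
EOF
)

# Shard count as reported by the ES _template API (nested, quoted value).
es_shards=$(sed -n 's/.*"number_of_shards" : "\([0-9]*\)".*/\1/p' <<'EOF'
{
  "settings": {
    "index": {
      "number_of_shards" : "3"
    }
  }
}
EOF
)

echo "file=$file_shards es=$es_shards"
```

A differing pair, as here, reproduces the symptom from steps 2 and 3: the file was updated but Elasticsearch still holds the old template.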
Comment 7 Jeff Cantrill 2018-04-26 14:55:49 EDT
Deployed locally image: brew-pulp-docker01.web.prod.ext.phx2.redhat.com:8888/openshift3/logging-elasticsearch:v3.7.44-3

Modified the DC to have:

env:
        - name: PRIMARY_SHARDS
          value: "2"
        - name: REPLICA_SHARDS
          value: "1"

Rolled out latest
rsh into the pod
$ QUERY=_template/common.settings.operations.orphaned.json?pretty es_util
{
  "common.settings.operations.orphaned.json" : {
    "order" : 5,
    "template" : ".orphaned*",
    "settings" : {
      "index" : {
        "refresh_interval" : "5s",
        "unassigned" : {
          "node_left" : {
            "delayed_timeout" : "2m"
          }
        },
        "number_of_shards" : "2",
        "translog" : {
          "flush_threshold_size" : "256mb"
        },
        "number_of_replicas" : "1"
      }
    },
    "mappings" : { },
    "aliases" : { }
  }
}

It looks like the settings are applied as expected.
Comment 8 Anping Li 2018-04-26 21:55:42 EDT
@jeff, are there persistent volumes in your Elasticsearch pod?
Comment 9 Jeff Cantrill 2018-04-27 14:37:32 EDT
There are not.  Are you suggesting that adding storage alters the outcome?
Comment 10 Anping Li 2018-04-28 04:19:42 EDT
@jeff, no, I am just wondering why we got different results. Could that be caused by persistent storage?
Comment 11 Jeff Cantrill 2018-04-29 21:54:30 EDT
I don't see how persistent storage would make a difference, unless there is some behavior where ES will not accept changes that overwrite a template that already exists. I would expect that to manifest as an error at startup.
Comment 12 Jeff Cantrill 2018-05-24 21:29:54 EDT
*** Bug 1582225 has been marked as a duplicate of this bug. ***
Comment 14 Anping Li 2018-07-31 04:32:59 EDT
The shard and replica settings of the .operations.* and .orphaned.* indices can be changed via the PRIMARY_SHARDS and REPLICA_SHARDS environment variables when using openshift3/logging-elasticsearch/images/v3.9.40-1.

Note that:
1) Use the same values for all Elasticsearch DeploymentConfigs.
2) Run oc rollout latest $ES_Deployment_configure to apply the changes.
3) To avoid conflicts, do not run oc rollout around midnight, when new indices are being created.
4) The environment variables may be lost when the playbooks are re-run.
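The first two notes can be scripted. A minimal sketch follows; the DC names are hypothetical (a real cluster would list its own, e.g. via oc get dc), and it only prints the commands as a dry run rather than executing them:

```shell
# Same values for every Elasticsearch DC (note 1).
PRIMARY_SHARDS=3
REPLICA_SHARDS=1

# Hypothetical DC names; substitute the ones from your cluster.
cmds=""
for dc in logging-es-data-master-aaa logging-es-data-master-bbb; do
  # Set the env vars, then roll out so the change takes effect (note 2).
  cmds="$cmds
oc set env -c elasticsearch dc/$dc PRIMARY_SHARDS=$PRIMARY_SHARDS REPLICA_SHARDS=$REPLICA_SHARDS
oc rollout latest dc/$dc"
done

printf '%s\n' "$cmds"
```

Removing the printf indirection and running the oc commands directly applies them; keep note 3 in mind when scheduling the rollout.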
Comment 16 errata-xmlrpc 2018-08-09 18:14:04 EDT
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2018:2337
