Bug 1462277
Summary: | Change the Elasticsearch setting "node.max_local_storage_nodes" to 1 to prevent sharing EBS volumes | |||
---|---|---|---|---|
Product: | OpenShift Container Platform | Reporter: | Rich Megginson <rmeggins> | |
Component: | Logging | Assignee: | Jeff Cantrill <jcantril> | |
Status: | CLOSED ERRATA | QA Contact: | Xia Zhao <xiazhao> | |
Severity: | high | Docs Contact: | ||
Priority: | unspecified | |||
Version: | 3.4.1 | CC: | aos-bugs, jcantril, juzhao, pportant, pvarma, rmeggins, rromerom, xiazhao | |
Target Milestone: | --- | |||
Target Release: | 3.4.z | |||
Hardware: | All | |||
OS: | All | |||
Whiteboard: | ||||
Fixed In Version: | Doc Type: | Bug Fix | ||
Doc Text: |
Cause: Elasticsearch default value for sharing storage between ES instances was wrong
Consequence: The incorrect default value allowed an ES pod starting up (when another ES pod was shutting down, e.g. during dc redeployments) to create a new location on the PV for managing the storage volume, duplicating data, and in some instances, potentially causing data loss.
Fix: All ES pods now run with "node.max_local_storage_nodes" set to 1.
Result: The ES pods starting up/shutting down will no longer share the same storage and prevent the data duplication and/or data loss.
|
Story Points: | --- | |
Clone Of: | 1460564 | |||
: | 1462281 1463046 (view as bug list) | Environment: | ||
Last Closed: | 2017-07-11 10:47:38 UTC | Type: | Bug | |
Regression: | --- | Mount Type: | --- | |
Documentation: | --- | CRM: | ||
Verified Versions: | Category: | --- | ||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | ||
Cloudforms Team: | --- | Target Upstream Version: | ||
Embargoed: | ||||
Bug Depends On: | 1460564 | |||
Bug Blocks: | 1462281, 1463046 |
Description
Rich Megginson
2017-06-16 15:13:28 UTC
Modifying this BZ to ref 3.4.1 as it clones the one for which comment 1 PR references the cloned BZ brew-pulp-docker01.web.prod.ext.phx2.redhat.com:8888/openshift3/logging-deployer:rhaos-3.4-rhel-7-docker-candidate-88845-20170620200020, brew-pulp-docker01.web.prod.ext.phx2.redhat.com:8888/openshift3/logging-deployer:3.4.1, brew-pulp-docker01.web.prod.ext.phx2.redhat.com:8888/openshift3/logging-deployer:latest, brew-pulp-docker01.web.prod.ext.phx2.redhat.com:8888/openshift3/logging-deployer:v3.4, brew-pulp-docker01.web.prod.ext.phx2.redhat.com:8888/openshift3/logging-deployer:v3.4.1.41, brew-pulp-docker01.web.prod.ext.phx2.redhat.com:8888/openshift3/logging-deployer:v3.4.1.41-2 max_local_storage_nodes is 1 now # oc get configmap logging-elasticsearch -o yaml | grep -i max_local_storage_nodes max_local_storage_nodes: 1 Testing env: # openshift version openshift v3.4.1.42 kubernetes v1.4.0+776c994 etcd 3.1.0-rc.0 Images from brew registry # docker images | grep logging logging-deployer 3.4.1 80ca9c90d261 35 hours ago 857.5 MB logging-kibana 3.4.1 0c2759ddfcd9 35 hours ago 338.8 MB logging-elasticsearch 3.4.1 2240ae237369 35 hours ago 399.6 MB logging-fluentd 3.4.1 059b92a39419 35 hours ago 232.7 MB logging-curator 3.4.1 46fd26ad9a8b 35 hours ago 244.5 MB logging-auth-proxy 3.4.1 990787824baf 35 hours ago 215.3 MB @Jeff - We have a situation here with regards to the errata - https://errata.devel.redhat.com/advisory/29143 where the release date is tomorrow (29th June) and the customer is looking for this for quite some time. Customer also escalated this several times and Mustafa, Sudhir, Satish and a lot of others from the senior management is directly involved to get the issues taken care for the customer. Just received an update from Xiaoli Tan that if these bugs are fixed today, we could still have the timely release tomorrow. Thanks, Praveen Escalation Manager Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2017:1640 |