Bug 1916910 - Sometimes the elasticsearch-delete-xxx job failed at "Unexpected exception indices:admin/aliases/get"
Summary: Sometimes the elasticsearch-delete-xxx job failed at "Unexpected exception in...
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: OpenShift Container Platform
Classification: Red Hat
Component: Logging
Version: 4.6
Hardware: Unspecified
OS: Unspecified
high
high
Target Milestone: ---
: 4.6.z
Assignee: Jeff Cantrill
QA Contact: Giriyamma
URL:
Whiteboard: logging-exploration
Depends On: 1890838
Blocks: 1919075 1928772
TreeView+ depends on / blocked
 
Reported: 2021-01-15 19:48 UTC by Jeff Cantrill
Modified: 2024-03-25 17:52 UTC (History)
40 users (show)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
collapses the multiple policy cronjobs to a single job with multiple tasks it runs: delete rollover The reasoning is there is a potential race condition between the previous jobs which both rely upon a -write alias that may lead to false information. Additionally, ES does not have transactions or is ACID. By converting these into tasks we execute for management we: potentially free disk for ES to do additional work give a better chance for the rollover to be successful
Clone Of: 1890838
: 1928772 (view as bug list)
Environment:
Last Closed: 2021-02-08 13:41:45 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Github openshift elasticsearch-operator pull 627 0 None closed Bug 1916910: Condense indexmanagement into single cronjob 2021-02-16 13:21:09 UTC
Red Hat Product Errata RHSA-2021:0310 0 None None None 2021-02-08 13:41:52 UTC

Comment 2 Giriyamma 2021-01-28 07:51:40 UTC
Verified this issue using clusterlogging.4.6.0-202101271348.p0, elasticsearch-operator.4.6.0-202101271348.p0.

Comment 6 errata-xmlrpc 2021-02-08 13:41:45 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Moderate: OpenShift Container Platform 4.6.16 extras security update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2021:0310

Comment 8 Ronald 2021-02-16 12:25:28 UTC
Guys I've installed the 4.6.16 and still seeing the following err:


{"error":{"root_cause":[{"type":"security_exception","reason":"Unexpected exception indices:admin/aliases/get"}],"type":"security_exception","reason":"Unexpected exception indices:admin/aliases/get"},"status":500}
Error while attemping to determine the active write alias: {"error":{"root_cause":[{"type":"security_exception","reason":"Unexpected exception indices:admin/aliases/get"}],"type":"security_exception","reason":"Unexpected exception indices:admin/aliases/get"},"status":500}
Current write index for audit-write: audit-000160
Checking results from _rollover call
Next write index for audit-write: audit-000160
Checking if audit-000160 exists
Checking if audit-000160 is the write index for audit-write
Done!



Thanks,
Ronald

Comment 9 David Hernández Fernández 2021-02-16 12:39:59 UTC
https://bugzilla.redhat.com/show_bug.cgi?id=1928772 for 4.6.16+

Comment 11 David Hernández Fernández 2021-03-27 09:07:14 UTC
Vedanti, here are the new bugs as the issue is still not fixed. It's being verified in https://github.com/openshift/elasticsearch-operator/pull/678

Bug 1929688 (VERIFIED)   
Sometimes The Elasticsearch-Delete-Xxx Job Failed At "Unexpected Exception Indices:admin/Aliases/Get" - OCP 4.6.16
Bug 1928772 (VERIFIED)   
Sometimes The Elasticsearch-Delete-Xxx Job Failed: After OCP 4.6.16 Patch.


Note You need to log in before you can comment on or make changes to this bug.