Bug 1593310
Summary: | Fluent pipeline stuck because records in request do not equal response | ||
---|---|---|---|
Product: | OpenShift Container Platform | Reporter: | Jeff Cantrill <jcantril> |
Component: | Logging | Assignee: | Jeff Cantrill <jcantril> |
Status: | CLOSED ERRATA | QA Contact: | Anping Li <anli> |
Severity: | high | Docs Contact: | |
Priority: | unspecified | ||
Version: | 3.9.0 | CC: | anli, aos-bugs, juzhao, rmeggins, smunilla |
Target Milestone: | --- | ||
Target Release: | 3.9.z | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | Bug Fix | |
Doc Text: |
Cause: fluent-plugin-elasticsearch improperly handled book keeping of the records being submitted
Consequence: Fluent was stuck processing a chuck even though there was a valid request and response.
Fix: Properly account for the records submitted to Elasticsearch.
Result: The pipeline is no longer stuck
|
Story Points: | --- |
Clone Of: | 1593297 | Environment: | |
Last Closed: | 2018-07-18 09:18:59 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: | |||
Bug Depends On: | 1593297, 1593312 | ||
Bug Blocks: | 1562004 |
Description
Jeff Cantrill
2018-06-20 14:12:58 UTC
Waiting new packages. The fix was not in openshift3/logging-fluentd/images/v3.9.33-2. 2018-07-11 05:56:15 -0400 [warn]: fluent/output.rb:381:rescue in try_flush: temporarily failed to flush the buffer. next_retry=2018-07-11 05:57:14 -0400 error_class="Fluent::ElasticsearchErrorHandler::ElasticsearchError" error="The number of records submitted 6837 do not match the number returned 6835. Unable to process bulk response." plugin_id="object:3fdb3c8de274" Moving this back to modified in anticipation of a corrected image used buffer file from https://bugzilla.redhat.com/attachment.cgi?id=1417859 and followed steps in Comment 0, all buffer record are read without the no equal response error in fluentd pods logs. Moved to verified logging images version: v3.9.33-3 Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2018:2213 |