Bug 1901096 - fluentforward inability to connect blocks fluent pods from starting
Summary: fluentforward inability to connect blocks fluent pods from starting
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: OpenShift Container Platform
Classification: Red Hat
Component: Logging
Version: 4.7
Hardware: Unspecified
OS: Unspecified
high
medium
Target Milestone: ---
: 4.7.0
Assignee: Jeff Cantrill
QA Contact: Qiaoling Tang
URL:
Whiteboard: logging-core
Depends On:
Blocks: 1906479
TreeView+ depends on / blocked
 
Reported: 2020-11-24 13:43 UTC by Jeff Cantrill
Modified: 2021-02-24 11:23 UTC (History)
1 user (show)

Fixed In Version:
Doc Type: No Doc Update
Doc Text:
Clone Of:
: 1906479 (view as bug list)
Environment:
Last Closed: 2021-02-24 11:22:27 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Github openshift cluster-logging-operator pull 813 0 None closed Bug 1901096: Set forward heartbeat to none 2021-02-13 12:16:59 UTC
Red Hat Product Errata RHBA-2021:0652 0 None None None 2021-02-24 11:23:08 UTC

Description Jeff Cantrill 2020-11-24 13:43:07 UTC
Description of problem:

LogForwarding configured with forward protocol tries to send a heartbeat to the configured output and will fail if the output does not respond:


Version-Release number of selected component (if applicable):

This causes the pods to error and go into a crash-loop

How reproducible:

Always

Steps to Reproduce:
1. Deploy ClusterLogForwarding with a fluent forward output that does not exist
2.
3.

Actual results:

2020-11-23 20:26:38 +0000 [error]: unexpected error error_class=SocketError error="getaddrinfo: Name or service not known"
  2020-11-23 20:26:38 +0000 [error]: /usr/local/share/gems/gems/fluentd-1.7.4/lib/fluent/plugin/out_forward.rb:680:in `getaddrinfo'
  2020-11-23 20:26:38 +0000 [error]: /usr/local/share/gems/gems/fluentd-1.7.4/lib/fluent/plugin/out_forward.rb:680:in `resolve_dns!'
  2020-11-23 20:26:38 +0000 [error]: /usr/local/share/gems/gems/fluentd-1.7.4/lib/fluent/plugin/out_forward.rb:666:in `resolved_host'
  2020-11-23 20:26:38 +0000 [error]: /usr/local/share/gems/gems/fluentd-1.7.4/lib/fluent/plugin/out_forward.rb:516:in `validate_host_resolution!'
  2020-11-23 20:26:38 +0000 [error]: /usr/local/share/gems/gems/fluentd-1.7.4/lib/fluent/plugin/out_forward.rb:237:in `block in configure'
  2020-11-23 20:26:38 +0000 [error]: /usr/local/share/gems/gems/fluentd-1.7.4/lib/fluent/plugin/out_forward.rb:227:in `each'
  2020-11-23 20:26:38 +0000 [error]: /usr/local/share/gems/gems/fluentd-1.7.4/lib/fluent/plugin/out_forward.rb:227:in `configure'

Expected results:

The pod should start and continue to process log to other endpoints

Comment 1 Jeff Cantrill 2020-11-24 13:47:47 UTC
Reviewing it looks like the solution is to set the heartbeat for fluentforward to :none

Comment 4 Qiaoling Tang 2021-01-05 02:18:37 UTC
Verified with clusterlogging.4.7.0-202101020306.p0

Comment 8 errata-xmlrpc 2021-02-24 11:22:27 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Errata Advisory for Openshift Logging 5.0.0), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2021:0652


Note You need to log in before you can comment on or make changes to this bug.