Bug 1464024
Summary: | [trello 1qpV9jcS] fluentd pods still connecting to master api servers after deploying with openshift_logging_use_mux=true | ||||||||
---|---|---|---|---|---|---|---|---|---|
Product: | OpenShift Container Platform | Reporter: | Mike Fiedler <mifiedle> | ||||||
Component: | Logging | Assignee: | Rich Megginson <rmeggins> | ||||||
Status: | CLOSED ERRATA | QA Contact: | Xia Zhao <xiazhao> | ||||||
Severity: | high | Docs Contact: | |||||||
Priority: | unspecified | ||||||||
Version: | 3.6.0 | CC: | aos-bugs, jcantril, jeder, juzhao, pportant | ||||||
Target Milestone: | --- | ||||||||
Target Release: | 3.6.z | ||||||||
Hardware: | Unspecified | ||||||||
OS: | Unspecified | ||||||||
Whiteboard: | aos-scalability-36 | ||||||||
Fixed In Version: | Doc Type: | Bug Fix | |||||||
Doc Text: |
Cause: Fluentd was not removing the kubernetes metadata filter configuration when being used as a mux client.
Consequence: Fluentd was still opening connections to the OpenShift API server.
Fix: Make sure to remove the kubernetes metadata filter configuration file when Fluentd is being used as a mux client.
Result: No connection from Fluentd running as a mux client to the OpenShift API server.
|
Story Points: | --- | ||||||
Clone Of: | Environment: | ||||||||
Last Closed: | 2017-10-25 13:02:19 UTC | Type: | Bug | ||||||
Regression: | --- | Mount Type: | --- | ||||||
Documentation: | --- | CRM: | |||||||
Verified Versions: | Category: | --- | |||||||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||||
Cloudforms Team: | --- | Target Upstream Version: | |||||||
Embargoed: | |||||||||
Attachments: |
|
Description
Mike Fiedler
2017-06-22 09:57:26 UTC
pretty sure this is just because we haven't updated all of the downstream images yet and/or openshift-ansible 3.6 rpm packages Removing TestBlocker since the workaround in comment 4 works ok. Should also ba able to clone the openshift-ansible PR branch while the merge is pending. I will try that today. Merged https://github.com/openshift/openshift-ansible/commit/9613d2e517ced0bc5d165801df3442ab331d214c Will be fixed downstream when there is a new build of openshift-ansible for 3.6 with this fix @mifiedle The way in comment #8 didn't work when I want to verify this issue, here is the detailed step, could you help to take a look? 1. have logging + mux deployed in -n logging: # oc get po NAME READY STATUS RESTARTS AGE logging-curator-1-hn64k 1/1 Running 0 2h logging-es-data-master-whsfhhon-1-cww70 1/1 Running 0 2h logging-fluentd-7dvhq 1/1 Running 0 2h logging-fluentd-n74kv 1/1 Running 0 2h logging-kibana-1-btk71 2/2 Running 0 2h logging-mux-1-v286l 1/1 Running 0 2h 2. login kibana UI --> log entries presented fine there 3. Attempt to verify bug fix, none of the commands able to give output: [root@host-8-175-70 ~]# oc exec logging-fluentd-7dvhq -- ss -tnpi State Recv-Q Send-Q Local Address:Port Peer Address:Port [root@host-8-175-70 ~]# oc exec logging-fluentd-7dvhq -- ss -tnpi State Recv-Q Send-Q Local Address:Port Peer Address:Port [root@host-8-175-70 ~]# oc exec logging-fluentd-n74kv -- ss -tnpi State Recv-Q Send-Q Local Address:Port Peer Address:Port [root@host-8-175-70 ~]# oc exec logging-mux-1-v286l -- ss -tnpi Cannot open netlink socket: Permission denied State Recv-Q Send-Q Local Address:Port Peer Address:Port Test env: # openshift version openshift v3.6.133 kubernetes v1.6.1+5115d708d7 etcd 3.2.1 # rpm -qa | grep ansible openshift-ansible-callback-plugins-3.6.133-1.git.0.950bb48.el7.noarch openshift-ansible-docs-3.6.133-1.git.0.950bb48.el7.noarch openshift-ansible-lookup-plugins-3.6.133-1.git.0.950bb48.el7.noarch openshift-ansible-filter-plugins-3.6.133-1.git.0.950bb48.el7.noarch openshift-ansible-playbooks-3.6.133-1.git.0.950bb48.el7.noarch ansible-2.2.3.0-1.el7.noarch openshift-ansible-3.6.133-1.git.0.950bb48.el7.noarch openshift-ansible-roles-3.6.133-1.git.0.950bb48.el7.noarch @Rich, Seems the fix is not in the above ansible packages, attached the fluentd daemonset, there is no USE_MUX_CLIENT exist. Created attachment 1294441 [details]
fluentd_daemonset_with_openshift-ansible-playbooks-3.6.133-1.git.0.950bb48.el7.noarch
@xiazhao The PR still hasn't merged. See https://github.com/openshift/openshift-ansible/pull/4554. The only way to test mux right now is to use the workaround. merged upstream: https://github.com/openshift/openshift-ansible/commit/01f91dfe6257dc1df73df4e12ccd8db899369d27 awaiting new downstream package . . . openshift-ansible-3.6.136-1.git.0.ac6bb62.el7 Verified on openshift-ansible 3.6.136 rpm install. Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2017:3049 |