Bugzilla will be upgraded to version 5.0. The upgrade date is tentatively scheduled for 2 December 2018, pending final testing and feedback.
Bug 1468113 - metrics 3.6, metricsPublicURL is only configured to first master with ansible metrics installer
metrics 3.6, metricsPublicURL is only configured to first master with ansible...
Status: CLOSED ERRATA
Product: OpenShift Container Platform
Classification: Red Hat
Component: Installer (Show other bugs)
3.6.0
Unspecified Unspecified
medium Severity medium
: ---
: ---
Assigned To: Kenny Woodson
Junqi Zhao
:
Depends On:
Blocks:
  Show dependency treegraph
 
Reported: 2017-07-06 02:06 EDT by Junqi Zhao
Modified: 2017-08-16 15 EDT (History)
4 users (show)

See Also:
Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of:
Environment:
Last Closed: 2017-08-10 01:29:50 EDT
Type: Bug
Regression: ---
Mount Type: ---
Documentation: ---
CRM:
Verified Versions:
Category: ---
oVirt Team: ---
RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: ---


Attachments (Terms of Use)
ansible inventory file (965 bytes, text/plain)
2017-07-06 02:06 EDT, Junqi Zhao
no flags Details
ansible running log (976.40 KB, text/plain)
2017-07-06 02:08 EDT, Junqi Zhao
no flags Details
ansible log, which don't have log about restart master, master api, master controller (1.54 MB, text/plain)
2017-07-13 02:03 EDT, Junqi Zhao
no flags Details
ansible log, it have log about restart master, master api, master controller (1.65 MB, text/plain)
2017-07-13 02:05 EDT, Junqi Zhao
no flags Details


External Trackers
Tracker ID Priority Status Summary Last Updated
Red Hat Product Errata RHEA-2017:1716 normal SHIPPED_LIVE Red Hat OpenShift Container Platform 3.6 RPM Release Advisory 2017-08-10 05:02:50 EDT

  None (edit)
Description Junqi Zhao 2017-07-06 02:06:47 EDT
Created attachment 1294828 [details]
ansible inventory file

Description of problem:
Same issue as https://bugzilla.redhat.com/show_bug.cgi?id=1465275
metricsPublicURL is configured to first master only with HA masters setup

Version-Release number of selected component (if applicable):
# openshift version
openshift v3.6.135
kubernetes v1.6.1+5115d708d7
etcd 3.2.1

# rpm -qa | grep openshift-ansible
openshift-ansible-playbooks-3.6.135-1.git.0.5533fe3.el7.noarch
openshift-ansible-callback-plugins-3.6.135-1.git.0.5533fe3.el7.noarch
openshift-ansible-roles-3.6.135-1.git.0.5533fe3.el7.noarch
openshift-ansible-docs-3.6.135-1.git.0.5533fe3.el7.noarch
openshift-ansible-filter-plugins-3.6.135-1.git.0.5533fe3.el7.noarch
openshift-ansible-lookup-plugins-3.6.135-1.git.0.5533fe3.el7.noarch
openshift-ansible-3.6.135-1.git.0.5533fe3.el7.noarch


How reproducible:
Always

Steps to Reproduce:
1. Deploy metrics 3.6 on HA env.
ansible-playbook -vvv -i ${inventory_file} playbooks/byo/openshift-cluster/openshift-metrics.yml
2.
3.

Actual results:
metricsPublicURL is configured to first master only

Expected results:
metricsPublicURL should be configured to all masters

Additional info:
Attached ansible inventory file and running log

Description of problem:

Version-Release number of the following components:
rpm -q openshift-ansible
rpm -q ansible
ansible --version

How reproducible:

Steps to Reproduce:
1.
2.
3.

Actual results:
Please include the entire output from the last TASK line through the end of output if an error is generated

Expected results:

Additional info:
Please attach logs from ansible-playbook with the -vvv flag
Comment 1 Junqi Zhao 2017-07-06 02:08 EDT
Created attachment 1294829 [details]
ansible running log
Comment 4 Junqi Zhao 2017-07-11 04:13:24 EDT
Issue is not fixed, it is changed to ON_QA automatically by add this defect to advisory by Errata System.

Testing env:
# openshift version
openshift v3.6.140
kubernetes v1.6.1+5115d708d7
etcd 3.2.1


# rpm -qa | grep openshift-ansible
openshift-ansible-callback-plugins-3.6.140-1.git.0.4a02427.el7.noarch
openshift-ansible-playbooks-3.6.140-1.git.0.4a02427.el7.noarch
openshift-ansible-3.6.140-1.git.0.4a02427.el7.noarch
openshift-ansible-lookup-plugins-3.6.140-1.git.0.4a02427.el7.noarch
openshift-ansible-roles-3.6.140-1.git.0.4a02427.el7.noarch
openshift-ansible-docs-3.6.140-1.git.0.4a02427.el7.noarch
openshift-ansible-filter-plugins-3.6.140-1.git.0.4a02427.el7.noarch
Comment 5 Kenny Woodson 2017-07-11 09:30:48 EDT
The code change is here.  It is not merged as of this morning.

https://github.com/openshift/openshift-ansible/pull/4701
Comment 6 Kenny Woodson 2017-07-11 09:39:30 EDT
The changes have merged now.
Comment 7 Junqi Zhao 2017-07-13 02:02:13 EDT
I found a strange phenomenon, sometimes it only came to step "Adding metrics route information to metricsPublicURL", and then the ansible work stopped without [openshift_metrics : restart master], [openshift_metrics : restart master api],[openshift_metrics : restart master controllers] steps, the metrics route could not be accessed, and showed "Application is not available" error on https://hawkular-metrics.${SUB_DOAMIN}/hawkular/metrics page.

Attached the ansible running log which is this scenario


Most of time, the ansible showed logs about restart master,master api and master controllers,metrics route could be accessed
Comment 8 Junqi Zhao 2017-07-13 02:03 EDT
Created attachment 1297426 [details]
ansible log, which don't have log about restart master, master api, master controller
Comment 9 Junqi Zhao 2017-07-13 02:05 EDT
Created attachment 1297429 [details]
ansible log, it have log about restart master, master api, master controller
Comment 10 Junqi Zhao 2017-07-13 02:11:37 EDT
run the ansible script by using the following commands
# cd /usr/share/ansible/openshift-ansible/
# ansible-playbook -vvv -i ${INVENTORY_FILE}  playbooks/byo/openshift-cluster/openshift-metrics.yml
Comment 12 Scott Dodson 2017-07-17 11:14:05 EDT
Unless we're sure that the playbooks are doing something wrong I think we should treat that as a router error. Does the problem eventually go away with no further action via ansible?
Comment 13 Junqi Zhao 2017-07-19 01:57:34 EDT
Tested today, used the following commands, the strange issue in Comment 7 was not happen today, could find [openshift_metrics : restart master], [openshift_metrics : restart master api],[openshift_metrics : restart master controllers] steps, and metricsPublicURL were configured to all masters.

# cd /usr/share/ansible/openshift-ansible/
# ansible-playbook -vvv -i ${INVENTORY_FILE}  playbooks/byo/openshift-cluster/openshift-metrics.yml

openshift-ansible packages
# rpm -qa | grep openshift-ansible
openshift-ansible-3.6.153-1.git.0.5a6bf7d.el7.noarch
openshift-ansible-roles-3.6.153-1.git.0.5a6bf7d.el7.noarch
openshift-ansible-docs-3.6.153-1.git.0.5a6bf7d.el7.noarch
openshift-ansible-callback-plugins-3.6.153-1.git.0.5a6bf7d.el7.noarch
openshift-ansible-lookup-plugins-3.6.153-1.git.0.5a6bf7d.el7.noarch
openshift-ansible-playbooks-3.6.153-1.git.0.5a6bf7d.el7.noarch
openshift-ansible-filter-plugins-3.6.153-1.git.0.5a6bf7d.el7.noarch
Comment 15 errata-xmlrpc 2017-08-10 01:29:50 EDT
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHEA-2017:1716

Note You need to log in before you can comment on or make changes to this bug.