RHEL Engineering is moving the tracking of its product development work on RHEL 6 through RHEL 9 to Red Hat Jira (issues.redhat.com). If you're a Red Hat customer, please continue to file support cases via the Red Hat customer portal. If you're not, please head to the "RHEL project" in Red Hat Jira and file new tickets here. Individual Bugzilla bugs in the statuses "NEW", "ASSIGNED", and "POST" are being migrated throughout September 2023. Bugs of Red Hat partners with an assigned Engineering Partner Manager (EPM) are migrated in late September as per pre-agreed dates. Bugs against components "kernel", "kernel-rt", and "kpatch" are only migrated if still in "NEW" or "ASSIGNED". If you cannot log in to RH Jira, please consult article #7032570. That failing, please send an e-mail to the RH Jira admins at rh-issues@redhat.com to troubleshoot your issue as a user management inquiry. The email creates a ServiceNow ticket with Red Hat. Individual Bugzilla bugs that are migrated will be moved to status "CLOSED", resolution "MIGRATED", and set with "MigratedToJIRA" in "Keywords". The link to the successor Jira issue will be found under "Links", have a little "two-footprint" icon next to it, and direct you to the "RHEL project" in Red Hat Jira (issue links are of type "https://issues.redhat.com/browse/RHEL-XXXX", where "X" is a digit). This same link will be available in a blue banner at the top of the page informing you that that bug has been migrated.
Bug 1984150 - metrics role: Grafana dashboard not working after metrics role run unless services manually restarted
Summary: metrics role: Grafana dashboard not working after metrics role run unless ser...
Keywords:
Status: CLOSED CURRENTRELEASE
Alias: None
Product: Red Hat Enterprise Linux 9
Classification: Red Hat
Component: rhel-system-roles
Version: 9.0
Hardware: Unspecified
OS: Unspecified
unspecified
unspecified
Target Milestone: beta
: 9.0 Beta
Assignee: Rich Megginson
QA Contact: Jan Kurik
URL:
Whiteboard: role:metrics
Depends On: 1978357
Blocks:
TreeView+ depends on / blocked
 
Reported: 2021-07-20 20:01 UTC by Rich Megginson
Modified: 2022-08-02 18:06 UTC (History)
10 users (show)

Fixed In Version: rhel-system-roles-1.7.5-1.el9
Doc Type: Bug Fix
Doc Text:
Clone Of: 1978357
Environment:
Last Closed: 2021-12-07 22:04:56 UTC
Type: Bug
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)

Description Rich Megginson 2021-07-20 20:01:50 UTC
+++ This bug was initially created as a clone of Bug #1978357 +++

Description of problem:
If I run the metrics role to setup a grafana host and server clients to be monitored, the playbook completes successfully, but when I pull up the "PCP Redis: Host Overview" dashboard in Grafana I get these error messages:

Templating [host]
Error updating options: timeout while connecting to 'http://localhost:44322/series/labels?names=hostname'
Error
timeout while connecting to 'http://localhost:44322/series/labels?names=hostname'


Version-Release number of selected component (if applicable):
$ rpm -qa | egrep "rhel-system-roles|pcp|grafana"
pcp-conf-5.2.5-4.el8.x86_64
pcp-pmda-nfsclient-5.2.5-4.el8.x86_64
pcp-zeroconf-5.2.5-4.el8.x86_64
grafana-pcp-3.0.2-1.el8.x86_64
pcp-doc-5.2.5-4.el8.noarch
pcp-libs-5.2.5-4.el8.x86_64
python3-pcp-5.2.5-4.el8.x86_64
pcp-pmda-openmetrics-5.2.5-4.el8.x86_64
pcp-pmda-dm-5.2.5-4.el8.x86_64
grafana-7.3.6-2.el8.x86_64
rhel-system-roles-1.0.1-1.el8.noarch
pcp-selinux-5.2.5-4.el8.x86_64
pcp-5.2.5-4.el8.x86_64
pcp-system-tools-5.2.5-4.el8.x86_64


How reproducible:
Every time


Steps to Reproduce:
1. Start with 5 newly built RHEL hosts
2. Manually apply fix for https://github.com/linux-system-roles/metrics/pull/87
3. Run metrics role to setup PCP on the hosts and setup Grafana on one of the hosts
4. Import the "PCP Redis: Host Overview" dashboard in Grafana
5. View the "PCP Redis: Host Overview" dashboard

Actual results:
Templating [host]
Error updating options: timeout while connecting to 'http://localhost:44322/series/labels?names=hostname'
Error
timeout while connecting to 'http://localhost:44322/series/labels?names=hostname'


Expected results:
Dashboard works 


Additional info:
If I run "sudo systemctl restart pmcd pmlogger pmproxy" on the grafana host after the metrics role completes, everything works as expected.

...

--- Additional comment from Brian Smith on 2021-07-20 15:36:20 UTC ---

Hi Nathan,
I did some more testing this morning.  With the https://github.com/linux-system-roles/metrics/commit/06f9aa59739cb421056a30ee3edb5276650943cd patch and also moving the firewall tasks to the top of the playbook, everything worked properly on two runs of the playbook (restoring the VM's back to a clean snapshot each time).  

The https://github.com/linux-system-roles/metrics/commit/06f9aa59739cb421056a30ee3edb5276650943cd is definitely needed.  I tried running the playbook with the firewall tasks at the top of the playbook, and without this patch, and ran in to the original issue in the description of the BZ.  

I'm working on getting the blog post updated so that it shows the firewall tasks at the top of the playbook, and will also still have the workaround of restarting pmlogger and pmproxy to workaround this BZ.  

Thanks for your help getting this resolved!


Note You need to log in before you can comment on or make changes to this bug.