Bug 2186738 - [CEE/sd][ceph-monitoring][node-exporter] node-exporter on a fresh installation is crashing due to `panic: "node_rapl_package-0-die-0_joules_total" is not a valid metric name`
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Ceph Storage
Classification: Red Hat Storage
Component: Cephadm
Version: 5.3
Hardware: x86_64
OS: Linux
Priority: unspecified
Severity: medium
Target Milestone: ---
Target Release: 6.1
Assignee: Nizamudeen
QA Contact: Mohit Bisht
URL:
Whiteboard:
Depends On:
Blocks:
Reported: 2023-04-14 10:16 UTC by Tridibesh Chakraborty
Modified: 2023-06-15 09:17 UTC (History)

Fixed In Version: ceph-17.2.6-64.el9cp
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2023-06-15 09:17:19 UTC
Embargoed:


Links
System ID Private Priority Status Summary Last Updated
Red Hat Issue Tracker RHCEPH-6457 0 None None None 2023-04-14 10:17:50 UTC
Red Hat Knowledge Base (Solution) 7012799 0 None None None 2023-05-12 04:46:45 UTC

Description Tridibesh Chakraborty 2023-04-14 10:16:28 UTC
Description of problem:
node-exporter on a fresh RHCS 5.3z1 installation crashes as soon as it is deployed, with the error `panic: "node_rapl_package-0-die-0_joules_total" is not a valid metric name`

Version-Release number of selected component (if applicable):
RHCS 5.3z1: 16.2.10-138.el8cp

How reproducible:
Specific to the customer environment

Steps to Reproduce:
1. Bootstrap an RHCS 5.3z1 cluster on a server with an AMD EPYC 7301 16-Core Processor
2. Deploy the monitoring stack using a service configuration file
3. As soon as prometheus comes up, node-exporter crashes

Actual results:
Node-exporter is crashing

Expected results:
node-exporter should be running 

Additional info:
The customer is using an AMD EPYC 7301 processor. The upstream node_exporter issue below looks like the same problem: we are running node-exporter 1.3.1, the same version reported in that tracker.
 
https://github.com/prometheus/node_exporter/issues/2299
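
For reference, the panic message is consistent with the Prometheus metric naming rule, under which a metric name must match `[a-zA-Z_:][a-zA-Z0-9_:]*`. The name in the panic contains hyphens, which fall outside that set. The following is a minimal Python sketch of that check, not the node-exporter code path itself. The helper names `is_valid_metric_name`, `invalid_chars` and `sanitize_metric_name` are hypothetical, and replacing disallowed characters with `_` is only an illustration, not necessarily how the fix in the errata handles it.

```python
import re

# Metric name rule from the Prometheus data model.
METRIC_NAME_RE = re.compile(r"[a-zA-Z_:][a-zA-Z0-9_:]*")


def is_valid_metric_name(name: str) -> bool:
    """Return True if the whole string matches the metric name rule."""
    return METRIC_NAME_RE.fullmatch(name) is not None


def invalid_chars(name: str) -> list:
    """Return the distinct characters in `name` that the rule does not allow."""
    return sorted(set(re.findall(r"[^a-zA-Z0-9_:]", name)))


def sanitize_metric_name(name: str) -> str:
    """Hypothetical helper: replace each disallowed character with '_'.

    This is an assumption made for illustration only.
    """
    cleaned = re.sub(r"[^a-zA-Z0-9_:]", "_", name)
    # A metric name may not start with a digit, so prefix one if needed.
    if cleaned and cleaned[0].isdigit():
        cleaned = "_" + cleaned
    return cleaned


if __name__ == "__main__":
    # Name taken verbatim from the panic message in this bug.
    failing = "node_rapl_package-0-die-0_joules_total"
    print(is_valid_metric_name(failing))
    print(invalid_chars(failing))
    print(sanitize_metric_name(failing))
```

Running the check against the name from the panic shows that the hyphens are the only offending characters, which matches the RAPL zone naming (`package-0-die-0`) seen on this AMD EPYC system.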

Comment 26 errata-xmlrpc 2023-06-15 09:17:19 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Moderate: Red Hat Ceph Storage 6.1 security and bug fix update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2023:3623

