
Bug 2186738

Summary: [CEE/sd][ceph-monitoring][node-exporter] node-exporter on a fresh installation is crashing due to `panic: "node_rapl_package-0-die-0_joules_total" is not a valid metric name`
Product: [Red Hat Storage] Red Hat Ceph Storage
Reporter: Tridibesh Chakraborty <trchakra>
Component: Cephadm
Assignee: Nizamudeen <nia>
Status: CLOSED ERRATA
QA Contact: Mohit Bisht <mobisht>
Severity: medium
Docs Contact:
Priority: unspecified
Version: 5.3
CC: adking, ceph-eng-bugs, cephqe-warriors, kdreyer, ktdreyer, mobisht, nia, tserlin, vereddy
Target Milestone: ---
Target Release: 6.1
Hardware: x86_64
OS: Linux
Whiteboard:
Fixed In Version: ceph-17.2.6-64.el9cp
Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of:
Environment:
Last Closed: 2023-06-15 09:17:19 UTC
Type: Bug
Regression: ---
Mount Type: ---
Documentation: ---
CRM:
Verified Versions:
Category: ---
oVirt Team: ---
RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: ---
Target Upstream Version:
Embargoed:

Description Tridibesh Chakraborty 2023-04-14 10:16:28 UTC
Description of problem:
On a fresh RHCS 5.3z1 installation, node-exporter crashes as soon as it is deployed, with the error `panic: "node_rapl_package-0-die-0_joules_total" is not a valid metric name`

Version-Release number of selected component (if applicable):
RHCS 5.3z1: 16.2.10-138.el8cp

How reproducible:
Specific to the customer environment.

Steps to Reproduce:
1. Bootstrap an RHCS 5.3z1 cluster on a server with an AMD EPYC 7301 16-Core Processor
2. Deploy the monitoring stack using a service configuration file
3. As soon as Prometheus comes up, node-exporter crashes

Actual results:
node-exporter crashes with the panic shown above

Expected results:
node-exporter should start and keep running

Additional info:
The customer is using an AMD EPYC 7301 processor. The upstream node_exporter issue linked below appears to describe the same problem: it was reported against node-exporter 1.3.1, which is the version deployed here.
 
https://github.com/prometheus/node_exporter/issues/2299
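
For reference, the panic can be explained without the affected hardware. The classic Prometheus data model requires metric names to match `[a-zA-Z_:][a-zA-Z0-9_:]*`, and the name in the panic contains hyphens (`package-0-die-0`). The Python sketch below only illustrates that rule. It is not node-exporter code, and `sanitize_metric_name` is a hypothetical helper showing one possible normalization, not necessarily what the upstream or downstream fix does.

```python
import re

# Classic Prometheus metric-name rule: first character is a letter,
# underscore or colon; the rest may also include digits.
METRIC_NAME_RE = re.compile(r"[a-zA-Z_:][a-zA-Z0-9_:]*")


def is_valid_metric_name(name: str) -> bool:
    """Return True if the whole name matches the classic rule."""
    return METRIC_NAME_RE.fullmatch(name) is not None


def sanitize_metric_name(name: str) -> str:
    """Hypothetical helper: replace every disallowed character with '_'."""
    cleaned = re.sub(r"[^a-zA-Z0-9_:]", "_", name)
    # A leading digit is also disallowed, so prefix it with an underscore.
    if cleaned and cleaned[0].isdigit():
        cleaned = "_" + cleaned
    return cleaned


# Metric name taken from the panic message in this bug.
bad_name = "node_rapl_package-0-die-0_joules_total"

print(is_valid_metric_name(bad_name))
print(sanitize_metric_name(bad_name))
print(is_valid_metric_name(sanitize_metric_name(bad_name)))
```

The hyphens appear to come from the RAPL zone name reported on this processor, which would explain why the crash is specific to the customer environment.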

Comment 26 errata-xmlrpc 2023-06-15 09:17:19 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Moderate: Red Hat Ceph Storage 6.1 security and bug fix update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2023:3623