1881082 – etcd raft term is not available through metrics

Bug 1881082 - etcd raft term is not available through metrics

Summary: etcd raft term is not available through metrics

Keywords:
Status:	CLOSED ERRATA
Alias:	None
Product:	OpenShift Container Platform
Classification:	Red Hat
Component:	Etcd
Sub Component:
Version:	4.6
Hardware:	Unspecified
OS:	Unspecified
Priority:	medium
Severity:	medium
Target Milestone:	---
Target Release:	4.6.0
Assignee:	Sam Batschelet
QA Contact:	ge liu
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:	1883268
TreeView+	depends on / blocked

Reported:	2020-09-21 14:04 UTC by Dan Mace
Modified:	2023-09-14 06:08 UTC (History)
CC List:	0 users
Fixed In Version:
Doc Type:	If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed:	2020-10-27 16:43:33 UTC
Target Upstream Version:
Embargoed:

Attachments	(Terms of Use)

Links
System	ID	Priority	Status	Summary	Last Updated
Github	openshift cluster-etcd-operator pull 444	None	closed	Bug 1881082: Expose etcd raft term as a metric	2020-11-17 07:04:11 UTC
Github	openshift cluster-etcd-operator pull 451	None	closed	Bug 1881082: remove erroneously specified label drop rules	2020-11-17 07:03:51 UTC
Red Hat Product Errata	RHBA-2020:4196	None	None	None	2020-10-27 16:43:54 UTC

Description Dan Mace 2020-09-21 14:04:38 UTC

Description of problem:

etcd does not currently expose the raft term through prometheus metrics. This limits our ability to do granular post-mortem performance analysis using the CI data we have available to us. The etcd operator could itself re-expose this metric point so that we have more information to inform our decisions.

Version-Release number of selected component (if applicable):


How reproducible:


Steps to Reproduce:
1.
2.
3.

Actual results:


Expected results:


Additional info:

Comment 4 Dan Mace 2020-09-28 13:33:38 UTC

Ge,

Good catch!

You can see the operator exposing the metrics with:

    $ oc exec --namespace openshift-etcd-operator deployments/etcd-operator -c etcd-operator -- /bin/bash -c 'curl -k -H "Authorization: Bearer $(cat /var/run/secrets/kubernetes.io/serviceaccount/token)" https://localhost:8443/metrics'

But it looks like we have an error in the etcd-operator scrape configuration causing the metrics to be dropped during collection. I've opened https://github.com/openshift/cluster-etcd-operator/pull/451 to fix the issue.

Comment 9 errata-xmlrpc 2020-10-27 16:43:33 UTC

Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (OpenShift Container Platform 4.6 GA Images), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2020:4196

Comment 10 Red Hat Bugzilla 2023-09-14 06:08:39 UTC

The needinfo request[s] on this closed bug have been removed as they have been unresolved for 1000 days

Note You need to log in before you can comment on or make changes to this bug.