Bug 1868922 - [Docs] Document debugging cluster performance issues with Prometheus
Summary: [Docs] Document debugging cluster performance issues with Prometheus
Keywords:
Status: CLOSED DEFERRED
Alias: None
Product: OpenShift Container Platform
Classification: Red Hat
Component: Documentation
Version: 4.6
Hardware: Unspecified
OS: Unspecified
high
high
Target Milestone: ---
: 4.6.z
Assignee: Tami Love
QA Contact: ge liu
Latha S
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2020-08-14 11:19 UTC by oarribas
Modified: 2024-03-25 16:18 UTC (History)
8 users (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2023-03-09 00:58:58 UTC
Target Upstream Version:
Embargoed:
oarribas: needinfo-


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Red Hat Bugzilla 1790551 0 medium CLOSED Etcd storage requirements 2024-01-06 04:27:31 UTC
Red Hat Bugzilla 1853618 0 high CLOSED [DDF] Add latency recommendations between the different kinds of servers, specifically for masters. 2024-03-25 16:08:25 UTC
Red Hat Bugzilla 1868343 0 unspecified CLOSED Provide working solution (toolbox or dd) to test etcd performance and document it well. 2023-12-15 18:49:27 UTC
Red Hat Knowledge Base (Solution) 4770281 0 None None None 2021-01-18 18:20:15 UTC
Red Hat Knowledge Base (Solution) 4885641 0 None None None 2020-08-14 11:19:33 UTC
Red Hat Knowledge Base (Solution) 4929021 0 None None None 2020-08-14 11:19:33 UTC
Red Hat Knowledge Base (Solution) 5013531 0 None None None 2020-09-01 16:36:21 UTC
Red Hat Knowledge Base (Solution) 5060021 0 None None None 2020-10-07 08:44:18 UTC
Red Hat Knowledge Base (Solution) 5141931 0 None None None 2020-09-01 16:36:21 UTC

Internal Links: 1790551

Description oarribas 2020-08-14 11:19:34 UTC
Description of problem:
Provide a supported tool to test ETCD performance, as bad performance of ETCD is a common cause of issues in OpenShift clusters.

Currently, there are KCS to test the ETCD disk performance using fio [1] [2], but the tool is not included in any supported image, and also not supported to use the fedora image.


Version-Release number of selected component (if applicable):
All OCP 4 versions, including 4.5


How reproducible:
See [1] and [2]


Actual results:
No supported tool to test ETCD performance


Expected results:
Documented and supported tools to test ETCD performance



[1] https://access.redhat.com/solutions/4885641
[2] https://access.redhat.com/solutions/4929021

Comment 14 Shubha Narayanan 2021-09-01 10:06:25 UTC
@oarribas - Is this bug still valid. Looks like all the fixes have been tracked through another bug that's already closed https://bugzilla.redhat.com/show_bug.cgi?id=1853618 . Kindly confirm.

Comment 16 Shubha Narayanan 2021-09-15 04:06:41 UTC
As per SME, the debugging steps for all etcd issues are available in etcd runbook. Here's the link: https://github.com/openshift/runbooks/tree/master/alerts/cluster-etcd-operator. Since we do not link or replicate the content available in runbook, closing this bug.

Comment 25 Shiftzilla 2023-03-09 00:58:58 UTC
OpenShift has moved to Jira for its defect tracking! This bug can now be found in the OCPBUGS project in Jira.

https://issues.redhat.com/browse/OCPBUGS-8791

Comment 26 Red Hat Bugzilla 2023-09-18 00:21:58 UTC
The needinfo request[s] on this closed bug have been removed as they have been unresolved for 120 days


Note You need to log in before you can comment on or make changes to this bug.