Bug 1938112 - [RFE] Add Slow Ops alert
Summary: [RFE] Add Slow Ops alert
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat OpenShift Container Storage
Classification: Red Hat Storage
Component: ceph-monitoring
Version: 4.7
Hardware: Unspecified
OS: Unspecified
unspecified
urgent
Target Milestone: ---
: OCS 4.8.0
Assignee: Anmol Sachan
QA Contact: Aman Agrawal
URL:
Whiteboard:
Depends On:
Blocks: 1966139
TreeView+ depends on / blocked
 
Reported: 2021-03-12 08:28 UTC by Anmol Sachan
Modified: 2021-08-03 18:15 UTC (History)
12 users (show)

Fixed In Version: 4.8.0-402.ci
Doc Type: Enhancement
Doc Text:
.Added a new alert to improve notification to the users in case one or more OSD requests are taking a long time to process This alert is important to notify OpenShift Container Storage administrators about the slow operations which can be an indication of extreme load, a slow storage device, or a software bug. Users can check ceph status to find out the cause for slowness.
Clone Of:
: 1966139 (view as bug list)
Environment:
Last Closed: 2021-08-03 18:15:14 UTC
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Github rook rook pull 7417 0 None open ceph: add osd slow ops alert 2021-03-16 10:16:15 UTC
Red Hat Bugzilla 1885441 1 high CLOSED mgr/prometheus should provide a metric indicating SLOW_OPS for alerting 2021-04-28 20:13:04 UTC
Red Hat Product Errata RHBA-2021:3003 0 None None None 2021-08-03 18:15:46 UTC

Comment 9 Mudit Agarwal 2021-03-16 08:59:53 UTC
Elad, given that this BZ is there to add the alert only can we please provide qa_ack?

Comment 20 Martin Bukatovic 2021-06-23 09:54:40 UTC
For reference, name of the new alert is CephOSDSlowOps

Comment 26 Olive Lakra 2021-07-09 04:27:21 UTC
@Mudit - Please review the revised doc text and share feedback

Comment 27 Mudit Agarwal 2021-07-12 06:21:04 UTC

.Added a new alert(CephOSDSlowOps) to improve notification to the users in case one or more OSD requests are taking a long time to process 
This alert is important to notify OpenShift Container Storage administrators about the slow operations which can be an indication of extreme load, a slow storage device, or a software bug.
Users can check ceph status to find out the cause for slowness.

Comment 30 errata-xmlrpc 2021-08-03 18:15:14 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Red Hat OpenShift Container Storage 4.8.0 container images bug fix and enhancement update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2021:3003


Note You need to log in before you can comment on or make changes to this bug.