2257634 – Add Runbooks for ODF alerts - no links, wrong text, sublinks not work

Bug 2257634 - Add Runbooks for ODF alerts - no links, wrong text, sublinks not work

Summary: Add Runbooks for ODF alerts - no links, wrong text, sublinks not work

Keywords:
Status:	CLOSED ERRATA
Alias:	None
Product:	Red Hat OpenShift Data Foundation
Classification:	Red Hat Storage
Component:	ceph-monitoring
Sub Component:
Version:	4.15
Hardware:	Unspecified
OS:	Unspecified
Priority:	unspecified
Severity:	high
Target Milestone:	---
Target Release:	ODF 4.15.0
Assignee:	Divyansh Kamboj
QA Contact:	Daniel Osypenko
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+	depends on / blocked

Reported:	2024-01-10 11:03 UTC by Daniel Osypenko
Modified:	2024-03-19 15:31 UTC (History)
CC List:	8 users (show)
Fixed In Version:	4.15.0-130
Doc Type:	No Doc Update
Doc Text:
Clone Of:
Environment:
Last Closed:	2024-03-19 15:31:01 UTC
Embargoed:

Attachments	(Terms of Use)

Links
System	ID	Priority	Status	Summary	Last Updated
Github	openshift runbooks pull 161	None	open	Update CephOSDNearFull.md to include must gather	2024-01-23 10:02:26 UTC
Github	openshift runbooks pull 163	None	open	Some minor corrections in the alerts documentation	2024-01-23 10:02:10 UTC
Github	red-hat-storage ocs-operator pull 2388	None	open	Bug 2257634: [release-4.15] Add 'runbook_url' link to alerts' annotation	2024-01-16 12:57:50 UTC
Github	red-hat-storage ocs-operator pull 2389	None	open	Bug 2257634: [release-4.15] add runbooks links for ceph rules	2024-01-17 05:33:25 UTC
Github	red-hat-storage ocs-operator pull 2391	None	open	Bug 2257634: [release-4.15] Fixed wrong runbook link for CephMonLowNumber alert	2024-01-17 10:53:40 UTC
Red Hat Product Errata	RHSA-2024:1383	None	None	None	2024-03-19 15:31:05 UTC

Description Daniel Osypenko 2024-01-10 11:03:36 UTC

Description of problem (please be detailed as possible and provide log
snippests):

Multiple inconsistencies, some alerts from the list (see comments section https://issues.redhat.com/browse/RHSTOR-3613) are not covered 


Version of all relevant components (if applicable):
ODF 4.15.0-110.stable

Does this issue impact your ability to continue to work with the product
(please explain in detail what is the user impact)?
yes, if the cluster hit Alert 

Is there any workaround available to the best of your knowledge?
search relevant info in Internet

Rate from 1 - 5 the complexity of the scenario you performed that caused this
bug (1 - very simple, 5 - very complex)?
1

Can this issue reproducible?
yes

Can this issue reproduce from the UI?
yes

If this is a regression, please provide more details to justify this:
new feature

Steps to Reproduce:
step #1 login to management console
step #2 edit a Prometheus alert rule making the rule violated on healthy cluster
step #3 wait until alert appears on management console dashboard, open it
step #4 click on the link with hitten problem exlanation and solution tips. Text should be clear and correct in regards of the Alert
step #5 repeat for every alert from the Alert list

Alerts List:

CephClusterErrorState
CephClusterWarningState
CephOSDVersionMismatch
PersistentVolumeUsageCritical
CephClusterCriticallyFull
CephClusterNearFull
CephClusterReadOnly
CephMonVersionMismatch
CephPoolQuotaBytesCriticallyExhausted
CephPoolQuotaBytesNearExhaustion
CephMgrIsAbsent
CephMgrIsMissingReplicas
CephMdsMissingReplicas
CephMonQuorumAtRisk
CephMonQuorumLost
CephMonHighNumberOfLeaderChanges
CephNodeDown
CephOSDCriticallyFull
ObcQuotaObjectsAlert
ObcQuotaBytesExhausedAlert
ObcQuotaObjectsExhausedAlert
ClusterObjectStoreState
KMSServerConnectionAlert
ODFRBDClientBlocked
OdfMirrorDaemonStatus
OdfPoolMirroringImageHealth
CephOSDFlapping
CephOSDNearFull
CephOSDDiskNotResponding
CephOSDDiskUnavailable
CephOSDSlowOps
CephDataRecoveryTakingTooLong
CephPGRepairTakingTooLong
PersistentVolumeUsageNearFull
ODFPersistentVolumeMirrorStatus
ObcQuotaBytesAlert"


Actual results:
https://docs.google.com/spreadsheets/d/1Gdx9nn4LNMMe-z-4TWFNDA_x3GN48Q6Rlk5yttjQ2MY/edit?usp=sharing

Expected results:
all Runbook links work, text is relevant, sublinks work

Additional info:
https://issues.redhat.com/browse/RHSTOR-3613

Comment 5 arun kumar mohan 2024-01-16 12:10:10 UTC

One more PR (in addition to comment#2 list): https://github.com/red-hat-storage/ocs-operator/pull/2321

Comment 6 arun kumar mohan 2024-01-16 12:24:40 UTC

Missed PR: https://github.com/openshift/runbooks/pull/158

Comment 9 Daniel Osypenko 2024-01-21 15:41:42 UTC

Validated on ODF 4.15.0-120.stable

Updated the spreadsheet https://url.corp.redhat.com/spreadsheet with new comments 

There are 3 Articles having sublinks leading to dev-preview articles
1 Article having TODO section
1 Article with minor issue on must-gather

Comment 10 Daniel Osypenko 2024-02-06 10:41:44 UTC

checked
spreadsheet updated https://docs.google.com/spreadsheets/d/1Gdx9nn4LNMMe-z-4TWFNDA_x3GN48Q6Rlk5yttjQ2MY/edit#gid=0

Comment 12 errata-xmlrpc 2024-03-19 15:31:01 UTC

Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Important: Red Hat OpenShift Data Foundation 4.15.0 security, enhancement, & bug fix update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2024:1383

Note You need to log in before you can comment on or make changes to this bug.