Bug 2030364

Summary: Shared resource CSI driver monitoring is not setup correctly
Product: OpenShift Container Platform Reporter: Alice Rum <irum>
Component: StorageAssignee: Alice Rum <irum>
Storage sub component: Shared Resource CSI Driver QA Contact: Priti Kumari <pkumari>
Status: CLOSED ERRATA Docs Contact:
Severity: high    
Priority: unspecified CC: aos-bugs, rvanderp
Version: 4.10   
Target Milestone: ---   
Target Release: 4.10.0   
Hardware: All   
OS: All   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2022-03-10 16:32:46 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Alice Rum 2021-12-08 15:01:19 UTC
Description of problem:

The way CSO sets up the csi shared resource operator is such that monitoring metrics are not gathered by the prometheus server. Container port is not set up at all and TLS secrets are not mounted into the operator.

Version-Release number of selected component (if applicable):

How reproducible:
always

Steps to Reproduce:
1. log into the web console of the OpenShift cluster
2. observe -> metrics
3. Try to look for openshift_csi_share_configmap_total metric.
4. oc exec <share resource operator pod> curl http://localhost:6000/metrics


Actual results:
In the 4th step metric should be there being output by the metric server, but in the 3rd step it is not in the prometheus.

Expected results:
Metric is available in both places.

Master Log:

Node Log (of failed PODs):

PV Dump:

PVC Dump:

StorageClass Dump (if StorageClass used by PV/PVC):

Additional info:

Comment 2 Priti Kumari 2021-12-16 09:28:52 UTC
I have verified the metrics using console and the csi metrics are available.

1. openshift_csi_share_secret
2. openshift_csi_share_configmap
3. openshift_csi_share_mount_failures_total
4. openshift_csi_share_mount_requests_total

Comment 3 Gabe Montero 2021-12-16 22:48:38 UTC
*** Bug 2033057 has been marked as a duplicate of this bug. ***

Comment 7 errata-xmlrpc 2022-03-10 16:32:46 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Moderate: OpenShift Container Platform 4.10.3 security update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2022:0056