Bug 2274675

Summary: CSI Pods going into OOM during volume group restore
Product: [Red Hat Storage] Red Hat OpenShift Data Foundation Reporter: Leela Venkaiah Gangavarapu <lgangava>
Component: ocs-client-operatorAssignee: Leela Venkaiah Gangavarapu <lgangava>
Status: CLOSED ERRATA QA Contact: Jilju Joy <jijoy>
Severity: unspecified Docs Contact:
Priority: unspecified    
Version: 4.14CC: kramdoss, mrajanna, muagarwa, nberry, odf-bz-bot, omitrani, resoni
Target Milestone: ---   
Target Release: ODF 4.14.8   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard: isf-provider
Fixed In Version: 4.14.8-3 Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of:
: 2274676 (view as bug list) Environment:
Last Closed: 2024-06-12 07:38:58 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 2274676    

Description Leela Venkaiah Gangavarapu 2024-04-12 06:53:22 UTC
In internal mode CSI Pods doesn't have any resource limits set probably due to the proportional requirement wrt increase in PV requirement and provider mode also should remove the limits set to match internal mode.

Tracker for https://github.ibm.com/ProjectAbell/abell-tracking/issues/34944

Comment 7 Jilju Joy 2024-06-05 14:23:31 UTC
Verified in version:

% oc get csv                     
NAME                                    DISPLAY                            VERSION        REPLACES                                PHASE
ocs-client-operator.v4.14.8-rhodf       OpenShift Data Foundation Client   4.14.8-rhodf   ocs-client-operator.v4.14.7-rhodf       Succeeded
odf-csi-addons-operator.v4.14.8-rhodf   CSI Addons                         4.14.8-rhodf   odf-csi-addons-operator.v4.14.7-rhodf   Succeeded

% oc get catsrc -n ocs-catalogsource -n openshift-marketplace -o yaml | grep image
    image: quay.io/rhceph-dev/ocs-registry:4.14.8-6

OCP version is 4.16.0-rc.3

Verified on client cluster in a provider-client setup.

Based on the discussions, the verification step is to verify that the resources limit and request is removed.


% oc get deployment csi-rbdplugin-provisioner -oyaml | yq '.spec .template .spec .containers[] | [.name, .resources]';
- csi-provisioner
- {}
- csi-attacher
- {}
- csi-resizer
- {}
- csi-snapshotter
- {}
- csi-rbdplugin
- {}


% oc get deployment csi-cephfsplugin-provisioner -oyaml | yq '.spec .template .spec .containers[] | [.name, .resources]';
- csi-provisioner
- {}
- csi-attacher
- {}
- csi-resizer
- {}
- csi-snapshotter
- {}
- csi-cephfsplugin
- {}


% oc get daemonset csi-cephfsplugin -oyaml | yq '.spec .template .spec .containers[] | [.name, .resources]'
- csi-driver-registrar
- {}
- csi-cephfsplugin
- {}

% oc get daemonset csi-rbdplugin -oyaml | yq '.spec .template .spec .containers[] | [.name, .resources]'
- csi-driver-registrar
- {}
- csi-rbdplugin
- {}
- csi-addons
- {}

Comment 12 errata-xmlrpc 2024-06-12 07:38:58 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Red Hat OpenShift Data Foundation 4.14.8 Bug Fix Update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2024:3861