Bug 2215021

Summary: [GSS][ODF 4.12] Failing to respond to cache pressure
Product: [Red Hat Storage] Red Hat OpenShift Data Foundation Reporter: Rafrojas <rafrojas>
Component: cephAssignee: Kotresh HR <khiremat>
ceph sub component: CephFS QA Contact: Elad <ebenahar>
Status: NEW --- Docs Contact:
Severity: medium    
Priority: unspecified CC: bniver, khiremat, mparida, muagarwa, odf-bz-bot, sostapov, tnielsen, vshankar
Version: 4.12   
Target Milestone: ---   
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Rafrojas 2023-06-14 13:33:47 UTC
Description of problem (please be detailed as possible and provide log
snippests):
After increasing the value of KCS 2 times the issue persists


Version of all relevant components (if applicable):
ODF operator 4.11.3 + Mode Internal

Does this issue impact your ability to continue to work with the product
(please explain in detail what is the user impact)?
Not at the moment the warning is constantly increasing

Is there any workaround available to the best of your knowledge?
No

Rate from 1 - 5 the complexity of the scenario you performed that caused this
bug (1 - very simple, 5 - very complex)?


Can this issue reproducible?
Constantly

Can this issue reproduce from the UI?


If this is a regression, please provide more details to justify this:


Steps to Reproduce:
1.
2.
3.


Actual results:


Expected results:


Additional info:

Comment 3 Malay Kumar parida 2023-06-14 17:09:07 UTC
Seems like some issue related to ceph configuration, Moving to rook for investigation.

Comment 4 Travis Nielsen 2023-06-14 21:18:01 UTC
Rook sets the mds_cache_memory_limit to 50% of the memory limit [1], which by default in ODF is set to 8Gi. 

Have you tried increasing the 'mds' memory limits specified in the StorageCluster CR to see if more memory will help resolve the warning? 

[1] https://github.com/rook/rook/blob/cd27f17ce8a9fd22000c961cee3e0a147707074f/pkg/operator/ceph/file/mds/config.go#L72-L74