Bug 2290526

Summary: [Tracker ACM-12001] [RDR] VolSync - rsync-tls fails to sync when there are too many files in the root of the source PVC
Product: [Red Hat Storage] Red Hat OpenShift Data Foundation Reporter: Aman Agrawal <amagrawa>
Component: odf-drAssignee: Karolin Seeger <kseeger>
odf-dr sub component: ramen QA Contact: Aman Agrawal <amagrawa>
Status: CLOSED ERRATA Docs Contact:
Severity: urgent    
Priority: unspecified CC: asriram, kramdoss, kseeger, muagarwa, nberry, sheggodu
Version: 4.16Keywords: Tracking
Target Milestone: ---   
Target Release: ODF 4.16.2   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Cause: The default limit on number of files open was too low if a PVC had many files in the root dir... Consequence: Fix: Result:
Story Points: ---
Clone Of:
: 2290691 (view as bug list) Environment:
Last Closed: 2024-09-18 11:57:37 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 2290691    

Description Aman Agrawal 2024-06-05 08:16:13 UTC
Description of problem (please be detailed as possible and provide log
snippests):

ARGS_MAX can be different values on different linux versions, but have been able to reproduce with ~60k files in the root of the source PVC.


Version of all relevant components (if applicable):
ceph version 18.2.1-188.el9cp (b1ae9c989e2f41dcfec0e680c11d1d9465b1db0e) reef (stable)
OCP 4.16.0-0.nightly-2024-05-23-173505
ACM 2.11.0-DOWNSTREAM-2024-05-23-15-16-26
MCE 2.6.0-104 
ODF 4.16.0-108.stable
Gitops v1.12.3 
Submariner 0.18.0 (image: brew.registry.redhat.io/rh-osbs/iib:722673)
VolSync 0.8.1

Does this issue impact your ability to continue to work with the product
(please explain in detail what is the user impact)?


Is there any workaround available to the best of your knowledge?


Rate from 1 - 5 the complexity of the scenario you performed that caused this
bug (1 - very simple, 5 - very complex)?


Can this issue reproducible?


Can this issue reproduce from the UI?


If this is a regression, please provide more details to justify this:


Steps to Reproduce:
1. Create source PVC with 60k files or dirs (easier to create if using long file names) in the root of the PVC
Create a replicationsource/dest using rsync-tls to sync this to the destination
Check the destination after replication is complete and compare to ensure all files/dirs were copied correctly
2.
3.


Actual results: Files from the source may not be copied to the destination

Expected results: Files should be copied to the dst without any loss/hinderance


Additional info:

Comment 4 Aman Agrawal 2024-06-05 09:04:25 UTC
We don't need logs for this BZ because live setup was used for debugging/RCA of the issue by Benamar and @tflower together and the fix would land from VolSync.

Comment 9 Sunil Kumar Acharya 2024-06-25 07:57:59 UTC
As ACM-12001 is on_qa moving the bz to ON_QA.

Comment 11 Sunil Kumar Acharya 2024-06-25 12:09:21 UTC
Please update the RDT flag/text appropriately.

Comment 28 errata-xmlrpc 2024-09-18 11:57:37 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Important: Red Hat OpenShift Data Foundation 4.16.2 security and bug fix update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2024:6755