Bug 2230601

Summary: [GSS] [csi-driver] Unable to to perform backup due to clone from snapshot is pending but no clone subvolumes exist
Product: [Red Hat Storage] Red Hat OpenShift Data Foundation Reporter: Steve Baldwin <sbaldwin>
Component: csi-driverAssignee: Madhu Rajanna <mrajanna>
Status: ASSIGNED --- QA Contact: krishnaram Karthick <kramdoss>
Severity: medium Docs Contact:
Priority: medium    
Version: 4.10CC: assingh, glaw, khiremat, mrajanna, muagarwa, odf-bz-bot, pdhange, rar, ypadia
Target Milestone: ---   
Target Release: ---   
Hardware: x86_64   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Steve Baldwin 2023-08-09 17:35:50 UTC
Description of problem (please be detailed as possible and provide log
snippests):

Unable to perform backup/snapshot of cephfs pv, the jobs created stay in a pending state with the following message:
Generated from openshift-storage.cephfs.csi.ceph.com_csi-cephfsplugin-provisioner-689769bf9c-4f85m_b888385e-0c95-47d5-ac4e-cf7be65d5714 14 times in the last 1 hour failed to provision volume with StorageClass "ocs-storagecluster-cephfs": rpc error: code = Aborted desc = clone from snapshot is pending

When looking at the csi-vols we do not see any subvolumes reporting type "clone"
Version of all relevant components (if applicable):
4.10.12 / rhcs 16.2.7-126

Does this issue impact your ability to continue to work with the product
(please explain in detail what is the user impact)?
yes, customer is unable to perform backups currently on 

Is there any workaround available to the best of your knowledge?
No work-around

Rate from 1 - 5 the complexity of the scenario you performed that caused this
bug (1 - very simple, 5 - very complex)?
4

Can this issue reproducible?
Yes, at the customer site.

Can this issue reproduce from the UI?
Yes, 

If this is a regression, please provide more details to justify this:
N/A

Steps to Reproduce:
1. Initiate a backup/snapshot through SPP
2.Snapshot job stays in a "pending" state due to "clone from snapshot is pending"


Actual results:
Backups/Snapshot jobs stay in pending state due to pending clone but we do not see any  subvolume clones.

Expected results:
Successful snapshot/clone operation

Additional info: