Bug 2275459

Summary: [CephFS - Consistency Group] - Quiesce fails randomly with error EBADF
Product: [Red Hat Storage] Red Hat Ceph Storage Reporter: sumr
Component: CephFSAssignee: Leonid Usov <lusov>
Status: VERIFIED --- QA Contact: sumr
Severity: high Docs Contact: Akash Raj <akraj>
Priority: unspecified    
Version: 7.1CC: akraj, ceph-eng-bugs, cephqe-warriors, lusov, tserlin
Target Milestone: ---Keywords: Automation
Target Release: 7.1   
Hardware: x86_64   
OS: All   
Whiteboard:
Fixed In Version: ceph-18.2.1-146.el9cp Doc Type: No Doc Update
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 2267614    

Description sumr 2024-04-17 09:23:34 UTC
Description of problem:

Functional workflow "Verify quiesce on already quiesced subvolume" fails with error 'EBADF' as quiesce command response.

Version-Release number of selected component (if applicable):


How reproducible:2/5


Steps to Reproduce:
1. Run IO on all 6 subvolumes. 
2. Perform quiesce on 3 of 6 subvolumes in a quiesce set
3. Perform quiesce with all 6 subvolumes in quiesce set. Verify quiesce behaviour  on already quiesced subvolume.

Actual results: Quiesce on already quiesced subvolume should gracefully exit. It should complete quiesce on other subvolumes.


Expected results: Quiesce on set of 6 subvolumes failed with error EBADF.


Additional info:
2024-04-17 04:28:35,200 (cephci.snapshot_clone.cg_snap_test) [INFO] - cephci.ceph.ceph.py:1563 - Running command ceph fs quiesce cephfs  "subvolgroup_cg/sv1_non_def_1"  "subvolgroup_cg/sv1_non_def_2"  "subvolgroup_cg/sv1_non_def_3"  "subvolgroup_cg/sv1_non_def_4"  "subvolgroup_cg/sv1_non_def_5"  "subvolgroup_cg/sv1_non_def_6"  --format json  --set-id cg_test1_i9i  --await  --timeout 300  --expiration 300 on 10.0.209.37 timeout 600
2024-04-17 04:28:35,819 (cephci.snapshot_clone.cg_snap_test) [ERROR] - cephci.ceph.ceph.py:1599 - Error 9 during cmd, timeout 600
2024-04-17 04:28:35,820 (cephci.snapshot_clone.cg_snap_test) [ERROR] - cephci.ceph.ceph.py:1600 - Error EBADF:

Automation logs: http://magna002.ceph.redhat.com/cephci-jenkins/cephci-run-DXNWCJ/cg_snap_test_0.log