Description of problem: ------------------------ RHHI-V DR mechanism makes use of gluster geo-replication to sync the data to the remote site. I see that works good, and the checkpoint is reached, which means the data is synced successfully to the secondary site. But RHV Manager at the primary site fails to recognize the completion of geo-rep data sync and waits indefinitely. Version-Release number of selected component (if applicable): ------------------------------------------------------------- RHV Manager 4.4.3 RHVH 4.4.3 RHGS 3.5.3 How reproducible: ------------------- Always Steps to Reproduce: ------------------- 1. Create a primary site with 3 node RHHI-V deployment 2. Create a secondary site with 3 node RHHI-V deployment, with no storage domains created, but just the volumes created 3. Create a VM with 40GB OS disk and install it with RHEL 8.3 4. Create a geo-rep session from primary site to secondary site 5. Create a schedule to sync the data to secondary site 6. Wait for the schedule for geo-rep session to get triggered Actual results: --------------- Geo-rep session starts and syncs the data successfully, which the RHV Manager /Engine fails to interpret Expected results: ----------------- Once the gluster geo-replication successfully completes data sync, engine should understand the same, and appropriate events to be triggered.
Created attachment 1726770 [details] ENGINE.LOG
Created attachment 1726791 [details] supervdsm.log_grafton10
Created attachment 1726792 [details] supervdsm.log_grafton10
Created attachment 1726794 [details] supervdsm.log_grafton7.tar.gz
Created attachment 1726795 [details] supervdsm.log_grafton8.tar.gz
Created attachment 1726796 [details] supervdsm.log_grafton9.tar.gz
Comment on attachment 1726791 [details] supervdsm.log_grafton10 This is not the right log file
Comment on attachment 1726792 [details] supervdsm.log_grafton10 Incorrect logfile
Created attachment 1726900 [details] engine.log with debug enabled
Tested with 4.4.3.12-0.1.el8ev and glusterfs-6.0-49.el8rhgs, with glusterfs-selinux package. Geo-replication successfully syncs the data from the primary gluster volume to secondary gluster volume using rsync as the sync-method. Also post the sync disaster-recovery roles works good and the VMs could successfully start on the secondary site
Closing this bug as the fix is shipped with latest RHHI-V 1.8.2