Description of problem:
-----------------------
When deleting a VM image file of size 1TB, a sequence of issues/errors is seen in RHV Manager. The SPM host goes non-operational and reboots, and sanlock errors are seen. One guess is that latency in the gluster storage domain is causing the problem.

Version-Release number of selected component (if applicable):
--------------------------------------------------------------
RHV 4.0
RHGS 3.4.2

How reproducible:
-----------------
Always

Steps to Reproduce:
--------------------
1. Create a gluster storage domain
2. Create a disk of size 1TB (either preallocate the disk, or thin-allocate it and write some data into the disk)
3. Delete the VM disk from the RHV Manager UI

Actual results:
---------------
On the hosts tab, the host with the SPM role goes inactive. The events tab shows that a sanlock error has occurred and that the vdsm heartbeat was exceeded on that host, and the SPM host reboots. VMs running on the SPM host go to the unknown state.

Expected results:
-----------------
No errors and healthy VMs

This issue is seen with RHGS 3.0 & RHV 4.0.7. After updating to the latest RHGS 3.4.2 (glusterfs-3.12.2-32.el7rhgs) and RHV 4.2.7, the issue is no longer seen. I have discussed this with Sahina, and I'm closing this bug because the issue is not seen with the latest gluster builds.