Bug 2097511

Summary: [RDR] When sequential failover is performed, primary cluster took ~1hr:18mins to cleanup from the time Nodes were powered ON, health & image_health in mirroring status summary on both clusters remain in Warning and never recover
Product: [Red Hat Storage] Red Hat OpenShift Data Foundation Reporter: Aman Agrawal <amagrawa>
Component: cephAssignee: Adam Kupczyk <akupczyk>
ceph sub component: RADOS QA Contact: Aman Agrawal <amagrawa>
Status: CLOSED CURRENTRELEASE Docs Contact:
Severity: high    
Priority: unspecified CC: akupczyk, bmekhiss, bniver, ebenahar, ekuric, idryomov, kramdoss, muagarwa, nojha, odf-bz-bot, pdhiran, srangana, sseshasa, vumrao
Version: 4.10Keywords: TestBlocker
Target Milestone: ---Flags: srangana: needinfo-
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: No Doc Update
Doc Text:
Story Points: ---
Clone Of:
: 2116605 (view as bug list) Environment:
Last Closed: 2023-08-25 06:04:36 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Comment 6 Benamar Mekhissi 2022-06-16 15:00:35 UTC
@Madhu Rajanna can you please take a look.  During the time C1 was trying to become primary for 100 PVCs, the ForcePromote was failing.  It took 13 minutes for all 100 to be marked as Primary.  That's from ~15T17:50 to ~15T18:03. However, C2, the old primary, took a lot longer to transition to secondary (more than an hour), and then all 100 to get deleted. During that time, all VRs were failing to resync.

Comment 22 Mudit Agarwal 2022-07-12 13:37:42 UTC
Aman, do we have an update on this?

Comment 24 Mudit Agarwal 2022-07-25 07:06:41 UTC
Hi Aman, any news on this? Is this still a blocker (specifically TP blocker)?

Comment 66 Elad 2023-06-19 06:05:45 UTC
Moving to 4.13.z for verification purposes

Comment 71 Red Hat Bugzilla 2023-12-24 04:25:04 UTC
The needinfo request[s] on this closed bug have been removed as they have been unresolved for 120 days