2120760 – [RDR] Relocate operation takes avg. ~20mins just for the Pods to show up in the ContainerCreating state on the desired cluster

Bug 2120760 - [RDR] Relocate operation takes avg. ~20mins just for the Pods to show up in the ContainerCreating state on the desired cluster

Summary: [RDR] Relocate operation takes avg. ~20mins just for the Pods to show up in t...

Keywords:
Status:	ASSIGNED
Alias:	None
Product:	Red Hat OpenShift Data Foundation
Classification:	Red Hat Storage
Component:	documentation
Sub Component:
Version:	4.11
Hardware:	Unspecified
OS:	Unspecified
Priority:	unspecified
Severity:	high
Target Milestone:	---
Target Release:	ODF 4.15.0
Assignee:	Olive Lakra
QA Contact:	krishnaram Karthick
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+	depends on / blocked

Reported:	2022-08-23 17:06 UTC by Aman Agrawal
Modified:	2024-01-22 16:08 UTC (History)
CC List:	9 users (show)
Fixed In Version:
Doc Type:	If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed:
Embargoed:

Attachments	(Terms of Use)

Links
System	ID	Private	Priority	Status	Summary	Last Updated
Github	RamenDR ramen pull 905	0	None	open	adding additional status to VRG and DRPC	2023-07-03 12:26:47 UTC

Comment 4 Elad 2022-11-13 12:26:14 UTC

Proposing as a blocker for 4.12.0 as the current behavior leads to application downtime and very high RTO which isn't acceptable.

Comment 12 Mudit Agarwal 2022-11-22 07:42:16 UTC

Given that it is a relocate operation and not failover, according to engineering this does not qualify as a blocker.
Also, in order to identify the actual bottleneck we need more enhancements which are not possible for 4.12

Shyam,

Do we need to add it as a known issue in 4.12?

Comment 13 Shyamsundar 2022-11-23 12:29:28 UTC

(In reply to Mudit Agarwal from comment #12)
> Given that it is a relocate operation and not failover, according to
> engineering this does not qualify as a blocker.
> Also, in order to identify the actual bottleneck we need more enhancements
> which are not possible for 4.12
> 
> Shyam,
> 
> Do we need to add it as a known issue in 4.12?

I do not think so, as it would depend on the amount of data etc. to transfer. IOW, barring the unavailability of a metric on how much data is left to transfer there is no known issue here.

Requested to add a note to the DR document in the relocate pre-req to check last sync time is closer to current time before relocation, to speed up the process: https://docs.google.com/document/d/1N6IbYv6SGkmAsj6UHGrZWNnUD3qHq3nsPFH4obCBf6M/edit?disco=AAAAkUOiR84

Note You need to log in before you can comment on or make changes to this bug.