Bug 2120760 - [RDR] Relocate operation takes avg. ~20mins just for the Pods to show up in the ContainerCreating state on the desired cluster
Summary: [RDR] Relocate operation takes avg. ~20mins just for the Pods to show up in t...
Keywords:
Status: ASSIGNED
Alias: None
Product: Red Hat OpenShift Data Foundation
Classification: Red Hat Storage
Component: odf-dr
Version: 4.11
Hardware: Unspecified
OS: Unspecified
unspecified
high
Target Milestone: ---
: ---
Assignee: rakesh
QA Contact: krishnaram Karthick
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2022-08-23 17:06 UTC by Aman Agrawal
Modified: 2023-08-14 14:40 UTC (History)
9 users (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed:
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Github RamenDR ramen pull 905 0 None open adding additional status to VRG and DRPC 2023-07-03 12:26:47 UTC

Comment 4 Elad 2022-11-13 12:26:14 UTC
Proposing as a blocker for 4.12.0 as the current behavior leads to application downtime and very high RTO which isn't acceptable.

Comment 12 Mudit Agarwal 2022-11-22 07:42:16 UTC
Given that it is a relocate operation and not failover, according to engineering this does not qualify as a blocker.
Also, in order to identify the actual bottleneck we need more enhancements which are not possible for 4.12

Shyam,

Do we need to add it as a known issue in 4.12?

Comment 13 Shyamsundar 2022-11-23 12:29:28 UTC
(In reply to Mudit Agarwal from comment #12)
> Given that it is a relocate operation and not failover, according to
> engineering this does not qualify as a blocker.
> Also, in order to identify the actual bottleneck we need more enhancements
> which are not possible for 4.12
> 
> Shyam,
> 
> Do we need to add it as a known issue in 4.12?

I do not think so, as it would depend on the amount of data etc. to transfer. IOW, barring the unavailability of a metric on how much data is left to transfer there is no known issue here.

Requested to add a note to the DR document in the relocate pre-req to check last sync time is closer to current time before relocation, to speed up the process: https://docs.google.com/document/d/1N6IbYv6SGkmAsj6UHGrZWNnUD3qHq3nsPFH4obCBf6M/edit?disco=AAAAkUOiR84


Note You need to log in before you can comment on or make changes to this bug.