Bug 2063862 - S3 validation failure on the failed cluster prevents failover for Metro DR
Summary: S3 validation failure on the failed cluster prevents failover for Metro DR
Keywords:
Status: CLOSED CURRENTRELEASE
Alias: None
Product: Red Hat OpenShift Data Foundation
Classification: Red Hat Storage
Component: odf-dr
Version: 4.10
Hardware: Unspecified
OS: Unspecified
unspecified
high
Target Milestone: ---
: ODF 4.10.0
Assignee: Raghavendra Talur
QA Contact: Martin Bukatovic
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2022-03-14 14:12 UTC by Raghavendra Talur
Modified: 2023-08-09 17:00 UTC (History)
11 users (show)

Fixed In Version: 4.10.0-210
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2022-04-21 09:12:53 UTC
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Github RamenDR ramen pull 405 0 None Merged controllers: skip the s3 validation for the failed cluster 2022-03-22 07:16:02 UTC
Github red-hat-storage ramen pull 22 0 None open Bug 2063862: controllers: skip the s3 validation for the failed cluster 2022-03-22 14:39:13 UTC

Description Raghavendra Talur 2022-03-14 14:12:23 UTC
Description of problem (please be detailed as possible and provide log
snippests):
When the primary cluster fails in a Metro DR configuration, the S3 store may also be down. In that scenario, the s3 validation code in Ramen prevents the failover process from progressing.


Version of all relevant components (if applicable):
4.10


Is there any workaround available to the best of your knowledge?
No workarounds.

Can this issue reproducible?
Yes. If the S3 store is cohosted with ODF. Like it is in the default ODF setup.

Steps to Reproduce:
1. Setup Metro DR with S3 store on both the Metro DR clusters and provide them in the Ramen config.
2. Bring down the primary cluster and initiate the failover process by editing the fencing status for the failed cluster.


Actual results:
The failover process is stuck.

Expected results:
The failover process should complete.

Comment 5 Raghavendra Talur 2022-03-16 14:53:01 UTC
Patch posted upstream at https://github.com/RamenDR/ramen/pull/405


Note You need to log in before you can comment on or make changes to this bug.