Bug 1395763 - [RFE] Procedure for disaster recovery with minimal intervention.
Summary: [RFE] Procedure for disaster recovery with minimal intervention.
Keywords:
Status: CLOSED DUPLICATE of bug 1284364
Alias: None
Product: Red Hat Enterprise Virtualization Manager
Classification: Red Hat
Component: RFEs
Version: 3.6.8
Hardware: All
OS: Linux
unspecified
high
Target Milestone: ---
: ---
Assignee: Yaniv Lavi
QA Contact: Raz Tamir
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2016-11-16 15:52 UTC by Theophanis Kontogiannis
Modified: 2019-04-28 14:35 UTC (History)
9 users (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2017-06-13 09:04:47 UTC
oVirt Team: Storage
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)
Generic script to be used for fail over (1009 bytes, application/x-shellscript)
2016-11-16 15:52 UTC, Theophanis Kontogiannis
no flags Details

Description Theophanis Kontogiannis 2016-11-16 15:52:53 UTC
Created attachment 1221254 [details]
Generic script to be used for fail over

Description of problem:

The following describes an action path that has been tested to provide DR operations with minimal intervention

Version-Release number of selected component (if applicable):


How reproducible:

100%

Steps to Reproduce:
Assumptions: 
  -- There is full LUN replication in the DR site.
  -- DC = Main Data Center
  -- DR = Disaster Recovery Data Center
  -- RHV-M is on replicated LUN.


1. Add new RHEV-H in the same cluster as the one to be failed over in DR 
2. Approve new RHEV-H (same cluster – Low SPM Priority)
3. Wait to finish Installation of new RHEV-H
4. Place new RHEV-H in maintenance
5. Set persistent FW rules in DR so traffic from DR located RHEV-H, is redirected to the DR operated RHEV-M
6. Add new DR RHEV-H to the same cluster running in Data Center.
NOTE: At least one RHEV-H needs to be configured in DR, to take over VMs during DR operations, as an SPM is needed in the DR.
7. Brink RHEV-H in teh DR Maintenance Mode
8. Power Off RHEV-M in DR
Once we need to move to DR (assuming: LUNs have been replicated to DR, they are brought on line
1. Activate replicated RHEV-M in DR
2. Force RHEV-M in DR to elect a new SPM
  To do this the command 'action host <name of RHEV-H> fence --fence_type manual' must be issued via rhev shell. It can be scripted

3. Allow cluster to settle

Done


Actual results:
Failover between DC and DR works

Expected results:

As actual results

Additional info:

To revert from DC to DR, one needs to reverse LUN replication (both RHEV-datacenter LUNs and RHEV-M LUNs, and launch RHEV-M in DC once sync is completed.

Keep in mind that failing over from DC to DR and back to DC involved manual actions. The above procedure takes out of the loop any need to make manual changes to the database or anywhere else.

Comment 1 Yaniv Lavi 2016-12-04 12:51:30 UTC
I'm not sure what this ticket is about?
What was the intention in opening it?

Comment 2 Theophanis Kontogiannis 2016-12-05 14:00:18 UTC
With Ekin, we worked a minimal intervention / mod-ing methodology for DR operations, that works with minimal intervention (apart from the one any way possibly needed during DR-DC migrations).

We thought to initiate through BZ, the investigation of the possibility of integration to RHEV.

Comment 3 Yaniv Lavi 2017-06-13 09:04:47 UTC
Please review this script as part of the work we are doing on DR for RHV 4.2.

*** This bug has been marked as a duplicate of bug 1284364 ***

Comment 4 Maor 2017-06-13 09:15:15 UTC
Thanks! this might help on block storage domain replication, we also need to add support for registering all the VMs/Templates/Disks from the original setup.


Note You need to log in before you can comment on or make changes to this bug.