Created attachment 1398051 [details] Screenshot evidences Description of problem: In a multi-region and HA environment, when a primary vmdb of a child region becomes unavailable and repmgr and failover-monitor switches the workers for standby vmdb, replication in global region stop working and a "500 Internal Server Error" is showed in Replication tab (look at screenshots attached). Version-Release number of selected component (if applicable): 5.8.0 How reproducible: Yes Steps to Reproduce: 1. Setup a global and a remote region. 2. The remote region DB needs to be HA. 3. Simulate a failure in primary DB in remote region. Standby VMDB is switched to Primary VMDB. 4. Access "Configuration -> Settings -> Region -> Replication tab". The "500 Internal Server Error" is presented. Actual results: - Replication stops and a "500 Internal Server Error" is presented. Expected results: - Global region should detect that the Primary VMDB is out and start working with Standby VMDB, just like failover-monitor does with the workers in the region. - No "Internal Server Error" should be presented. Additional info:
Is this not a duplicate of 1391095? The current way replication/HA works it wont failover correctly without some virtual IP usage as stated in that RFE bug.
I think so, unless by the "500 Internal Server Error" issue (I didn't see any reference to this error).
The 500 error was fixed as a part of https://bugzilla.redhat.com/show_bug.cgi?id=1540688 (specifically in https://github.com/ManageIQ/pg-pglogical/pull/20) Marking this a duplicate of bug 1391095 *** This bug has been marked as a duplicate of bug 1391095 ***
Hi Nick! The screenshot I have is a little bit different, is it being fixed by this PR also? Regards, Giovanni