Bug 1479050

Summary:	Split brain when configuring keepalived
Product:	Red Hat Enterprise Linux 7	Reporter:	Brett Thurber <bthurber>
Component:	keepalived	Assignee:	Ryan O'Hara <rohara>
Status:	CLOSED NOTABUG	QA Contact:	Brandon Perkins <bperkins>
Severity:	medium	Docs Contact:
Priority:	unspecified
Version:	7.3	CC:	bthurber, cluster-maint
Target Milestone:	rc
Target Release:	---
Hardware:	All
OS:	Linux
Whiteboard:
Fixed In Version:		Doc Type:	If docs needed, set a value
Doc Text:		Story Points:	---
Clone Of:		Environment:
Last Closed:	2017-10-24 14:55:33 UTC	Type:	Bug
Regression:	---	Mount Type:	---
Documentation:	---	CRM:
Verified Versions:		Category:	---
oVirt Team:	---	RHEL 7.3 requirements from Atomic Host:
Cloudforms Team:	---	Target Upstream Version:
Embargoed:
Bug Depends On:
Bug Blocks:	1447969

Description Brett Thurber 2017-08-07 20:25:02 UTC

Description of problem:
When configuring keepalived per the following:  https://access.redhat.com/articles/2967461

...the VIP exists on both nodes.  There appears to be communication error between the master and backup.

Version-Release number of selected component (if applicable):
7.3

How reproducible:
Everytime

Steps to Reproduce:
1.  Follow the steps in the linked KB
2.
3.

Actual results:
VIP exists on both the master and backup keepalived nodes

Expected results:
VIP only exists on the master

Additional info:
Reference BZ for reproduction:  https://bugzilla.redhat.com/show_bug.cgi?id=1447969

Comment 2 Ryan O'Hara 2017-08-08 13:27:38 UTC

I checked the other BZ and I am not seeing details about how to reproduce this in a standalone keepalived environment. If the VIP exists on both nodes, that usually suggests that VRRP traffic is not getting through (ie. firewall is blocking). Please provide details of how to reproduce as well as logs from keepalived and all configuration files (from both nodes). You can also tcpdump to see the VRRP traffic, but keep in mind that tcpdump will see it before iptables.

Comment 10 Ryan O'Hara 2017-10-24 14:55:11 UTC

The problem here is that keepalived on ha1 has a VRID of 151 and ha2 has a VRID of 152, so the two node are not participating on the same virtual router. You want the virtual_router_id to be the same on each node.