Note: This bug is displayed in read-only format because the product is no longer active in Red Hat Bugzilla.

Bug 1417282

Summary:	[3.3] ipfailover keepalived split brain
Product:	OpenShift Container Platform	Reporter:	Phil Cameron <pcameron>
Component:	Networking	Assignee:	Phil Cameron <pcameron>
Status:	CLOSED ERRATA	QA Contact:	Meng Bo <bmeng>
Severity:	high	Docs Contact:
Priority:	high
Version:	3.3.0	CC:	akokshar, aloughla, aos-bugs, bbennett, bmeng, bperkins, eparis, knakayam, mrobson, pcameron, stwalter, tdawson, zzhao
Target Milestone:	---
Target Release:	3.3.1
Hardware:	Unspecified
OS:	Unspecified
Whiteboard:
Fixed In Version:	OSE 3.3 PR 571	Doc Type:	If docs needed, set a value
Doc Text:		Story Points:	---
Clone Of:	1402536	Environment:
Last Closed:	2017-02-22 18:11:43 UTC	Type:	Bug
Regression:	---	Mount Type:	---
Documentation:	---	CRM:
Verified Versions:		Category:	---
oVirt Team:	---	RHEL 7.3 requirements from Atomic Host:
Cloudforms Team:	---	Target Upstream Version:
Embargoed:
Bug Depends On:	1402536
Bug Blocks:	1417276

Comment 1 Phil Cameron 2017-01-27 19:17:23 UTC

ported OES 3.2 code to OSE 3.4 PR 573

Comment 2 Troy Dawson 2017-02-01 14:27:11 UTC

Full URL's to pull requests are much more helpful than just a number.  Since this bug is for 3.3, that pull request is here.
https://github.com/openshift/ose/pull/571

It has been merged and is in v3.3.1.12 or newer.
It is now ready to be tested.

Comment 4 Meng Bo 2017-02-03 09:59:44 UTC

As the comment https://bugzilla.redhat.com/show_bug.cgi?id=1402536#c35 said, the fix was not included in the latest ocp 3.3.1.12 build.

Move the bug back.

Comment 5 Ben Bennett 2017-02-03 19:51:38 UTC

Troy: Is this now in OCP?  Should this be in modified state?

Comment 6 Troy Dawson 2017-02-03 20:48:49 UTC

After looking at the pull request again, I see that it did *not* make it into the images.  My apologies.
I am moving this back to MODIFIED, and I will make sure it makes it into the next image builds, which are scheduled for Tues, Feb. 6.

Comment 7 Troy Dawson 2017-02-08 22:42:54 UTC

This has been merged into ocp and is in OCP v3.3.1.13 or newer.  I have double checked it this time.

Comment 8 zhaozhanqi 2017-02-09 07:45:03 UTC

Due to the router still have the issue https://bugzilla.redhat.com/show_bug.cgi?id=1408129#c10  in OCP v3.3.1.13

So QE will verified this bug once above router issue is fixed. thanks.

Comment 9 zhaozhanqi 2017-02-13 09:55:01 UTC

Verified this bug on OCP v3.3.3.1.13 with ipfailover image id(ba970a85afb9)

steps:

1) set up multi-node env with 2 nodes 
2) make the router is running on each node
3) create ipfailover service with --replicas=1
4) scale up the ipfailover pod
5) Check those two ipfailover pod are working well and the new create ipfailover pod in another node will enter the 'MASTER' status ,the old one will be enter the 'BACKUP' status.

Comment 11 errata-xmlrpc 2017-02-22 18:11:43 UTC

Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2017:0289