Bug 1775119

Summary: OSP Keepalived static pod crashlooping
Product: OpenShift Container Platform
Component: Installer
Installer sub component: OpenShift on OpenStack
Version: 4.3.0
Target Release: 4.3.0
Status: CLOSED ERRATA
Severity: unspecified
Priority: unspecified
Hardware: Unspecified
OS: Unspecified
Reporter: Tomas Sedovic <tsedovic>
Assignee: Tomas Sedovic <tsedovic>
QA Contact: David Sanz <dsanzmor>
CC: bperkins
Type: Bug
Last Closed: 2020-01-23 11:13:36 UTC
Clones: 1775123
Bug Blocks: 1775123

Description Tomas Sedovic 2019-11-21 12:21:32 UTC
Description of problem:

The keepalived static pod on the bootstrap machine keeps crashlooping. The VRID we calculate occasionally ends up being 0, which keepalived rejects as invalid (it must be between 1 and 255), so the VRRP process exits with a configuration error every time the pod starts.
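
To illustrate the failure mode, here is a minimal sketch of deriving a VRID from a name so that 0 can never come out. It is not the code from baremetal-runtimecfg: the function name `vrid_from_name` and the choice of CRC32 as the hash are assumptions made for this example only.

```python
import zlib


def vrid_from_name(name: str) -> int:
    """Map a name to a VRRP virtual router ID in the valid range 1..255.

    Hypothetical helper for illustration; the hash (CRC32) is an
    assumption, not what the installer actually uses.
    """
    checksum = zlib.crc32(name.encode("utf-8"))
    # checksum % 255 lies in 0..254; adding 1 shifts the result to 1..255,
    # so the invalid value 0 cannot be produced.
    return checksum % 255 + 1
```

By contrast, reducing a hash with `% 256` yields 0 for roughly one input in 256, which would match the "random" reproducibility seen here.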

How reproducible: random

Actual results:

The keepalived static pod does not start up properly.

Expected results:

The keepalived pod should always start up.

Comment 1 Tomas Sedovic 2019-11-21 12:24:48 UTC
Keepalived logs on the bootstrap machine:

Starting Keepalived v1.3.5 (03/19,2017), git commit v1.3.5-6-g6fa32f2
Opening file '/etc/keepalived/keepalived.conf'.
Starting VRRP child process, pid=7
Registering Kernel netlink reflector
Registering Kernel netlink command channel
Registering gratuitous ARP shared channel
Opening file '/etc/keepalived/keepalived.conf'.
VRRP Error : VRID not valid - must be between 1 & 255. reconfigure !
Truncating auth_pass to 8 characters
Truncating auth_pass to 8 characters
VRRP_Instance(c3rs517m-90437_API) the virtual id must be set!
Stopped
Keepalived_vrrp exited with permanent error CONFIG. Terminating
Stopping
Stopped Keepalived v1.3.5 (03/19,2017), git commit v1.3.5-6-g6fa32f2
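
A check like the following could catch the bad value before keepalived is started. This is only a sketch: it assumes the rendered config sets the ID with keepalived's `virtual_router_id` directive, one per line, and the helper name `invalid_vrids` is made up for this example.

```python
import re

# Matches lines such as "    virtual_router_id 51" and captures the number.
VRID_LINE = re.compile(r"^\s*virtual_router_id\s+(-?\d+)\s*$", re.MULTILINE)


def invalid_vrids(config_text: str) -> list:
    """Return every virtual_router_id value outside keepalived's 1..255 range.

    Hypothetical helper for illustration; it does a plain line scan and
    does not parse the full keepalived.conf grammar.
    """
    values = [int(v) for v in VRID_LINE.findall(config_text)]
    return [v for v in values if not 1 <= v <= 255]
```

An empty result means every `virtual_router_id` found is in range. Any returned value corresponds to the "VRID not valid" error in the log above.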

Comment 2 Tomas Sedovic 2019-11-21 12:25:40 UTC
Github issue: https://github.com/openshift/baremetal-runtimecfg/issues/21

Comment 3 Tomas Sedovic 2019-11-21 12:26:44 UTC
Fixed by: https://github.com/openshift/baremetal-runtimecfg/pull/23

Comment 4 Tomas Sedovic 2019-11-21 12:27:07 UTC
We're no longer seeing these issues in our CI.

Comment 6 errata-xmlrpc 2020-01-23 11:13:36 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2020:0062