Bug 1878905

Summary: [IPI baremetal] Keepalived.conf cannot use new interface name after SDN migration
Product: OpenShift Container Platform Reporter: Ben Nemec <bnemec>
Component: NetworkingAssignee: Antoni Segura Puimedon <asegurap>
Networking sub component: runtime-cfg QA Contact: Victor Voronkov <vvoronko>
Status: CLOSED ERRATA Docs Contact:
Severity: urgent    
Priority: urgent CC: asegurap, bperkins, jerzhang, pliu, vvoronko, yboaron
Version: 4.6Keywords: Triaged
Target Milestone: ---   
Target Release: 4.6.0   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Cause: After SDN migration from openshift-SDN to OVN-K8S node's control plane IP and the VIP assigned to ovs bridge instead of physical NIC. After VIP assigned to ovs bridge, we still have an orphan route to control plane network pointing to the physical NIC instead of the ovs-bridge interface. Consequence: nodes can't communicate with other nodes in the control plane network. which leads to Keepalived wrongly sets the API VIP in multiple nodes, and as a result of that, the API is unavailable. Fix: Set the network mask of VIPs to host netmask (e.g: /32 for IPV4) Result: Successfully migrate SDN plugin to OVN-K8S
Story Points: ---
Clone Of: 1873955 Environment:
Last Closed: 2020-10-27 16:40:46 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Bug Depends On:    
Bug Blocks: 1854306    

Comment 1 Ben Nemec 2020-09-16 16:03:35 UTC
Moving back to 4.6 as the bug that depends on this was pulled back in too.

Comment 2 Victor Voronkov 2020-10-04 14:09:00 UTC
[kni@provisionhost-0-0 ~]$ oc version
Client Version: 4.6.0-0.nightly-2020-10-02-065738
Server Version: 4.6.0-0.ci-2020-10-02-054056

Cluster deployed with OpenShiftSDN
 
After migration to OVN keepalived conf switched from interface to br-ex

[core@master-0-0 ~]$ cat /etc/keepalived/keepalived.conf | grep interface
    interface br-ex
    interface br-ex

Also even after this change, it took a lot of time to Kube-API to rise, more than few hours

Comment 5 errata-xmlrpc 2020-10-27 16:40:46 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (OpenShift Container Platform 4.6 GA Images), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2020:4196