Bug 1970002

Summary: IPI vsphere installation failing for having the API VIP moved to master before kube-apiserver starts
Product: OpenShift Container Platform Reporter: Christian Passarelli <cpassare>
Component: Networking Assignee: Beth White <beth.white>
Networking sub component: runtime-cfg QA Contact: Victor Voronkov <vvoronko>
Status: CLOSED DUPLICATE Docs Contact:
Severity: medium    
Priority: unspecified CC: aos-bugs, augol, beth.white, bnemec, sgreene, yjoseph
Version: 4.7   
Target Milestone: ---   
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2021-08-03 17:53:32 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Christian Passarelli 2021-06-09 15:40:35 UTC
Description of problem:
During the installation of an IPI cluster on vSphere, the API VIP is moved to a master node that has no kube-apiserver running,
and it is not automatically moved back to the bootstrap node, which is the only node with a running kube-apiserver at that moment.
As a result, the installation is blocked and cannot continue.
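
One way to confirm the symptom is to probe the API readiness endpoint through the VIP and directly on the bootstrap node. The sketch below is a diagnostic aid only, not part of the installer or of runtime-cfg. It assumes the API listens on port 6443 and that /readyz can be reached without credentials; the function name api_is_ready is hypothetical.

```python
import http.client
import ssl
import urllib.error
import urllib.request


def api_is_ready(base_url: str, timeout: float = 3.0) -> bool:
    """Return True if GET <base_url>/readyz answers with HTTP 200."""
    # Diagnostic only: certificate verification is disabled because the
    # probing host may not trust the cluster CA during installation.
    ctx = ssl.create_default_context()
    ctx.check_hostname = False
    ctx.verify_mode = ssl.CERT_NONE
    # An empty ProxyHandler bypasses any proxy configured in the environment.
    opener = urllib.request.build_opener(
        urllib.request.ProxyHandler({}),
        urllib.request.HTTPSHandler(context=ctx),
    )
    try:
        with opener.open(f"{base_url}/readyz", timeout=timeout) as resp:
            return resp.status == 200
    except (urllib.error.URLError, http.client.HTTPException, OSError):
        # Connection refused, timeout, TLS failure, or a non-2xx answer.
        return False
```

If the call returns False for https://<API VIP>:6443 while it returns True for https://<bootstrap node IP>:6443, the VIP is most likely held by a node that is not yet serving the API, which matches the behavior described above.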

Version-Release number of selected component (if applicable):
4.7.12 was used by the customer reporting the issue.

How reproducible:
Not sure, but on the customer side it happened 3 out of 3 times.

Steps to Reproduce:
1.
2.
3.

Actual results:
The keepalived instance on one of the master nodes became MASTER for the API VIP without kube-apiserver running on that node.

Expected results:
keepalived should not move the VIP at that stage, or it should be able to detect that the API is not being served behind the VIP on the master and move the VIP back to the bootstrap node.
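
The expected behavior can be illustrated with a toy model of a priority-based VIP election in which a failed health check lowers a node's priority, similar in spirit to a keepalived track script with a negative weight. This is a sketch under assumptions, not keepalived's implementation or the configuration that runtime-cfg generates; the node names, priorities, and penalty value are hypothetical.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Node:
    name: str
    base_priority: int      # hypothetical static priority
    apiserver_ready: bool   # result of a local kube-apiserver health check


def effective_priority(node: Node, penalty: int) -> int:
    """Subtract the penalty from the priority while the health check fails."""
    return node.base_priority - (0 if node.apiserver_ready else penalty)


def elect_vip_holder(nodes: List[Node], penalty: int = 40) -> str:
    """Return the name of the node with the highest effective priority."""
    return max(nodes, key=lambda n: effective_priority(n, penalty)).name


# Hypothetical install-time state: only the bootstrap node serves the API.
nodes = [
    Node("bootstrap", base_priority=50, apiserver_ready=True),
    Node("master-0", base_priority=70, apiserver_ready=False),
]

# With no health-check penalty the master takes the VIP (the reported problem).
print(elect_vip_holder(nodes, penalty=0))
# With a penalty the VIP stays on the bootstrap node until the master is ready.
print(elect_vip_holder(nodes, penalty=40))
```

In this model the penalty must be larger than the priority gap between the master and the bootstrap node; otherwise the unhealthy master would still win the election.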

Additional info:
The workaround found so far is to manually restart the master node that owns the VIP, which forces the VIP
to be assigned back to the bootstrap node and allows the installation to finish.

Comment 6 Red Hat Bugzilla 2023-09-15 01:09:35 UTC
The needinfo request[s] on this closed bug have been removed as they have been unresolved for 500 days