Description of problem: There can be timing issues on the bootstrap node that result in keepalived stopping before kube-apiserver is up on the masters. When this happens, the API VIP migrates to the masters before they are ready and this causes the deployment to fail. The problem appears to be that the bootstrap kube-apiserver goes down for a period of time, and if this time is long enough to trigger the keepalived-monitor to stop keepalived, then the deployment breaks. How reproducible: Intermittent. In some environments it happens frequently, in others rarely.
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (OpenShift Container Platform 4.6 GA Images), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2020:4196