Bug 1855288

Summary: Libvirt machine controller causes hypervisor URI to hang
Product: OpenShift Container Platform Reporter: David Benoit <dbenoit>
Component: Cloud ComputeAssignee: David Benoit <dbenoit>
Cloud Compute sub component: Other Providers QA Contact: sunzhaohua <zhsun>
Status: CLOSED ERRATA Docs Contact:
Severity: high    
Priority: unspecified CC: aos-bugs, mfojtik, psundara
Version: 4.6   
Target Milestone: ---   
Target Release: 4.6.0   
Hardware: All   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2020-10-27 16:13:10 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 1857759    

Description David Benoit 2020-07-09 13:26:22 UTC
Description of problem:
When using libvirt IPI as a platform for openshift, the machine-controller does not terminate connections to the libvirt URI, leading to libvirtd becoming unresponsive after enough simultaneous connections occur.  Unresponsiveness is usually seen with about 5000 concurrent connections, which can occur in as little as an hour in a five node cluster.

Version-Release number of selected component (if applicable):
4.6 and earlier

How reproducible:
Always

Steps to Reproduce:
1. deploy openshift on libvirt (IPI)
2. netstat -tnaup | grep 16509 | grep ESTABLISHED | wc-l

Actual results:
Cluster deploys successfully, but the number of connections to the URI grows quickly.

Expected results:
Generally not more than 2-3 concurrent connections to the libvirt URI at a given time.

Additional info:
Fix submitted upstream.
https://github.com/openshift/cluster-api-provider-libvirt/pull/199

Comment 6 errata-xmlrpc 2020-10-27 16:13:10 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (OpenShift Container Platform 4.6 GA Images), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2020:4196