2004295 – [4.8] virt-handler goroutine count increases over time

Bug 2004295 - [4.8] virt-handler goroutine count increases over time

Summary: [4.8] virt-handler goroutine count increases over time

Keywords:
Status:	CLOSED DUPLICATE of bug 2004299
Alias:	None
Product:	Container Native Virtualization (CNV)
Classification:	Red Hat
Component:	SSP
Sub Component:
Version:	4.8.2
Hardware:	Unspecified
OS:	Unspecified
Priority:	unspecified
Severity:	unspecified
Target Milestone:	---
Target Release:	future
Assignee:	Andrej Krejcir
QA Contact:	Geetika Kapoor
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:	2004299 2004300
TreeView+	depends on / blocked

Reported:	2021-09-14 23:39 UTC by Sarah Bennert
Modified:	2022-06-08 11:54 UTC (History)
CC List:	2 users (show)
Fixed In Version:
Doc Type:	If docs needed, set a value
Doc Text:
Clone Of:
Clones:	2004299 2004300 (view as bug list)
Environment:
Last Closed:	2022-06-08 11:54:10 UTC
Target Upstream Version:
Embargoed:
Dependent Products:

Attachments	(Terms of Use)
test VM yaml (996 bytes, text/plain) 2021-10-04 15:15 UTC, Sarah Bennert	no flags	Details
View All

Links
System	ID	Private	Priority	Status	Summary	Last Updated
Red Hat Bugzilla	1996660	1	high	CLOSED	[4.8] Goroutine count and memory remains high after VMIs are removed	2023-01-26 13:10:52 UTC

Description Sarah Bennert 2021-09-14 23:39:39 UTC

Tracking for 4.8.z

Description of problem:
  Opened as a follow-up to https://bugzilla.redhat.com/show_bug.cgi?id=1996660
  Observed an additional slower goroutine leak.

How reproduceible:
  Very reproduceible, must be run for an extended time to observe increase.

Steps to Reproduce:
  Allow for any stabilization between each step.
  While observing goroutines metric on virt-handler,
    1. Create and Start a batch of 100-300 VMs
    2. Delete all VMs in batch
    3. Repeat

Expected Results:
  Goroutines do not increase between VM deployments after initial deployment

Actual Results:
  Over the course of approximately 48 hours, goroutine count increased between 14-28 per virt-handler.

Comment 1 Andrej Krejcir 2021-10-04 14:36:24 UTC

Because the goroutine leak is in virt-handler, it may be possible to reproduce it in a smaller cluster.

Sarah, do you remember approximately how many VMs were running on each node?
Can you share the VM definition that you used?

Comment 2 Sarah Bennert 2021-10-04 15:15:13 UTC

Created attachment 1829079 [details]
test VM yaml

Hi Andrej,

I deployed 100-300 VMs across three worker nodes using the attached VM definition, so should have been somewhere around 30-100 VMs per node.

Process should be repeated continuously over the test period, but allowing time for the control plane to stabilize between state transitions (start/delete/start/delete/...). On the three worker node cluster, allowed approximately 10-15 minutes for stabilization between states.

Comment 3 Geetika Kapoor 2022-06-08 11:54:10 UTC


*** This bug has been marked as a duplicate of bug 2004299 ***

Note You need to log in before you can comment on or make changes to this bug.