2117387 – vsphere: installer for vsphere does not have steal clock accounting enabled

Bug 2117387 - vsphere: installer for vsphere does not have steal clock accounting enabled

Summary: vsphere: installer for vsphere does not have steal clock accounting enabled

Keywords:
Status:	CLOSED ERRATA
Alias:	None
Product:	OpenShift Container Platform
Classification:	Red Hat
Component:	Installer
Sub Component:
Version:	4.12
Hardware:	Unspecified
OS:	Unspecified
Priority:	high
Severity:	medium
Target Milestone:	---
Target Release:	4.12.0
Assignee:	OCP Installer
QA Contact:	gaoshang
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+	depends on / blocked

Reported:	2022-08-10 20:41 UTC by Joseph Callen
Modified:	2023-01-17 19:55 UTC (History)
CC List:	2 users (show)
Fixed In Version:
Doc Type:	If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed:	2023-01-17 19:54:46 UTC
Target Upstream Version:
Embargoed:

Attachments	(Terms of Use)

Links
System	ID	Priority	Status	Summary	Last Updated
Github	openshift installer pull 6215	None	open	Bug 2117387: vSphere - enable steal time accounting	2022-08-10 20:43:24 UTC
Github	openshift machine-api-operator pull 1056	None	open	Bug 2117387: vSphere - enable steal time accounting	2022-08-10 20:51:16 UTC
Red Hat Product Errata	RHSA-2022:7399	None	None	None	2023-01-17 19:55:07 UTC

Description Joseph Callen 2022-08-10 20:41:21 UTC

The linux kernel was updated:
https://lkml.org/lkml/2020/3/20/1030
to include steal{time,clock} accounting

This would greatly assist in troubleshooting vSphere performance issues
caused by over-provisioned ESXi hosts.

Comment 1 gaoshang 2022-08-24 02:12:11 UTC

This bug has been verified pre-merge and passed.

Steps:
1, Install cluster on vSphere with above 2 PR
2, Once control nodes and worker nodes are ready, check steal time is correctly enabled.

In vSphere web console, check machines 'Edit Settings' -> 'VM Options' -> 'Advanced' -> 'Configuration Parameters'
stealclock.enable TRUE

In control/worker nodes, check st running top command.

$ oc debug node/sgao-v-b7hkt-master-0
...
sh-4.4# top

top - 22:30:42 up  1:19,  0 users,  load average: 1.38, 0.91, 0.87
Tasks: 364 total,   1 running, 363 sleeping,   0 stopped,   0 zombie
%Cpu(s):  7.1 us,  7.1 sy,  0.0 ni, 81.4 id,  1.4 wa,  1.4 hi,  0.0 si,  1.4 st
MiB Mem :  16019.3 total,   1903.0 free,   6177.2 used,   7939.1 buff/cache
MiB Swap:      0.0 total,      0.0 free,      0.0 used.   9414.4 avail Mem


$ oc debug node/sgao-v-b7hkt-worker-fhrg8
...
sh-4.4# top

top - 22:28:44 up  1:07,  0 users,  load average: 0.70, 0.99, 1.00
Tasks: 336 total,   1 running, 335 sleeping,   0 stopped,   0 zombie
%Cpu(s):  3.2 us,  1.4 sy,  0.0 ni, 94.4 id,  0.0 wa,  0.2 hi,  0.3 si,  0.5 st
MiB Mem :  15869.0 total,   4076.7 free,   4080.9 used,   7711.4 buff/cache
MiB Swap:      0.0 total,      0.0 free,      0.0 used.  11361.9 avail Mem

Comment 2 gaoshang 2022-09-20 03:37:04 UTC

Since PR is already merged, set BZ status to verified.

Comment 5 errata-xmlrpc 2023-01-17 19:54:46 UTC

Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Moderate: OpenShift Container Platform 4.12.0 bug fix and security update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2022:7399

Note You need to log in before you can comment on or make changes to this bug.