Bug 2034147

Summary: OpenShift VMware IPI Installation fails with Resource customization when corespersocket is unset and vCPU count is not a multiple of 4
Product: OpenShift Container Platform Reporter: Nitish Kaushik <nkaushik>
Component: InstallerAssignee: Aditya Narayanaswamy <anarayan>
Installer sub component: openshift-installer QA Contact: Wenxin Wei <wwei>
Status: CLOSED ERRATA Docs Contact:
Severity: medium    
Priority: medium CC: anarayan, dbewley, jcallen, jialiu, jima
Version: 4.9   
Target Milestone: ---   
Target Release: 4.11.0   
Hardware: All   
OS: All   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Cause: Proper validation was missing for the field "corespersocket" Consequence: The corespersocket value can be more than the numCores set, causing problems during cluster creation Fix: Added a validation for the corespersocket Result: User warned before creating cluster and asked to fix before creation
Story Points: ---
Clone Of: Environment:
Last Closed: 2022-08-10 10:40:43 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Comment 1 Dale Bewley 2022-02-09 20:44:58 UTC
The change is flagged as an enhancement so Doc Type set accordingly. https://github.com/openshift/installer/commit/ea437be225b14fd75a29c881c0ce1c2e9767b61f

Comment 4 Wenxin Wei 2022-03-07 03:42:33 UTC
before fix:

set num_cpus: 6:

output error:
level=error msg=Error: error powering on virtual machine: powerOn task for vm "/SDDC-Datacenter/vm/wwei-bugtest-fb4kx/wwei-bugtest-fb4kx-master-0" failed: Module CPUID power on failed. 


after fix:

set num_cpus: 6:
    coresPerSocket:

output error:
level=fatal msg=failed to fetch Master Machines: failed to load asset "Install Config": invalid "install-config.yaml" file: controlPlane.platform.vsphere.cpus: Invalid value: 6: numCPUs specified should be a multiple of cores per socket which is by default 4



set num_cpus:
    coresPerSocket: 6
./openshift-install create manifests --dir ./
FATAL failed to fetch Master Machines: failed to load asset "Install Config": invalid "install-config.yaml" file: compute[0].platform.vsphere.coresPerSocket: Invalid value: 6: cores per socket must be less than number of CPUs which is by default 4

set num_cpus: 4
    coresPerSocket: 6
./openshift-install create manifests --dir ./
FATAL failed to fetch Master Machines: failed to load asset "Install Config": invalid "install-config.yaml" file: compute[0].platform.vsphere.coresPerSocket: Invalid value: 6: cores per socket must be less than number of CPUs  

set num_cpus: 6
    coresPerSocket: 4

./openshift-install create manifests --dir ./
FATAL failed to fetch Master Machines: failed to load asset "Install Config": invalid "install-config.yaml" file: compute[0].platform.vsphere.cpus: Invalid value: 6: numCPUs specified should be a multiple of cores per socket

Comment 5 Joseph Callen 2022-03-07 18:58:49 UTC
Is this going to be backported to 4.10?

Comment 6 Wenxin Wei 2022-03-08 03:30:01 UTC
reproduce version:
 registry.ci.openshift.org/ocp/release:4.9.0-0.nightly-2022-03-03-053721

verified version:
 registry.ci.openshift.org/ocp/release:4.11.0-0.nightly-2022-03-04-063157

the details refer to comment 4 above

Comment 11 errata-xmlrpc 2022-08-10 10:40:43 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Important: OpenShift Container Platform 4.11.0 bug fix and security update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2022:5069