Bug 1898194
| Summary: | GCP: can't install on custom machine types | ||
|---|---|---|---|
| Product: | OpenShift Container Platform | Reporter: | Manuel Dewald <mdewald> |
| Component: | Installer | Assignee: | Jeremiah Stuever <jstuever> |
| Installer sub component: | openshift-installer | QA Contact: | To Hung Sze <tsze> |
| Status: | CLOSED ERRATA | Docs Contact: | |
| Severity: | urgent | ||
| Priority: | urgent | CC: | gshereme, jeder, jstuever, mstaeble, tsze |
| Version: | 4.7 | Keywords: | ServiceDeliveryBlocker, ServiceDeliveryImpact |
| Target Milestone: | --- | ||
| Target Release: | 4.7.0 | ||
| Hardware: | Unspecified | ||
| OS: | Unspecified | ||
| Whiteboard: | |||
| Fixed In Version: | Doc Type: | No Doc Update | |
| Doc Text: | Story Points: | --- | |
| Clone Of: | Environment: | ||
| Last Closed: | 2021-02-24 15:33:36 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | Category: | --- | |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | |||
|
Description
Manuel Dewald
2020-11-16 15:58:26 UTC
Simpler way to test (this manually):
create an install-config by openshift-install create install-config --dir <install-dir>
modify the install-config.yaml and replace the {} after GCP with:
type: custom-4-16384
Then run: openshift-install create manifests --dir <install-dir>
4.7 is spitting out an error like this:
FATAL failed to fetch Master Machines: failed to load asset "Install Config": [controlPlane.platform.gcp.type: Invalid value: "custom-4-16384": instance type custom-4-16384 not found, compute[0].platform.gcp.type: Invalid value: "custom-4-16384": instance type custom-4-16384 not found]
Also, 'gcloud compute machine-types list' doesn't seem to return custom types.
Simpler way to test (this manually):
create an install-config by openshift-install create install-config --dir <install-dir>
modify the install-config.yaml and replace the {} after GCP with:
type: custom-4-16384
Then run: openshift-install create manifests --dir <install-dir>
4.7 is spitting out an error like this:
FATAL failed to fetch Master Machines: failed to load asset "Install Config": [controlPlane.platform.gcp.type: Invalid value: "custom-4-16384": instance type custom-4-16384 not found, compute[0].platform.gcp.type: Invalid value: "custom-4-16384": instance type custom-4-16384 not found]
Also, 'gcloud compute machine-types list' doesn't seem to return custom types.
We will need to add special handling for custom machine types. They are named using a specific format: [type-]custom-<cpu>-<memory> and defaults to N1 when no type is specified. This can be tested by using the following custom machine types in the control plane section of the install-config: No errors as these are valid: n1-standard-4, custom-4-16384, n1-custom-4-16384 Google API 404 (not found): n1-dne-4, custom-2, custom-a, custom-2-b, n1-custom-2, n1-custom-a, n1-custom-2-b controlPlane.platform.gcp.type: Internal error: googleapi: Error 404: The resource 'projects/myproject/zones/us-west1-b/machineTypes/custom-2' was not found, notFound Google API Internal Error (invalidResourceUsage): custom-4-16383, n1-custom-4-16383, custom-3-16384, n1-custom-3-16384 controlPlane.platform.gcp.type: Internal error: googleapi: Error 400: Invalid resource usage: 'Memory should be a multiple of 256MiB, while 16383MiB is requested'., invalidResourceUsage controlPlane.platform.gcp.type: Internal error: googleapi: Error 400: Invalid resource usage: 'Number of vCPUs should be multiple of 2 if greater than 2, while 3 is requested'., invalidResourceUsage Invalid memory and CPU: n1-standard-2, custom-2-7680, n1-custom-2-7680, controlPlane.platform.gcp.type: Invalid value: "custom-2-7680": instance type does not meet minimum resource requirements of 4 vCPUs controlPlane.platform.gcp.type: Invalid value: "custom-2-7680": instance type does not meet minimum resource requirements of 15360 MB Memory Verified with openshift-install-linux-4.7.0-0.nightly-2020-11-23-074526 FATAL failed to fetch Master Machines: failed to load asset "Install Config": compute[0].platform.gcp.type: Invalid value: "n1-custom-4-16383": instance type n1-custom-4-16383 not found FATAL failed to fetch Master Machines: failed to load asset "Install Config": compute[0].platform.gcp.type: Invalid value: "n1-dn2-4": instance type n1-dn2-4 not found FATAL failed to fetch Master Machines: failed to load asset "Install Config": [controlPlane.platform.gcp.type: Invalid value: "n1-standard-2": instance type does not meet minimum resource requirements of 4 vCPUs, controlPlane.platform.gcp.type: Invalid value: "n1-standard-2": instance type does not meet minimum resource requirements of 15360 MB Memory] added automation for the changes here in OCP-36886 Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Moderate: OpenShift Container Platform 4.7.0 security, bug fix, and enhancement update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHSA-2020:5633 |