Bug 1825290
Summary: | [GCE] Machine status should be "Failed" with an invalid configuration | ||
---|---|---|---|
Product: | OpenShift Container Platform | Reporter: | Joel Speed <jspeed> |
Component: | Cloud Compute | Assignee: | Joel Speed <jspeed> |
Cloud Compute sub component: | Other Providers | QA Contact: | sunzhaohua <zhsun> |
Status: | CLOSED ERRATA | Docs Contact: | |
Severity: | medium | ||
Priority: | medium | CC: | agarcial, jspeed, zhsun |
Version: | 4.5 | ||
Target Milestone: | --- | ||
Target Release: | 4.5.0 | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | Bug Fix | |
Doc Text: |
Cause: Errors returned from the cloud-provider actuator no longer matched the expected type due to being wrapped using github.com/pkg/errors
Consequence: The Machine controller could not determine that the Machine should be marked as failed
Fix: Use error wrapping from the standard library to check the error types
Result: Machine controller can now determine when Machines should be marked Failed
|
Story Points: | --- |
Clone Of: | 1824497 | Environment: | |
Last Closed: | 2020-07-13 17:28:35 UTC | Type: | --- |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: | |||
Bug Depends On: | 1824497 | ||
Bug Blocks: |
Description
Joel Speed
2020-04-17 15:02:33 UTC
All PRs are merged. Seems automation failed to update status. Setting to modified verified clusterversion: 4.5.0-0.nightly-2020-05-07-144853 $ oc get machine NAME PHASE TYPE REGION ZONE AGE zhsung-b5bff-m-0 Running n1-standard-4 us-central1 us-central1-a 39m zhsung-b5bff-m-1 Running n1-standard-4 us-central1 us-central1-b 39m zhsung-b5bff-m-2 Running n1-standard-4 us-central1 us-central1-c 39m zhsung-b5bff-w-a-p8zzv Running n1-standard-4 us-central1 us-central1-a 24m zhsung-b5bff-w-b-2wj9k Running n1-standard-4 us-central1 us-central1-b 24m zhsung-b5bff-w-c-l7mxd Running n1-standard-4 us-central1 us-central1-c 24m zhsung-b5bff-w-f-gw25b Failed 91s I0508 02:23:41.339369 1 controller.go:82] controllers/MachineSet "msg"="Reconciling" "machineset"="zhsung-b5bff-w-f" "namespace"="openshift-machine-api" I0508 02:23:41.386005 1 controller.go:166] zhsung-b5bff-w-f-gw25b: reconciling Machine I0508 02:23:41.399656 1 controller.go:282] controller-runtime/controller "msg"="Successfully Reconciled" "controller"="machine_controller" "request"={"Namespace":"openshift-machine-api","Name":"zhsung-b5bff-w-f-gw25b"} I0508 02:23:41.400457 1 controller.go:166] zhsung-b5bff-w-f-gw25b: reconciling Machine I0508 02:23:41.400818 1 actuator.go:75] zhsung-b5bff-w-f-gw25b: Checking if machine exists E0508 02:23:41.610579 1 controller.go:105] controllers/MachineSet "msg"="Failed to reconcile MachineSet" "error"="error fetching machine type \"n1-standard-4-invalid\": error fetching machine type \"n1-standard-4-invalid\" in zone \"us-central1-f\": googleapi: Error 404: The resource 'projects/openshift-qe/zones/us-central1-f/machineTypes/n1-standard-4-invalid' was not found, notFound" "machineset"="zhsung-b5bff-w-f" "namespace"="openshift-machine-api" I0508 02:23:41.611905 1 recorder.go:52] controller-runtime/manager/events "msg"="Warning" "message"="error fetching machine type \"n1-standard-4-invalid\": error fetching machine type \"n1-standard-4-invalid\" in zone \"us-central1-f\": googleapi: Error 404: The resource 'projects/openshift-qe/zones/us-central1-f/machineTypes/n1-standard-4-invalid' was not found, notFound" "object"={"kind":"MachineSet","namespace":"openshift-machine-api","name":"zhsung-b5bff-w-f","uid":"afce054e-4822-4116-9011-fe5be3fdb093","apiVersion":"machine.openshift.io/v1beta1","resourceVersion":"28001"} "reason"="ReconcileError" I0508 02:23:41.618821 1 controller.go:282] controller-runtime/controller "msg"="Successfully Reconciled" "controller"="machineset" "request"={"Namespace":"openshift-machine-api","Name":"zhsung-b5bff-w-f"} I0508 02:23:41.618890 1 controller.go:82] controllers/MachineSet "msg"="Reconciling" "machineset"="zhsung-b5bff-w-f" "namespace"="openshift-machine-api" E0508 02:23:41.744165 1 controller.go:105] controllers/MachineSet "msg"="Failed to reconcile MachineSet" "error"="error fetching machine type \"n1-standard-4-invalid\": error fetching machine type \"n1-standard-4-invalid\" in zone \"us-central1-f\": googleapi: Error 404: The resource 'projects/openshift-qe/zones/us-central1-f/machineTypes/n1-standard-4-invalid' was not found, notFound" "machineset"="zhsung-b5bff-w-f" "namespace"="openshift-machine-api" I0508 02:23:41.744676 1 recorder.go:52] controller-runtime/manager/events "msg"="Warning" "message"="error fetching machine type \"n1-standard-4-invalid\": error fetching machine type \"n1-standard-4-invalid\" in zone \"us-central1-f\": googleapi: Error 404: The resource 'projects/openshift-qe/zones/us-central1-f/machineTypes/n1-standard-4-invalid' was not found, notFound" "object"={"kind":"MachineSet","namespace":"openshift-machine-api","name":"zhsung-b5bff-w-f","uid":"afce054e-4822-4116-9011-fe5be3fdb093","apiVersion":"machine.openshift.io/v1beta1","resourceVersion":"28005"} "reason"="ReconcileError" I0508 02:23:41.752495 1 controller.go:282] controller-runtime/controller "msg"="Successfully Reconciled" "controller"="machineset" "request"={"Namespace":"openshift-machine-api","Name":"zhsung-b5bff-w-f"} I0508 02:23:41.848534 1 reconciler.go:302] zhsung-b5bff-w-f-gw25b: Machine does not exist I0508 02:23:41.848745 1 controller.go:421] zhsung-b5bff-w-f-gw25b: going into phase "Provisioning" I0508 02:23:41.860465 1 controller.go:310] zhsung-b5bff-w-f-gw25b: reconciling machine triggers idempotent create I0508 02:23:41.860497 1 actuator.go:57] zhsung-b5bff-w-f-gw25b: Creating machine I0508 02:23:42.418022 1 reconciler.go:168] zhsung-b5bff-w-f-gw25b: Reconciling machine object with cloud state I0508 02:23:42.418057 1 reconciler.go:147] Error launching instance: googleapi: Error 400: Invalid value for field 'resource.machineType': 'zones/us-central1-f/machineTypes/n1-standard-4-invalid'. Machine type with name 'n1-standard-4-invalid' does not exist in zone 'us-central1-f'., invalid I0508 02:23:42.418234 1 machine_scope.go:161] "zhsung-b5bff-w-f-gw25b": patching machine W0508 02:23:42.437368 1 controller.go:312] zhsung-b5bff-w-f-gw25b: failed to create machine: error launching instance: googleapi: Error 400: Invalid value for field 'resource.machineType': 'zones/us-central1-f/machineTypes/n1-standard-4-invalid'. Machine type with name 'n1-standard-4-invalid' does not exist in zone 'us-central1-f'., invalid I0508 02:23:42.437395 1 controller.go:412] Actuator returned invalid configuration error: error launching instance: googleapi: Error 400: Invalid value for field 'resource.machineType': 'zones/us-central1-f/machineTypes/n1-standard-4-invalid'. Machine type with name 'n1-standard-4-invalid' does not exist in zone 'us-central1-f'., invalid I0508 02:23:42.437402 1 controller.go:421] zhsung-b5bff-w-f-gw25b: going into phase "Failed" I0508 02:23:42.438291 1 recorder.go:52] controller-runtime/manager/events "msg"="Warning" "message"="error launching instance: googleapi: Error 400: Invalid value for field 'resource.machineType': 'zones/us-central1-f/machineTypes/n1-standard-4-invalid'. Machine type with name 'n1-standard-4-invalid' does not exist in zone 'us-central1-f'., invalid" "object"={"kind":"Machine","namespace":"openshift-machine-api","name":"zhsung-b5bff-w-f-gw25b","uid":"ed508997-091a-4ad7-90f3-1099d5342ac6","apiVersion":"machine.openshift.io/v1beta1","resourceVersion":"28011"} "reason"="FailedCreate" I0508 02:23:42.446819 1 controller.go:282] controller-runtime/controller "msg"="Successfully Reconciled" "controller"="machine_controller" "request"={"Namespace":"openshift-machine-api","Name":"zhsung-b5bff-w-f-gw25b"} I0508 02:23:42.447467 1 controller.go:166] zhsung-b5bff-w-f-gw25b: reconciling Machine W0508 02:23:42.447500 1 controller.go:263] zhsung-b5bff-w-f-gw25b: machine has gone "Failed" phase. It won't reconcile Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2020:2409 |