Bug 1989524 - [Azure] Machine object showing Failed phase even node is ready and VM is running properly
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: OpenShift Container Platform
Classification: Red Hat
Component: Cloud Compute
Version: 4.6.z
Hardware: Unspecified
OS: Unspecified
Priority: unspecified
Severity: high
Target Milestone: ---
Target Release: 4.7.z
Assignee: dmoiseev
QA Contact: Milind Yadav
URL:
Whiteboard:
Depends On: 1957349
Blocks:
Reported: 2021-08-03 11:55 UTC by OpenShift BugZilla Robot
Modified: 2021-08-17 12:12 UTC (History)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2021-08-17 12:12:09 UTC
Target Upstream Version:
Embargoed:


Links
System ID Private Priority Status Summary Last Updated
Github openshift cluster-api-provider-azure pull 228 0 None None None 2021-08-03 11:55:46 UTC
Red Hat Product Errata RHBA-2021:3032 0 None None None 2021-08-17 12:12:50 UTC

Comment 2 Milind Yadav 2021-08-09 04:58:51 UTC
Validated on 4.7.0-0.nightly-2021-08-06-180629 (cluster version output is in Comment 3).

When the machine was stopped from the console:
.
.
.
ace":"openshift-machine-api","name":"miyadav-az0908-p65g2-worker-northcentralus-47bnf","uid":"92c2565a-354c-4408-a74f-3ba35f730e3a","apiVersion":"machine.openshift.io/v1beta1","resourceVersion":"32254"} "reason"="Updated"
I0809 04:51:06.088031       1 reconciler.go:394] Provisioning state is 'Updating' for machine miyadav-az0908-p65g2-worker-northcentralus-47bnf
I0809 04:51:06.088136       1 controller.go:276] miyadav-az0908-p65g2-worker-northcentralus-47bnf: reconciling machine triggers idempotent update
I0809 04:51:06.088180       1 actuator.go:168] Updating machine miyadav-az0908-p65g2-worker-northcentralus-47bnf
I0809 04:51:06.298266       1 machine_scope.go:160] miyadav-az0908-p65g2-worker-northcentralus-47bnf: patching machine
I0809 04:51:06.320348       1 controller.go:168] miyadav-az0908-p65g2-worker-northcentralus-47bnf: reconciling Machine
I0809 04:51:06.320364       1 actuator.go:201] miyadav-az0908-p65g2-worker-northcentralus-47bnf: actuator checking if machine exists
I0809 04:51:06.320426       1 recorder.go:98] controller-runtime/manager/events "msg"="Normal"  "message"="Updated machine \"miyadav-az0908-p65g2-worker-northcentralus-47bnf\"" "object"={"kind":"Machine","namespace":"openshift-machine-api","name":"miyadav-az0908-p65g2-worker-northcentralus-47bnf","uid":"92c2565a-354c-4408-a74f-3ba35f730e3a","apiVersion":"machine.openshift.io/v1beta1","resourceVersion":"32385"} "reason"="Updated"
.
.
The same logs as above appear when the machine is restarted.

When the machine was deleted:
.
.
.
E0809 04:56:17.259907       1 controller.go:271] miyadav-az0908-p65g2-worker-northcentralus-47bnf: failed to check if machine exists: vm for machine miyadav-az0908-p65g2-worker-northcentralus-47bnf has unexpected 'Deleting' provisioning state
E0809 04:56:17.259962       1 controller.go:267] controller-runtime/manager/controller/machine_controller "msg"="Reconciler error" "error"="vm for machine miyadav-az0908-p65g2-worker-northcentralus-47bnf has unexpected 'Deleting' provisioning state" "name"="miyadav-az0908-p65g2-worker-northcentralus-47bnf" "namespace"="openshift-machine-api" 
I0809 04:56:17.580111       1 controller.go:168] miyadav-az0908-p65g2-worker-northcentralus-47bnf: reconciling Machine
I0809 04:56:17.580412       1 actuator.go:201] miyadav-az0908-p65g2-worker-northcentralus-47bnf: actuator checking if machine exists
E0809 04:56:17.708850       1 actuator.go:213] failed to check machine miyadav-az0908-p65g2-worker-northcentralus-47bnf exists: vm for machine miyadav-az0908-p65g2-worker-northcentralus-47bnf has unexpected 'Deleting' provisioning state
E0809 04:56:17.708878       1 controller.go:271] miyadav-az0908-p65g2-worker-northcentralus-47bnf: failed to check if machine exists: vm for machine miyadav-az0908-p65g2-worker-northcentralus-47bnf has unexpected 'Deleting' provisioning state
E0809 04:56:17.708935       1 controller.go:267] controller-runtime/manager/controller/machine_controller "msg"="Reconciler error" "error"="vm for machine miyadav-az0908-p65g2-worker-northcentralus-47bnf has unexpected 'Deleting' provisioning state" "name"="miyadav-az0908-p65g2-worker-northcentralus-47bnf" "namespace"="openshift-machine-api" 
I0809 04:56:18.349122       1 controller.go:168] miyadav-az0908-p65g2-worker-northcentralus-47bnf: reconciling Machine
I0809 04:56:18.349167       1 actuator.go:201] miyadav-az0908-p65g2-worker-northcentralus-47bnf: actuator checking if machine exists
E0809 04:56:18.554213       1 actuator.go:213] failed to check machine miyadav-az0908-p65g2-worker-northcentralus-47bnf exists: vm for machine miyadav-az0908-p65g2-worker-northcentralus-47bnf has unexpected 'Deleting' provisioning state
.
.
I0809 04:56:59.244506       1 controller.go:168] miyadav-az0908-p65g2-worker-northcentralus-47bnf: reconciling Machine
I0809 04:56:59.244535       1 actuator.go:201] miyadav-az0908-p65g2-worker-northcentralus-47bnf: actuator checking if machine exists
W0809 04:56:59.467651       1 virtualmachines.go:91] vm miyadav-az0908-p65g2-worker-northcentralus-47bnf not found: %!w(string=compute.VirtualMachinesClient#Get: Failure responding to request: StatusCode=404 -- Original Error: autorest/azure: Service returned an error. Status=404 Code="ResourceNotFound" Message="The Resource 'Microsoft.Compute/virtualMachines/miyadav-az0908-p65g2-worker-northcentralus-47bnf' under resource group 'miyadav-az0908-p65g2-rg' was not found. For more details please go to https://aka.ms/ARMResourceNotFoundFix")
I0809 04:56:59.467691       1 controller.go:426] miyadav-az0908-p65g2-worker-northcentralus-47bnf: going into phase "Failed"
I0809 04:56:59.500922       1 controller.go:168] miyadav-az0908-p65g2-worker-northcentralus-47bnf: reconciling Machine
W0809 04:56:59.500947       1 controller.go:265] miyadav-az0908-p65g2-worker-northcentralus-47bnf: machine has gone "Failed" phase. It won't reconcile
.
.
.


Additional info:
Based on the logs above, moving to VERIFIED.

Comment 3 Milind Yadav 2021-08-09 04:59:59 UTC
[miyadav@miyadav azure]$ oc get clusterversion --kubeconfig az
NAME      VERSION                             AVAILABLE   PROGRESSING   SINCE   STATUS
version   4.7.0-0.nightly-2021-08-06-180629   True        False         14m     Cluster version is 4.7.0-0.nightly-2021-08-06-180629
[miyadav@miyadav azure]$

Comment 6 errata-xmlrpc 2021-08-17 12:12:09 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (OpenShift Container Platform 4.7.24 bug fix update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2021:3032

