Note: This bug is displayed in read-only format because the product is no longer active in Red Hat Bugzilla.

Bug 2011089

Summary: Machine stuck deleting after instance has been terminated and node deleted
Product: OpenShift Container Platform Reporter: Matt Bargenquast <mbargenq>
Component: Cloud ComputeAssignee: Joel Speed <jspeed>
Cloud Compute sub component: Cloud Controller Manager QA Contact: sunzhaohua <zhsun>
Status: CLOSED DUPLICATE Docs Contact:
Severity: unspecified    
Priority: unspecified CC: aos-bugs
Version: 4.8   
Target Milestone: ---   
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2021-10-08 12:04:51 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Matt Bargenquast 2021-10-05 23:20:20 UTC
Description of problem:
A cluster has been observed in the following state:
- It has a Machine which has been in a Deleting state for several weeks.
- The Node corresponding to the Machine has been successfully deleted.
- The AWS instance associated with the Machine no longer exists.

The machine-api-controllers just repeat that they cannot find the instance:

W1005 23:05:28.413511       1 reconciler.go:450] xxxxxx-sncmc-worker-us-east-1a-t72sb: Failed to find existing instance by id i-0ca2e1f7a004dbc7f: InvalidInstanceID.NotFound: The instance ID 'i-0ca2e1f7a004dbc7f' does not exist
        status code: 400, request id: cab0c58b-c3ef-4f7c-a295-a31f2ea656cf
I1005 23:05:28.471091       1 reconciler.go:261] xxxxxx-sncmc-worker-us-east-1a-t72sb: Possible eventual-consistency discrepancy; returning an error to requeue
E1005 23:05:28.471115       1 controller.go:246] xxxxxx-sncmc-worker-us-east-1a-t72sb: failed to check if machine exists: requeue in: 20s

Version-Release number of selected component (if applicable):
4.8.11

Expected results:
The machine should be deleted if the instance no longer exists.

Comment 2 Joel Speed 2021-10-08 12:04:51 UTC
Based on the description, this appears to be a duplicate of 2007802, please let me know if you disagree and we can reopen this for further investigation

*** This bug has been marked as a duplicate of bug 2007802 ***