Summary: | cluster-api machines are recreated when backing instance removed | |
---|---|---|---
Product: | OpenShift Container Platform | Reporter: | Michael Gugino <mgugino>
Component: | Cloud Compute | Assignee: | Alberto <agarcial>
Status: | CLOSED ERRATA | QA Contact: | Jianwei Hou <jhou>
Severity: | medium | Docs Contact: |
Priority: | unspecified | |
Version: | 4.1.0 | CC: | agarcial, erich
Target Milestone: | --- | |
Target Release: | 4.3.0 | |
Hardware: | Unspecified | |
OS: | Unspecified | |
Last Closed: | 2020-01-23 11:03:45 UTC | Type: | Bug |
Description
Michael Gugino
2019-05-20 17:05:20 UTC
Added to 4.1 issue tracker comments: https://github.com/openshift/openshift-docs/issues/12487

If we ship a fix for this prior to shipping 4.1 GA, we may want to ensure it gets tracked properly there as well.

Comment 2 (Alberto):

This is by design the expected behaviour for any Kubernetes controller, and we shouldn't deviate from it: it reconciles existing state with desired state. If something deletes an instance out of band, the machine API will notice only once the controller resync period has expired. How machine health checking or any other component interacts by consuming the API is orthogonal.

Follow-up comment:

Keeping this open and bumping to 4.3, as we plan to make Machine objects "fire and forget" in terms of cloud instance creation.

Comment 4 (Eric Rich):

(In reply to Michael Gugino from comment #0)
Why is this not expected behavior? If it's not expected behavior, then why is it not part of the docs directly?

(In reply to Alberto from comment #2)
> This is by design the expected behaviour for any Kubernetes controller and we
> shouldn't deviate from it - it reconciles existing state with desired state.

If this is true, then we should close as NOT A BUG.

Reply (Michael Gugino):

(In reply to Eric Rich from comment #4)
> Why is this not expected behavior? If it's not expected behavior then why is
> it not part of the docs directly?

@Eric This behavior is mostly an artifact from upstream. There is a multitude of reasons why it doesn't fit well for us, and upstream is (I believe) also switching to the 'create once' model. We only recently decided which behavior we actually want.

> (In reply to Alberto from comment #2)
> > This is by design the expected behaviour for any Kubernetes controller and we
> > shouldn't deviate from it - it reconciles existing state with desired state.
> If this is true then we should close as NOT A BUG.

That statement is outdated. This bug might be redundant if we're tracking the feature work elsewhere, but on the other hand it may be useful to others who consider this a bug to keep the rationale here until we cover it in docs and code.

Later comment:

Since we introduced machine phases, this should be fixed now. If a cloud instance is deleted out of band, the backing machine should enter the Failed phase; it must then be deleted. https://github.com/openshift/cluster-api/pull/75

Verification (Jianwei Hou):

Verified in 4.3.0-0.nightly-2019-11-13-233341. If the instance is deleted, its backing machine has the 'Failed' phase, and the machine must be deleted.

Closing note:

Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2020:0062
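To make the behavior change discussed in this thread concrete, below is a minimal Go sketch. It is illustrative only, not the actual openshift/cluster-api code from the PR above: the `Machine` struct, the function names, and the map standing in for the cloud provider are all hypothetical. It contrasts the old reconcile-to-desired-state behavior (a missing instance is treated as drift and recreated) with the phase-based "fire and forget" behavior shipped in 4.3 (the machine is marked Failed and a replacement only happens once the Machine object is deleted).

```go
// Hypothetical sketch of the two reconcile behaviors; not the real
// openshift/cluster-api implementation.
package main

import (
	"errors"
	"fmt"
)

// Machine is a stand-in for the Machine API object.
type Machine struct {
	Name       string
	InstanceID string
	Phase      string // e.g. "Running", "Failed"
}

// errInstanceNotFound simulates the cloud provider reporting that the
// backing instance no longer exists.
var errInstanceNotFound = errors.New("instance not found")

// lookupInstance pretends to query the cloud provider for an instance.
func lookupInstance(id string, cloud map[string]bool) error {
	if !cloud[id] {
		return errInstanceNotFound
	}
	return nil
}

// reconcileOld models the pre-4.3 behavior: the controller treats the
// missing instance as drift from desired state and recreates it.
func reconcileOld(m *Machine, cloud map[string]bool) {
	if err := lookupInstance(m.InstanceID, cloud); errors.Is(err, errInstanceNotFound) {
		cloud[m.InstanceID] = true // silently recreate the instance
		fmt.Printf("old behavior: recreated instance %s for machine %s\n", m.InstanceID, m.Name)
	}
}

// reconcileNew models the phase-based behavior: an out-of-band deletion
// moves the machine to the Failed phase, and the Machine object must be
// deleted (by a user or a health-checking component) to get a replacement.
func reconcileNew(m *Machine, cloud map[string]bool) {
	if err := lookupInstance(m.InstanceID, cloud); errors.Is(err, errInstanceNotFound) {
		m.Phase = "Failed"
		fmt.Printf("new behavior: machine %s marked %s; delete the machine to replace it\n", m.Name, m.Phase)
	}
}

func main() {
	cloud := map[string]bool{} // both instances were deleted out of band
	reconcileOld(&Machine{Name: "worker-0", InstanceID: "i-abc123", Phase: "Running"}, cloud)
	reconcileNew(&Machine{Name: "worker-1", InstanceID: "i-def456", Phase: "Running"}, cloud)
}
```

On a real 4.3 cluster, the verified behavior is the second one: after the instance is terminated out of band, `oc get machines -n openshift-machine-api` should show the machine in the Failed phase, and a replacement appears only once the Machine object itself is deleted (manually, or by machine health checking where enabled).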