Bug 1992841
Summary: | Deleting Machine Node object throws reconciliation error after WMCO restart | ||
---|---|---|---|
Product: | OpenShift Container Platform | Reporter: | Mansi Kulkarni <mankulka> |
Component: | Windows Containers | Assignee: | Mohammad Saif Shaikh <mohashai> |
Status: | CLOSED ERRATA | QA Contact: | Ronnie Rasouli <rrasouli> |
Severity: | high | Docs Contact: | |
Priority: | high | ||
Version: | 4.9 | CC: | aliryan, aos-bugs, rrasouli, team-winc |
Target Milestone: | --- | ||
Target Release: | 4.10.0 | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | Bug Fix | |
Doc Text: |
Previously, deleting the Node associated with a Windows Machine object threw a reconciliation error upon restart of the operator. This fix opts not to react or reconcile when the node referenced by a Windows Machine in Running state is not found within the cluster, preventing any error loop and standardizing functionality with Linux Machine objects.
|
Story Points: | --- |
Clone Of: | Environment: | ||
Last Closed: | 2022-03-28 09:36:25 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: | |||
Bug Depends On: | |||
Bug Blocks: | 2009474 |
Description
Mansi Kulkarni
2021-08-11 19:41:40 UTC
After discussing with the team, the decision was made to not react to Windows node deletion events. This approach was chosen rather than deleting & re-creating the unassociated Machine as optimizing machine management is not in scope of WMCO responsibilities. In addition, not reacting is in line with the current behavior with Linux Machines (MCO), standardizing OpenShift functionality across OSs. Also, the Machine cannot be reconfigured to create a new Node object since the machine-api's nodelink_controller will not update any Machine's Node reference, neither after deleting a Node nor after reconfiguring a Machine. A fix has been tested and the PR is under review here https://github.com/openshift/windows-machine-config-operator/pull/675. verified on 5.0.0+a88772f Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Moderate: Windows Container Support for Red Hat OpenShift 5.0.0 [security update]), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHSA-2022:0577 |