Bug 1838430 - [Metal] Support Machine Remediation
Summary: [Metal] Support Machine Remediation
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: OpenShift Container Platform
Classification: Red Hat
Component: Bare Metal Hardware Provisioning
Version: 4.5
Hardware: Unspecified
OS: Unspecified
Priority: high
Severity: high
Target Milestone: ---
Target Release: 4.5.0
Assignee: Nir
QA Contact: mlammon
URL: https://github.com/openshift/cluster-...
Whiteboard:
Depends On: 1831603
Blocks: 1838431
Reported: 2020-05-21 06:57 UTC by Nir
Modified: 2020-07-13 17:40 UTC (History)

Fixed In Version:
Doc Type: Enhancement
Doc Text:
Feature: Remediate unhealthy baremetal machines by rebooting them
Reason: Automatic recovery from transient errors
Result: Unhealthy baremetal machines will be automatically rebooted
Clone Of:
Clones: 1838431
Environment:
Last Closed: 2020-07-13 17:40:34 UTC
Target Upstream Version:
Embargoed:


Links
System                  ID              Private  Priority  Status  Summary  Last Updated
Red Hat Product Errata  RHBA-2020:2409  0        None      None    None     2020-07-13 17:40:49 UTC

Description Nir 2020-05-21 06:57:49 UTC
We introduced baremetal machine remediation logic into openshift/CAPBM in
https://github.com/openshift/cluster-api-provider-baremetal/pull/59

This power-cycles unhealthy hosts (as detected by the Machine Healthcheck Controller).

We would like to backport this to 4.4 and we need a BZ for that.

This feature depends on Baremetal Operator Reboot API:
https://bugzilla.redhat.com/show_bug.cgi?id=1831603
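
The power-cycle flow described above can be sketched roughly as a small state machine. This is only an illustration and is not taken from the PR: the annotation key, the `Host` and `Machine` types, and the `Remediate` function are all hypothetical names, and the sketch assumes the reboot API is driven by setting and clearing a power-off request on the host.

```go
package main

// powerOffRequested is a hypothetical annotation key used for illustration;
// it is not the key used by CAPBM or the Baremetal Operator.
const powerOffRequested = "example.com/power-off-requested"

// Host is a simplified stand-in for a bare metal host resource.
type Host struct {
	PoweredOn   bool
	Annotations map[string]string
}

// Machine is a simplified stand-in for a machine resource.
// Unhealthy is assumed to be set by a health check controller.
type Machine struct {
	Unhealthy   bool
	SawPowerOff bool // true once the host was observed powered off
}

// Step names the action taken by a single reconcile pass.
type Step string

const (
	StepNone            Step = "none"
	StepRequestPowerOff Step = "request-power-off"
	StepWaitPowerOff    Step = "wait-power-off"
	StepRequestPowerOn  Step = "request-power-on"
	StepWaitPowerOn     Step = "wait-power-on"
	StepDone            Step = "done"
)

// Remediate performs one reconcile pass of the power cycle and reports
// which step it took. It is meant to be called repeatedly until it
// returns StepDone (or StepNone for a healthy machine).
func Remediate(m *Machine, h *Host) Step {
	if !m.Unhealthy {
		return StepNone
	}
	if h.Annotations == nil {
		h.Annotations = map[string]string{}
	}
	_, requested := h.Annotations[powerOffRequested]
	switch {
	case !requested && !m.SawPowerOff:
		// Start the cycle: ask for the host to be powered off.
		h.Annotations[powerOffRequested] = ""
		return StepRequestPowerOff
	case requested && h.PoweredOn:
		// Power-off was requested but has not happened yet.
		return StepWaitPowerOff
	case requested && !h.PoweredOn:
		// Host is off: clear the request so it is powered on again.
		delete(h.Annotations, powerOffRequested)
		m.SawPowerOff = true
		return StepRequestPowerOn
	case !h.PoweredOn:
		// Request cleared, host still coming back up.
		return StepWaitPowerOn
	default:
		// Host is back on after a full cycle: remediation is complete.
		m.Unhealthy = false
		m.SawPowerOff = false
		return StepDone
	}
}
```

Calling `Remediate` once per reconcile keeps each pass idempotent: if the controller restarts mid-cycle, the next pass resumes from whatever the host's annotations and power state say.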

Comment 5 mlammon 2020-06-02 17:18:20 UTC
Successfully tested on nightly build 4.5.0-0.nightly-2020-06-01-111748

Comment 6 errata-xmlrpc 2020-07-13 17:40:34 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2020:2409

