Bug 1918101
| Summary: | [vsphere]Delete Provisioning machine took about 12 minutes | ||
|---|---|---|---|
| Product: | OpenShift Container Platform | Reporter: | sunzhaohua <zhsun> |
| Component: | Cloud Compute | Assignee: | dmoiseev |
| Cloud Compute sub component: | Other Providers | QA Contact: | sunzhaohua <zhsun> |
| Status: | CLOSED ERRATA | Docs Contact: | |
| Severity: | low | ||
| Priority: | low | CC: | aarapov, dmoiseev, mgugino |
| Version: | 4.7 | ||
| Target Milestone: | --- | ||
| Target Release: | 4.8.0 | ||
| Hardware: | Unspecified | ||
| OS: | Unspecified | ||
| Whiteboard: | |||
| Fixed In Version: | Doc Type: | Bug Fix | |
| Doc Text: |
Cause:
The machine-controller deletion procedure did not distinguish between vCenter task types, so deletion blocked whenever a failed task was present in vCenter.
Consequence:
Deleting a machine that was never actually created (for example, because the datastore lacked space) took a long time.
Fix:
The machine-controller deletion procedure now checks the vCenter task type and does not block deletion.
Result:
A machine in the Provisioning phase is deleted quickly.
|
Story Points: | --- |
| Clone Of: | Environment: | ||
| Last Closed: | 2021-07-27 22:36:15 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | Category: | --- | |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | |||
|
Description
sunzhaohua
2021-01-20 03:26:27 UTC
It's interesting that we can't delete a machine if there's no space on the datastore. Perhaps we need to check there's space before we attempt to create a VM, and go into Failed if not.

The bug is a case where we should set some status on the machine object if we receive an error, rather than a 'machine still exists' response. Aside from that, cluster owners are required to ensure their infrastructure is healthy. I don't think we should be accountable for ensuring enough space exists on the infrastructure. The API will tell us when there isn't, and that's the check.

I think adding some healthchecks to the datacenter to prevent us trying to create machines on unhealthy datacenters may be useful. Will see if someone has time to look at this next sprint.

Verified clusterversion: 4.8.0-0.nightly-2021-05-21-233425

Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Moderate: OpenShift Container Platform 4.8.2 bug fix and security update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2021:2438 |
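
For readers who want a concrete picture of the fix summarized in the Doc Text (check the vCenter task type so a failed task does not block deletion), here is a minimal Go sketch. It is not the actual machine-controller code. The `Task` type, the `Decision` values, and the `decideDeletion` function are hypothetical names invented for illustration, and the rule that only a failed clone task lets deletion proceed is an assumption about how "task type" is used, not something this report states.

```go
package main

import "fmt"

// TaskState is a simplified, hypothetical view of a vCenter task's state.
type TaskState string

const (
	TaskRunning TaskState = "running"
	TaskSuccess TaskState = "success"
	TaskError   TaskState = "error"
)

// cloneTaskKind is an illustrative identifier for a VM clone task.
// Treat the exact string as an assumption.
const cloneTaskKind = "VirtualMachine.clone"

// Task is a hypothetical stand-in for the last task recorded for a machine.
type Task struct {
	Kind  string
	State TaskState
}

// Decision is what the deletion procedure should do next.
type Decision int

const (
	Proceed Decision = iota // go ahead with deletion
	Wait                    // requeue and check the task again later
	Fail                    // surface the task error instead of deleting
)

// decideDeletion inspects the last task before deleting a machine.
// Assumption: a failed clone task means the VM was never created, so there
// is nothing to wait for and deletion can proceed immediately.
func decideDeletion(t *Task) Decision {
	if t == nil {
		return Proceed // no task recorded, nothing can block
	}
	switch t.State {
	case TaskRunning:
		return Wait
	case TaskError:
		if t.Kind == cloneTaskKind {
			return Proceed
		}
		return Fail
	default:
		return Proceed
	}
}

func main() {
	failedClone := &Task{Kind: cloneTaskKind, State: TaskError}
	fmt.Println(decideDeletion(failedClone) == Proceed) // prints: true
}
```

Under these assumptions, a machine stuck in Provisioning because its clone task failed (for example, on a full datastore) is deleted on the first reconcile instead of being requeued repeatedly.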