Bug 1958913
| Summary: | "Replacing an unhealthy etcd member whose node is not ready" procedure results in new etcd pod in CrashLoopBackOff | | |
|---|---|---|---|
| Product: | OpenShift Container Platform | Reporter: | Lubov <lshilin> |
| Component: | Etcd | Assignee: | Sam Batschelet <sbatsche> |
| Status: | CLOSED ERRATA | QA Contact: | ge liu <geliu> |
| Severity: | high | Docs Contact: | |
| Priority: | high | | |
| Version: | 4.8 | CC: | augol, hpokorny |
| Target Milestone: | --- | Keywords: | Regression, Triaged |
| Target Release: | 4.8.0 | | |
| Hardware: | Unspecified | | |
| OS: | Unspecified | | |
| Whiteboard: | | | |
| Fixed In Version: | | Doc Type: | If docs needed, set a value |
| Doc Text: | | Story Points: | --- |
| Clone Of: | | Environment: | |
| Last Closed: | 2021-07-27 23:07:46 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | | Category: | --- |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | | | |
| Bug Depends On: | | | |
| Bug Blocks: | 1970141 | | |
| Attachments: | | | |
Description
Lubov
2021-05-10 12:19:06 UTC
Thanks for the report. Could you please retest with the proposed fix?[1] By returning an error instead of retry we didn't give etcd time to scale up the pod.

[1] https://github.com/openshift/etcd/pull/81

Marking blocker - as there is a manual remedy which would include deleting the current data directory as outlined in the section

"Replacing an unhealthy etcd member whose etcd pod is crashlooping"

```
Move the etcd data directory to a different location:

sh-4.2# mv /var/lib/etcd/ /tmp
```

(In reply to Sam Batschelet from comment #3)
> Marking blocker - as there is a manual remedy which would include deleting
> the current data directory as outlined in the section
>
> "Replacing an unhealthy etcd member whose etcd pod is crashlooping"
>
> ```
> Move the etcd data directory to a different location:
>
>
> sh-4.2# mv /var/lib/etcd/ /tmp
> ```

tried this WA, it helped

Verified with comment 9

revising: Verified with comment 11

Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA.

For information on the advisory (Moderate: OpenShift Container Platform 4.8.2 bug fix and security update), and where to find the updated files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2021:2438
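
For anyone applying the manual remedy from comment 3, the move of the data directory can be wrapped in a small guarded helper. This is only a sketch under assumptions: the function name `move_etcd_data_dir` and its safety checks are hypothetical and not part of the documented procedure. The default paths simply mirror the `mv /var/lib/etcd/ /tmp` command quoted above, and the helper is meant for the same root shell in which that command would be run.

```shell
# Hypothetical helper (not from the documented procedure): moves the etcd
# data directory aside, as in "mv /var/lib/etcd/ /tmp", but refuses to run
# if the source is missing or the destination already exists.
move_etcd_data_dir() {
    src="${1:-/var/lib/etcd}"        # data directory to move (assumed default)
    dest_parent="${2:-/tmp}"         # directory that will receive it (assumed default)

    if [ ! -d "$src" ]; then
        echo "no data directory at $src, nothing to move" >&2
        return 1
    fi

    # basename strips any trailing slash, so /var/lib/etcd/ yields "etcd"
    dest="$dest_parent/$(basename "$src")"
    if [ -e "$dest" ]; then
        echo "refusing to overwrite existing $dest" >&2
        return 1
    fi

    mv "$src" "$dest_parent"/
}
```

Calling `move_etcd_data_dir` with no arguments attempts the same move as the command in comment 3. Passing two paths, for example `move_etcd_data_dir /var/lib/etcd /tmp`, makes the source and destination explicit.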