Note: This bug is displayed in read-only format because the product is no longer active in Red Hat Bugzilla.

Bug 1801379

Summary: etcd: raft can stop before purge loop exists resulting in wal: file not found
Product: OpenShift Container Platform Reporter: Sam Batschelet <sbatsche>
Component: EtcdAssignee: Sam Batschelet <sbatsche>
Status: CLOSED WONTFIX QA Contact: ge liu <geliu>
Severity: medium Docs Contact:
Priority: high    
Version: 4.3.0CC: geliu
Target Milestone: ---   
Target Release: 4.3.z   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: 1801237 Environment:
Last Closed: 2020-05-22 00:00:31 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On: 1801196, 1801237, 1815634, 1815646    
Bug Blocks:    

Description Sam Batschelet 2020-02-10 18:21:07 UTC
+++ This bug was initially created as a clone of Bug #1801237 +++

+++ This bug was initially created as a clone of Bug #1801196 +++

Description of problem: In some circumstances, raft can stop before purge loop exists. Basically the result is that etcd can remove wal files that are still needed to replay state. So when etcd is restarted it will fail with a catastrophic error.

C | etcdserver: open wal error: wal: file not found.


https://github.com/etcd-io/etcd/pull/11308

Version-Release number of selected component (if applicable):


How reproducible: rare


Steps to Reproduce:
1.
2.
3.

Actual results: catastrophic error


Expected results: etcd does not fail with unrecoverable catastrophic error


Additional info:

Comment 1 Michal Fojtik 2020-05-12 10:45:12 UTC
This bug hasn't had any activity in the last 30 days. Maybe the problem got resolved, was a duplicate of something else, or became less pressing for some reason - or maybe it's still relevant but just hasn't been looked at yet.

As such, we're marking this bug as "LifecycleStale" and decreasing the severity. 

If you have further information on the current state of the bug, please update it, otherwise this bug will be automatically closed in 7 days. The information can be, for example, that the problem still occurs, that you still want the feature, that more information is needed, or that the bug is (for whatever reason) no longer relevant.

Comment 2 OpenShift BugZilla Robot 2020-05-22 00:00:31 UTC
This bug hasn't had any activity 7 days after it was marked as LifecycleStale, so we are closing this bug as WONTFIX. If you consider this bug still valuable, please reopen it or create new bug.

Comment 3 Red Hat Bugzilla 2023-09-14 05:52:15 UTC
The needinfo request[s] on this closed bug have been removed as they have been unresolved for 1000 days