Bug 1965024
Summary: | [DR] backup and restore should perform consistency checks on etcd snapshots | ||
---|---|---|---|
Product: | OpenShift Container Platform | Reporter: | Sam Batschelet <sbatsche> |
Component: | Etcd | Assignee: | Sam Batschelet <sbatsche> |
Status: | CLOSED ERRATA | QA Contact: | ge liu <geliu> |
Severity: | high | Docs Contact: | |
Priority: | high | ||
Version: | 4.6 | CC: | wlewis |
Target Milestone: | --- | ||
Target Release: | 4.8.0 | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | Enhancement | |
Doc Text: |
Feature: Validate the status of the etcd snapshot after backup and before restore
Reason: Previously, backup procedure was not validating the snapshot taken to be complete; and restore was not verifying that the snapshot being attempted to be restored is valid, and not corrupted. It would be a good enhancement to validate the status of the backup.
Result: If there is a corruption on disk during backup or restore, the error is clearly reported to the admin.
|
Story Points: | --- |
Clone Of: | Environment: | ||
Last Closed: | 2021-07-27 23:10:12 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: | |||
Bug Depends On: | |||
Bug Blocks: | 1976287 |
Description
Sam Batschelet
2021-05-26 14:52:18 UTC
For 4.8, we decided not to store the checksum during the backup. That means, there is no checksum to check against during the restore. The current PR only makes sure that the backup database is not corrupted, by running the status check against the database. So, for testing purposes: 1. run DR backup and 2. corrupt the etcd db file (on linux use truncate to truncate the last few blocks of the database file). 3. Attempt to restore 4. The attempt to restore should fail with validation error. Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Moderate: OpenShift Container Platform 4.8.2 bug fix and security update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHSA-2021:2438 |