Bug 1812122 - Improve the reliability of device remove
Summary: Improve the reliability of device remove
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Gluster Storage
Classification: Red Hat Storage
Component: heketi
Version: ocs-3.11
Hardware: Unspecified
OS: Unspecified
Priority: unspecified
Severity: high
Target Milestone: ---
Target Release: OCS 3.11.z Batch Update 6
Assignee: John Mulligan
QA Contact: Vinayak Papnoi
Docs Contact: Amrita
URL:
Whiteboard:
Depends On: 1630172
Blocks: 1850072
Reported: 2020-03-10 14:56 UTC by John Mulligan
Modified: 2021-06-01 17:02 UTC
CC List: 12 users

Fixed In Version: heketi-9.0.0-12
Doc Type: Enhancement
Doc Text:
With this update, Heketi’s device remove operation is fully tracked and based on a series of brick eviction operations, making the overall process much more reliable. Previously, Heketi tracked an operation for the overall device replacement, but that operation did not track the underlying changes Heketi made to volumes as part of a device remove. When an error occurred during device remove, the lack of detailed tracking could have led to the database becoming inconsistent with Gluster, creating a need to repair the database manually.
Clone Of:
Clones: 1850072
Environment:
Last Closed: 2020-12-17 04:31:42 UTC
Embargoed:


Links
System ID Private Priority Status Summary Last Updated
Red Hat Product Errata RHBA-2020:5602 0 None None None 2020-12-17 04:32:12 UTC

Description John Mulligan 2020-03-10 14:56:06 UTC
Description of problem:

The last large action customers take using heketi that is not based on reliable operations metadata is 'device remove' / 'device replace'. Changing this to be based on operations would greatly improve the reliability of the system and allow the heketi auto-clean behavior to assist with keeping the db in a good state.


In theory, this would reduce the number of times the support team needs assistance from developers and would avoid having to edit the db as JSON.
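
To illustrate the idea of basing device remove on tracked operations, here is a minimal Python sketch. It is not heketi's code (heketi is written in Go), and every name in it (Db, evict_brick, remove_device, clean_pending, replace_on_gluster) is hypothetical. It assumes a toy db that maps brick ids to device ids and records a pending entry before each per-brick change, so that a failure partway through leaves a trace that a clean-up pass can find.

```python
from dataclasses import dataclass, field


@dataclass
class Db:
    """Toy stand-in for the db: brick id -> device id, plus pending operations."""
    bricks: dict
    pending: dict = field(default_factory=dict)
    next_op: int = 0


def evict_brick(db, brick, target, replace_on_gluster):
    """Move one brick to `target` as a single tracked operation."""
    op_id = db.next_op
    db.next_op += 1
    # Record the intent before touching Gluster, so a failure leaves a trace.
    db.pending[op_id] = {"brick": brick, "from": db.bricks[brick], "to": target}
    replace_on_gluster(brick, target)  # hypothetical callback; may raise
    # Finalize the db only after the Gluster-side change succeeded.
    db.bricks[brick] = target
    del db.pending[op_id]


def remove_device(db, device, target, replace_on_gluster):
    """Empty `device` by evicting its bricks one at a time."""
    for brick in sorted(b for b, d in db.bricks.items() if d == device):
        evict_brick(db, brick, target, replace_on_gluster)


def clean_pending(db):
    """Rough analogue of an auto-clean pass: drop stale pending entries.

    In this toy model brick placement is only updated on success, so
    discarding the pending entry is enough to restore a consistent db.
    """
    stale = list(db.pending.values())
    db.pending.clear()
    return stale
```

In this sketch, a failure while evicting the second of two bricks leaves the first brick moved, the second brick unchanged, and one pending entry naming the interrupted eviction. After clean_pending runs, remove_device can simply be retried, with no manual editing of the db.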

Comment 18 errata-xmlrpc 2020-12-17 04:31:42 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (OpenShift Container Storage 3.11.z bug fix update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2020:5602

