817770 – [RFE] Investigate ways to improve internal interfaces/caching to support more-granular metadata

Bug 817770 - [RFE] Investigate ways to improve internal interfaces/caching to support more-granular metadata

Summary: [RFE] Investigate ways to improve internal interfaces/caching to support more...

Keywords:
Status:	CLOSED WONTFIX
Alias:	None
Product:	LVM and device-mapper
Classification:	Community
Component:	lvm2
Sub Component:
Version:	0-beta1
Hardware:	All
OS:	Linux
Priority:	medium
Severity:	medium
Target Milestone:	---
Target Release:	---
Assignee:	LVM and device-mapper development team
QA Contact:	cluster-qe@redhat.com
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+	depends on / blocked

Reported:	2012-05-01 08:18 UTC by Alasdair Kergon
Modified:	2019-10-23 19:41 UTC (History)
CC List:	8 users (show)
Fixed In Version:
Clone Of:
Environment:
Last Closed:	2019-10-23 19:41:55 UTC
Embargoed:
Flags:	rule-engine: lvm-technical-solution? rule-engine: lvm-test-coverage?

Attachments	(Terms of Use)

Description Alasdair Kergon 2012-05-01 08:18:46 UTC

E.g. provide the ability to update a subset of LVs in a VG in isolation

Maybe LV-level functions/caching.

Comment 2 RHEL Program Management 2012-05-05 04:15:22 UTC

Since RHEL 6.3 External Beta has begun, and this bug remains
unresolved, it has been rejected as it is not proposed as
exception or blocker.

Red Hat invites you to ask your support representative to
propose this request, if appropriate and relevant, in the
next release of Red Hat Enterprise Linux.

Comment 3 Petr Rockai 2012-05-20 10:00:44 UTC

I think the two changes needed first are:
- remove most/all of lvmcache
- add a per-LV query interface to lvmetad

Both should get a separate BZ tickets I think. For the rest, I suggest following:
- internally, use a transaction-oriented interface
- existing code that requires multiple commits needs to be refactored

- complex commits need to be handled more atomically
- all possibly-failing *computation* needs to be done before the commit starts
- this is especially troublesome with current mirror handling code
and more so with cluster mirrors

What I suggest: the "computation" part of a transaction builds up an action log, which has both DM actions (suspend, resume, etc.) and metadata actions (write, commit, update lvmetad) appropriately interspersed. The log is played out on a commit, discarded on abort and it could also contain an undo action for each step. That would make it possible to clean up a partially aborted commit (which currently leave behind messed up state in many cases).

The idea is, that a "traditional" LVM2 VG will, upon opening a transaction, read the VG metadata (either from lvmetad, or through a globally-locked scan operation) without grabbing VG locks. The commit can then obtain appropriate locks (i.e. lock out both scanning and other commits on the same VG), check that seqno did not change and either proceed with repeating the action log or aborting the transaction. Since the computation within a transaction is side-effect free (it instead builds an action log), it can be restarted easily. Therefore, this approach improves concurrency (especially for read-intensive workloads), even without a new metadata format.

The tricky part is reworking the operations that run within a transaction. It might be possible to go incrementally about this, but it all depends on how the above works out.

This change then enables better granularity by hiding the VG itself behind an opaque concept of transaction. The transaction does *not* expose a VG structure, only per-LV and per-PV queries (lookup by name, uuid, parameter queries, segments...). LV and PV enumeration should be possible too, although used sparingly.

When the above changes are done, a LVM3 format can then implement the transaction-oriented APIs directly, on top of a transactional metadata store. Action log is still required for DM actions. However, fewer conflicts would arise with such a store, allowing further concurrency, and less reading and parsing would need to be done when opening transactions (linear in the operation size, not in the VG size).

Opinions?

Comment 4 Alasdair Kergon 2012-05-21 00:48:12 UTC

Reminds me of one more requirement I forgot to throw into the mix: the ability to find out from any lvm command precisely what changes it would make without actually changing anything yet.  (A souped-up --test that can cope with multi-step processes.)

Comment 5 Tom Lavigne 2012-09-07 15:22:15 UTC

This request was evaluated by Red Hat Product Management for 
inclusion in the current release of Red Hat Enterprise Linux.
Since we are unable to provide this feature at this time,  
it has been proposed for the next release of 
Red Hat Enterprise Linux.

Comment 7 Peter Rajnoha 2015-10-14 08:58:00 UTC

This is not a change suitable for RHEL6, moving to RHEl7 for consideration.

Note You need to log in before you can comment on or make changes to this bug.