459339 – Misordered exceptions in snapshots may cause leaking of disk space

Bug 459339 - Misordered exceptions in snapshots may cause leaking of disk space

Keywords:
Status:	CLOSED WONTFIX
Alias:	None
Product:	Red Hat Enterprise Linux 6
Classification:	Red Hat
Component:	lvm2
Sub Component:
Version:	6.0
Hardware:	All
OS:	Linux
Priority:	medium
Severity:	low
Target Milestone:	rc
Target Release:	---
Assignee:	Mikuláš Patočka
QA Contact:	Corey Marthaler
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:

Reported:	2008-08-16 21:38 UTC by Mikuláš Patočka
Modified:	2010-11-29 23:04 UTC
CC List:	7 users
Fixed In Version:
Doc Type:	Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed:	2010-11-29 23:04:56 UTC
Target Upstream Version:
Embargoed:

Description Mikuláš Patočka 2008-08-16 21:38:34 UTC

Chunks for new exceptions are allocated sequentially at the time when they are first accessed. They are committed to the on-disk exception table when their copying finishes. Because that copying can finish out of order (the block layer or disk may reorder requests), exceptions are sometimes written out of order into the on-disk exception storage.

Normally there is no problem with out-of-order exceptions: each exception record contains both the new and the old chunk number, so the records will be loaded correctly on the next activation and there is no data corruption.
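To illustrate why ordering does not matter for correctness, here is a minimal sketch. It assumes each record is a pair of little-endian 64-bit chunk numbers (old, new); that layout and the helper names are assumptions made for illustration, not taken from this report.

```python
import struct

# Assumed record layout: (old_chunk, new_chunk), both little-endian 64-bit.
RECORD = struct.Struct("<QQ")

def pack_records(pairs):
    """Serialize (old, new) pairs in the order given."""
    return b"".join(RECORD.pack(old, new) for old, new in pairs)

def load_records(area):
    """Rebuild the old -> new mapping from a metadata area."""
    table = {}
    for old, new in RECORD.iter_unpack(area):
        table[old] = new
    return table

# The same three exceptions, committed in two different orders.
in_order = pack_records([(0, 2), (1, 3), (2, 4)])
out_of_order = pack_records([(2, 4), (0, 2), (1, 3)])

# Each record is self-describing, so both orders load to the same mapping.
same_mapping = load_records(in_order) == load_records(out_of_order)
print(same_mapping)
```

Because every record names both chunks explicitly, the loader never depends on a record's position in the area.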

However, when the computer crashes at a certain point (after the exception record for a high-numbered chunk has been written but before the record for a lower-numbered chunk has been written), the chunk allocated for that lower-numbered exception is lost and will never be used.

As a result, a snapshot that would otherwise have sufficient space can overflow.

Example (assume 4k chunk size):
- write simultaneously to chunks 0 ... 511 on the origin.
- exceptions are allocated at places 2 ... 257 and 259 ... 514 in the chunk storage. All the exceptions are being copied simultaneously.
- now assume that copying of the exceptions for origin chunks 256 ... 511 (snapshot chunks 259 ... 514) finishes first.
- records containing origin 256 ... 511 and snapshot 259 ... 514 are written into the exception metadata area at chunk 1.

now assume *CRASH*

After reboot, the 256 exceptions from the metadata area at chunk 1 are loaded correctly. The next free snapshot exception number is set to 515. The snapshot chunks 2 ... 257 are lost and can never be used.
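The example above can be replayed with a small simulation. This is only a sketch under stated assumptions: 16-byte exception records (so 256 records per 4k metadata chunk), metadata chunks at 1, 258, 515, ..., and a next-free pointer recomputed on load as the highest committed snapshot chunk plus one. The function names are hypothetical and do not correspond to kernel code.

```python
CHUNK_SIZE = 4096
RECORD_SIZE = 16                      # assumed: two 64-bit chunk numbers
PER_AREA = CHUNK_SIZE // RECORD_SIZE  # 256 records per metadata chunk
STRIDE = PER_AREA + 1                 # one metadata chunk + 256 data chunks

def is_metadata(chunk):
    """Assumed layout: chunk 0 is the header; 1, 258, 515, ... hold records."""
    return chunk % STRIDE == 1

def allocate(count, first=1):
    """Hand out data chunks sequentially, skipping metadata chunks."""
    chunks, chunk = [], first
    while len(chunks) < count:
        if not is_metadata(chunk):
            chunks.append(chunk)
        chunk += 1
    return chunks

def recover(committed):
    """Model activation after a crash: only committed records are seen."""
    used = {new for _, new in committed}
    next_free = max(used) + 1
    leaked = [c for c in range(2, next_free)
              if not is_metadata(c) and c not in used]
    return next_free, leaked

# Origin chunks 0 ... 511 get snapshot chunks 2 ... 257 and 259 ... 514.
exceptions = list(zip(range(512), allocate(512)))

# Only the copies for origin 256 ... 511 finished before the crash.
committed = exceptions[256:]

next_free, leaked = recover(committed)
print(next_free, leaked[0], leaked[-1], len(leaked) * CHUNK_SIZE)
```

Under these assumptions, the leaked range is exactly the 256 chunks whose records had not yet been committed, which is 1 MiB of snapshot space at a 4k chunk size.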

Comment 1 RHEL Program Management 2009-06-15 21:01:30 UTC

This request was evaluated by Red Hat Product Management for inclusion in a Red
Hat Enterprise Linux major release.  Product Management has requested further
review of this request by Red Hat Engineering, for potential inclusion in a Red
Hat Enterprise Linux Major release.  This request is not yet committed for
inclusion.

Comment 3 RHEL Program Management 2010-07-15 15:02:31 UTC

This issue has been proposed when we are only considering blocker
issues in the current Red Hat Enterprise Linux release. It has
been denied for the current Red Hat Enterprise Linux release.

** If you would still like this issue considered for the current
release, ask your support representative to file as a blocker on
your behalf. Otherwise ask that it be considered for the next
Red Hat Enterprise Linux release. **

Comment 6 Mikuláš Patočka 2010-11-29 23:04:24 UTC

A fix would require re-architecting the snapshot support. And the bug isn't really serious: if the machine crashes, a few chunks may remain allocated but unused, but there is no data corruption. So I'm NAKing it.

Comment 7 RHEL Program Management 2010-11-29 23:04:56 UTC

Development Management has reviewed and declined this request.  You may appeal
this decision by reopening this request.