Description of problem:

This is a request to replace the 'dd'-based copy in VDSM with the sg3_utils version.

Version-Release number of selected component (if applicable):

vdsm-4.16.8.1-5.el6ev.x86_64
sg3_utils-1.28-6.el6.x86_64

How reproducible:

100%

Steps to Reproduce:

1. Created a 200GB pre-allocated disk in the same storage domain.

2. Did a mock-up export from the SAN (source qcow2 thin disk) to the SAN (dest pre-alloc/raw disk) using qemu-img:

[root@localhost]# export TIMEFORMAT="%3R"
[root@localhost]# time /usr/bin/qemu-img convert -t none -f qcow2 d4ebc4c0-0092-4f60-8979-2d6c241cc7ef -O qcow2 f22756f4-129a-434c-84a2-8a5f0aa1e10c
966.510 <-- time elapsed from execution

3. Repeated the same test using dd (1MB block size, direct I/O read with fullblock, direct I/O write):

[root@localhost]# time dd if=d4ebc4c0-0092-4f60-8979-2d6c241cc7ef iflag=direct,fullblock of=f22756f4-129a-434c-84a2-8a5f0aa1e10c oflag=direct bs=1M
165888+0 records in
165888+0 records out
173946175488 bytes (174 GB) copied, 824.455 s, 211 MB/s

4. Repeated the same test using dd with the block size aligned to the PE size of the volume (32MB):

[root@localhost]# dd if=d4ebc4c0-0092-4f60-8979-2d6c241cc7ef iflag=direct,fullblock of=f22756f4-129a-434c-84a2-8a5f0aa1e10c oflag=direct bs=32M
5184+0 records in
5184+0 records out
173946175488 bytes (174 GB) copied, 587.914 s, 296 MB/s

5. Repeated the same test using sgp_dd with 12 threads and direct I/O (32MB per transfer):

[root@localhost 809c8fb5-400e-4648-bc86-1e4b6dd76bda]# sgp_dd time=1 thr=12 if=d4ebc4c0-0092-4f60-8979-2d6c241cc7ef iflag=direct of=f22756f4-129a-434c-84a2-8a5f0aa1e10c oflag=direct bpt=65535
Assume default 'bs' (block size) of 512 bytes
time to transfer data was 568.471145 secs, 305.99 MB/sec
339738624+0 records in
339738624+0 records out

In summary:

qemu-img -t none                         : 16.10 minutes
dd 1MB block (O_DIRECT)                  : 13.73 minutes
dd 32MB block (O_DIRECT)                 :  9.78 minutes
sgp_dd 32MB block (O_DIRECT, 12 threads) :  9.46 minutes
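For reference, the dd block-size comparison in steps 3 and 4 can be sketched as a self-contained script. This is a scaled-down illustration only: a 64 MiB temp file stands in for the 174 GB SAN volume, and the direct-I/O flags (iflag=direct,fullblock / oflag=direct) from the real tests are omitted since they fail on many filesystems; on the actual LVs they must be kept, or the page cache will mask the difference.

```shell
#!/bin/bash
# Scaled-down sketch of the dd block-size comparison (steps 3 and 4):
# the same copy at bs=1M and bs=32M. Paths and sizes are illustrative,
# not the SAN volumes from the report.
SRC=$(mktemp) DST=$(mktemp)
dd if=/dev/zero of="$SRC" bs=1M count=64 2>/dev/null

for bs in 1M 32M; do
    echo "bs=$bs:"
    # dd reports its own throughput line on stderr; keep just that line.
    dd if="$SRC" of="$DST" bs="$bs" 2>&1 | grep copied
done

cmp -s "$SRC" "$DST" && echo "copies match"
```

On a small cached file the two block sizes will time about the same; the 1M-vs-32M gap in the report only appears with O_DIRECT against the real storage, where per-request overhead dominates.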
(In reply to Simon Sekidde from comment #0)

> 4. repeated the same test using dd and block sized aligned to the PE size of
> the volume (32MB)
>
> [root@localhost]# dd if=d4ebc4c0-0092-4f60-8979-2d6c241cc7ef
> iflag=direct,fullblock of=f22756f4-129a-434c-84a2-8a5f0aa1e10c oflag=direct
> bs=32M
> 5184+0 records in
> 5184+0 records out
> 173946175488 bytes (174 GB) copied, 587.914 s, 296 MB/s
>
> 5. repeated the same test using sgp_dd, 12 threads and direct IO (32mb
> blocks)
>
> [root@localhost 809c8fb5-400e-4648-bc86-1e4b6dd76bda]# sgp_dd time=1 thr=12
> if=d4ebc4c0-0092-4f60-8979-2d6c241cc7ef iflag=direct
> of=f22756f4-129a-434c-84a2-8a5f0aa1e10c oflag=direct bpt=65535
> Assume default 'bs' (block size) of 512 bytes
> time to transfer data was 568.471145 secs, 305.99 MB/sec
> 339738624+0 records in
> 339738624+0 records out

According to this, there is no significant difference between sgp_dd and dd. Nobody will ever notice the difference between 587 and 568 seconds. What is the difference between sgp_dd and dd when copying multiple disks at the same time?
- While I assume sgp_dd could be faster than 'dd', we need to account for CPU usage as well - I don't see that measured above.
- For the usual 'clone VM' case, I would prefer we invest in XCOPY support (via ddpt?).
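One way the missing CPU accounting could be added to a rerun of the tests is the shell's POSIX `times` builtin, which prints accumulated user/system CPU for the shell and its children. A minimal sketch, again with a small temp file standing in for the SAN volumes and without the direct-I/O flags the real tests would keep:

```shell
#!/bin/bash
# Sketch of CPU accounting for a copy: wrap the command in a subshell
# and let `times` report user/sys CPU consumed by its children. In a
# real rerun, the dd line would be replaced by each candidate
# (dd with O_DIRECT, sgp_dd thr=12, qemu-img) in turn.
SRC=$(mktemp) DST=$(mktemp)
dd if=/dev/zero of="$SRC" bs=1M count=16 2>/dev/null

( dd if="$SRC" of="$DST" bs=1M 2>/dev/null
  echo "CPU (user sys) for shell, then children:"
  times )

cmp -s "$SRC" "$DST" && echo "copy ok"
```

Comparing the children's user+sys totals against wall time would show whether sgp_dd's 12 threads buy their extra throughput with proportionally more CPU.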