Bug 1260630 - [Docs] [Technical] Wrong migration settings description in Technical Guide
Summary: [Docs] [Technical] Wrong migration settings description in Technical Guide
Keywords:
Status: CLOSED CURRENTRELEASE
Alias: None
Product: Red Hat Enterprise Virtualization Manager
Classification: Red Hat
Component: Documentation
Version: 3.5.3
Hardware: Unspecified
OS: Unspecified
medium
high
Target Milestone: ovirt-3.6.5
: ---
Assignee: Tahlia Richardson
QA Contact:
URL: https://access.redhat.com/documentati...
Whiteboard:
Depends On:
Blocks: 1328636
TreeView+ depends on / blocked
 
Reported: 2015-09-07 11:51 UTC by Michal Skrivanek
Modified: 2016-05-09 00:59 UTC (History)
9 users (show)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Clone Of:
: 1328636 (view as bug list)
Environment:
Last Closed: 2016-05-09 00:59:21 UTC
oVirt Team: Docs
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)

Description Michal Skrivanek 2015-09-07 11:51:07 UTC
Technical Guide in 3.5.z, section 5.9 Migration:

A bandwidth limit of 30 Mbps is imposed on each virtual machine migration. A migration will timeout after a certain amount of time has passed. The timeout happens after either 300 seconds, or after the amount of virtual machine memory in Mb divided by 2048 Mb multiplied by 300 seconds, whichever is larger.
By default, concurrent outgoing migrations are limited to one per CPU core per host, or 5, whichever is smaller


^^ defaults are very outdated, ever since IIRC 3.2 we have a limit of 32MiBps (megabytes/s), concurrent migration has a limit of 3, there is a "progress" timeout of 150s (stalling, when migration doesn't progress it's aborted), 'migration_max_time_per_gib_mem' is 64s

Comment 1 Sandro Bonazzola 2015-10-26 12:36:30 UTC
this is an automated message. oVirt 3.6.0 RC3 has been released and GA is targeted to next week, Nov 4th 2015.
Please review this bug and if not a blocker, please postpone to a later release.
All bugs not postponed on GA release will be automatically re-targeted to

- 3.6.1 if severity >= high
- 4.0 if severity < high

Comment 2 Andrew Dahms 2015-11-18 12:55:38 UTC
Clearing needinfo request raised against the default assignee.

Comment 3 Yaniv Lavi 2016-03-16 12:55:19 UTC
Is this still correct for 3.6?

Comment 4 Michal Skrivanek 2016-03-16 13:03:29 UTC
yes. until bug 1252426 gets in

Comment 5 Lucy Bopf 2016-03-22 06:13:34 UTC
Assigning to Tahlia for review.

Comment 6 Tahlia Richardson 2016-04-13 23:53:57 UTC
> "progress" timeout of 150s (stalling, when migration doesn't progress it's
> aborted), 'migration_max_time_per_gib_mem' is 64s

Just to clarify this last point, does this mean that "The timeout happens after either 300 seconds, or after the amount of virtual machine memory in Mb divided by 2048 Mb multiplied by 300 seconds, whichever is larger." should now be "The timeout happens after either 150 seconds, or after 64 seconds per Gb of virtual machine memory, whichever is larger."?



The paragraph would then read in full: "A bandwidth limit of 32 Mbps is imposed on each virtual machine migration. A migration will timeout after a certain amount of time has passed. The timeout happens after either 150 seconds, or after 64 seconds per Gb of virtual machine memory, whichever is larger.
By default, concurrent outgoing migrations are limited to one per CPU core per host, or 3, whichever is smaller." Is this correct?

Comment 7 Michal Skrivanek 2016-04-14 07:18:46 UTC
(In reply to Tahlia Richardson from comment #6)
> > "progress" timeout of 150s (stalling, when migration doesn't progress it's
> > aborted), 'migration_max_time_per_gib_mem' is 64s
> 
> Just to clarify this last point, does this mean that "The timeout happens
> after either 300 seconds, or after the amount of virtual machine memory in
> Mb divided by 2048 Mb multiplied by 300 seconds, whichever is larger."
> should now be "The timeout happens after either 150 seconds, or after 64
> seconds per Gb of virtual machine memory, whichever is larger."?
> 
> 
> 
> The paragraph would then read in full: "A bandwidth limit of 32 Mbps is
> imposed on each virtual machine migration. A migration will timeout after a
> certain amount of time has passed. The timeout happens after either 150
> seconds, or after 64 seconds per Gb of virtual machine memory, whichever is
> larger.
> By default, concurrent outgoing migrations are limited to one per CPU core
> per host, or 3, whichever is smaller." Is this correct?

It's 32MiBps (megabytes/s), not Mbps. Please make the distinction clear.
150s timeout describes only a situation when migration is not progressing. It's not a total limit, it's a "stalling" time limit, counting time when the amount of required memory to transfer increases or doesn't change as opposed to normal situation when it progresses from 0% to 100%. That 64s per 1GB is correct and it is indeed total/absolute limit.

Also, please note a bug 1323952 where we changed some of the values in 3.6.6 (bandwidth is 52MiBps, concurrent ongoing migrations are limited to 2, and stalling timeout is 240s). It's always good to refer e.g. https://access.redhat.com/solutions/744423 for more details about tuning migration for more details about the various settings and values

Thanks

Comment 8 Tahlia Richardson 2016-04-20 00:00:34 UTC
Thanks Michal. I'll add these updates for the next 3.5 and 3.6 updates, and clone a new bug for the 3.6.6 changes so they don't go out too early.

Comment 12 Tahlia Richardson 2016-04-21 00:04:54 UTC
Because I changed the structure of the paragraph, I've had another writer look over it; but since it's otherwise just detail changes which have been reviewed and approved by engineering, I'm going to verify this one myself once I've merged it into the master and 3.5 branches.

Comment 13 Tahlia Richardson 2016-04-21 00:25:26 UTC
Merged to master and 3.5; revision histories updated.


Note You need to log in before you can comment on or make changes to this bug.