Technical Guide in 3.5.z, section 5.9 Migration: A bandwidth limit of 30 Mbps is imposed on each virtual machine migration. A migration will timeout after a certain amount of time has passed. The timeout happens after either 300 seconds, or after the amount of virtual machine memory in Mb divided by 2048 Mb multiplied by 300 seconds, whichever is larger. By default, concurrent outgoing migrations are limited to one per CPU core per host, or 5, whichever is smaller ^^ defaults are very outdated, ever since IIRC 3.2 we have a limit of 32MiBps (megabytes/s), concurrent migration has a limit of 3, there is a "progress" timeout of 150s (stalling, when migration doesn't progress it's aborted), 'migration_max_time_per_gib_mem' is 64s
this is an automated message. oVirt 3.6.0 RC3 has been released and GA is targeted to next week, Nov 4th 2015. Please review this bug and if not a blocker, please postpone to a later release. All bugs not postponed on GA release will be automatically re-targeted to - 3.6.1 if severity >= high - 4.0 if severity < high
Clearing needinfo request raised against the default assignee.
Is this still correct for 3.6?
yes. until bug 1252426 gets in
Assigning to Tahlia for review.
> "progress" timeout of 150s (stalling, when migration doesn't progress it's > aborted), 'migration_max_time_per_gib_mem' is 64s Just to clarify this last point, does this mean that "The timeout happens after either 300 seconds, or after the amount of virtual machine memory in Mb divided by 2048 Mb multiplied by 300 seconds, whichever is larger." should now be "The timeout happens after either 150 seconds, or after 64 seconds per Gb of virtual machine memory, whichever is larger."? The paragraph would then read in full: "A bandwidth limit of 32 Mbps is imposed on each virtual machine migration. A migration will timeout after a certain amount of time has passed. The timeout happens after either 150 seconds, or after 64 seconds per Gb of virtual machine memory, whichever is larger. By default, concurrent outgoing migrations are limited to one per CPU core per host, or 3, whichever is smaller." Is this correct?
(In reply to Tahlia Richardson from comment #6) > > "progress" timeout of 150s (stalling, when migration doesn't progress it's > > aborted), 'migration_max_time_per_gib_mem' is 64s > > Just to clarify this last point, does this mean that "The timeout happens > after either 300 seconds, or after the amount of virtual machine memory in > Mb divided by 2048 Mb multiplied by 300 seconds, whichever is larger." > should now be "The timeout happens after either 150 seconds, or after 64 > seconds per Gb of virtual machine memory, whichever is larger."? > > > > The paragraph would then read in full: "A bandwidth limit of 32 Mbps is > imposed on each virtual machine migration. A migration will timeout after a > certain amount of time has passed. The timeout happens after either 150 > seconds, or after 64 seconds per Gb of virtual machine memory, whichever is > larger. > By default, concurrent outgoing migrations are limited to one per CPU core > per host, or 3, whichever is smaller." Is this correct? It's 32MiBps (megabytes/s), not Mbps. Please make the distinction clear. 150s timeout describes only a situation when migration is not progressing. It's not a total limit, it's a "stalling" time limit, counting time when the amount of required memory to transfer increases or doesn't change as opposed to normal situation when it progresses from 0% to 100%. That 64s per 1GB is correct and it is indeed total/absolute limit. Also, please note a bug 1323952 where we changed some of the values in 3.6.6 (bandwidth is 52MiBps, concurrent ongoing migrations are limited to 2, and stalling timeout is 240s). It's always good to refer e.g. https://access.redhat.com/solutions/744423 for more details about tuning migration for more details about the various settings and values Thanks
Thanks Michal. I'll add these updates for the next 3.5 and 3.6 updates, and clone a new bug for the 3.6.6 changes so they don't go out too early.
Because I changed the structure of the paragraph, I've had another writer look over it; but since it's otherwise just detail changes which have been reviewed and approved by engineering, I'm going to verify this one myself once I've merged it into the master and 3.5 branches.
Merged to master and 3.5; revision histories updated.
Content now published in https://access.redhat.com/documentation/en-US/Red_Hat_Enterprise_Virtualization/3.6/html/Technical_Reference/Migration.html and https://access.redhat.com/documentation/en-US/Red_Hat_Enterprise_Virtualization/3.5/html/Technical_Guide/Migration.html