Bug 891316
Summary: | ovirt-engine-backend [scalability]: Deadlock occurred during mass startup of VMs. | ||||||||
---|---|---|---|---|---|---|---|---|---|
Product: | Red Hat Enterprise Virtualization Manager | Reporter: | Omri Hochman <ohochman> | ||||||
Component: | ovirt-engine | Assignee: | Omer Frenkel <ofrenkel> | ||||||
Status: | CLOSED NOTABUG | QA Contact: | Yuri Obshansky <yobshans> | ||||||
Severity: | urgent | Docs Contact: | |||||||
Priority: | high | ||||||||
Version: | 3.2.0 | CC: | acathrow, bazulay, eedri, iheim, jkt, jturner, lpeer, michal.skrivanek, ofrenkel, ohochman, pstehlik, rgolan, Rhev-m-bugs, srevivo, yeylon | ||||||
Target Milestone: | --- | Keywords: | Regression, ZStream | ||||||
Target Release: | 3.4.0 | ||||||||
Hardware: | x86_64 | ||||||||
OS: | Linux | ||||||||
Whiteboard: | virt | ||||||||
Fixed In Version: | is1 | Doc Type: | Bug Fix | ||||||
Doc Text: | Story Points: | --- | |||||||
Clone Of: | |||||||||
: | 1060692 (view as bug list) | Environment: | |||||||
Last Closed: | 2014-05-13 08:58:00 UTC | Type: | Bug | ||||||
Regression: | --- | Mount Type: | --- | ||||||
Documentation: | --- | CRM: | |||||||
Verified Versions: | Category: | --- | |||||||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||||
Cloudforms Team: | --- | Target Upstream Version: | |||||||
Embargoed: | |||||||||
Bug Depends On: | |||||||||
Bug Blocks: | 1060692, 1078909, 1142926 | ||||||||
Attachments: |
|
Description
Omri Hochman
2013-01-02 15:09:25 UTC
Created attachment 671471 [details]
engine.log
Created attachment 671472 [details]
console_log
Fixed Description: ***************** I Started *1000* VMs with 256MB Memory (100 by 100) on 13 hosts. the deadlock is caused by the refresh thread holding the VdsManager lock waiting on decreasedPending lock and RunVm thread performing rerun() and holding the decreasedPending lock waiting to perform UpdateVdsDynamicData ( a VDS command which acquires the VdsManager lock) I see 2 main ways to solve this: 1. get rid of the decreasedPending lock and make it AtomicInteger to ensure atomicity and visibility without blocking 2. fix the order of lock acquisition in decreasedPending() method - first get the VdsManager lock and then perform decreasPending and call During doing some work on phantom vds status, the deadlock also will be solved a patch is added to bug Currently we do not have the resources (Lab) to test it. will have to push it forward to 3.4 QE cannot verify this bug in 3.3, will verify in 3.4 The bug is identical to bug *Bug 1060692* <https://bugzilla.redhat.com/show_bug.cgi?id=1060692> -ovirt-engine-backend [scalability]: Deadlock occurred during mass startup of VMs. which was fixed and verified in 3.3.2 So, closed |