Bug 1360265 - Engine doesn't start because of outdated entry of RunVm in command_entities
Summary: Engine doesn't start because of outdated entry of RunVm in command_entities
Keywords:
Status: CLOSED CURRENTRELEASE
Alias: None
Product: ovirt-engine
Classification: oVirt
Component: BLL.Virt
Version: 4.0.1
Hardware: Unspecified
OS: Unspecified
high
urgent
Target Milestone: ovirt-4.0.2
: 4.0.2.4
Assignee: Arik
QA Contact: sefi litmanovich
URL:
Whiteboard:
Depends On: 1360378
Blocks:
TreeView+ depends on / blocked
 
Reported: 2016-07-26 11:34 UTC by Arik
Modified: 2023-09-14 03:28 UTC (History)
5 users (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2016-08-12 14:30:09 UTC
oVirt Team: Virt
Embargoed:
rule-engine: ovirt-4.0.z+
rule-engine: blocker+
rule-engine: planning_ack+
michal.skrivanek: devel_ack+
mavital: testing_ack+


Attachments (Terms of Use)
database dump (9.07 MB, application/x-gzip)
2016-07-26 11:34 UTC, Arik
no flags Details


Links
System ID Private Priority Status Summary Last Updated
oVirt gerrit 61395 0 master MERGED core: protection from NPE in the initialization of RunVm 2016-07-27 07:29:33 UTC
oVirt gerrit 61447 0 ovirt-engine-4.0 MERGED core: protection from NPE in the initialization of RunVm 2016-07-27 12:19:25 UTC
oVirt gerrit 61484 0 ovirt-engine-4.0.2 MERGED core: protection from NPE in the initialization of RunVm 2016-07-27 19:52:47 UTC

Description Arik 2016-07-26 11:34:00 UTC
Created attachment 1184200 [details]
database dump

Description of problem:
If there are outdated entries that remain in command_entities, it might cause the engine to fail to start.
Specifically, in the reported case I see an instance of RunVmCommand in command_entities that refers to VM that has already been removed ('f8b68db9-3503-4295-afd9-92b0596f746d').

Version-Release number of selected component (if applicable):


How reproducible:
Not sure how to reproduce it exactly, but once we get to this state the engine will never start.

Steps to Reproduce:
1.
2.
3.

Actual results:
NPE prevents the engine from starting.

Expected results:
The engine should start.

Additional info:
I set the severity to urgent because it prevents the engine from starting.
However, I think that the priority can be lower because we have a workaround that worked - to delete the entries from command_entities.

Comment 1 Red Hat Bugzilla Rules Engine 2016-07-26 13:42:58 UTC
Bug tickets must have version flags set prior to targeting them to a release. Please ask maintainer to set the correct version flags and only then set the target milestone.

Comment 2 Michal Skrivanek 2016-07-27 05:00:43 UTC
The actual problem of engine strtup is tracked in bug 1360378

Comment 5 sefi litmanovich 2016-08-02 10:36:52 UTC
Verified with rhevm-4.0.2.3-0.1.el7ev.noarch.

Steps to re produce:

1. Invokes an action to fill command_entities table - added a new vm with preallocated disk.
2. Dumped the data from the command_entities table to some file in csv format.
3. Removed vm.
4. Stopped engine.
5. Copied the data from the file back to command_entities table.
6. Started engine.

Result:
Engine started with no problem. The redundant entries in command_entities table were removed after a certain timeout.

Comment 6 Red Hat Bugzilla 2023-09-14 03:28:43 UTC
The needinfo request[s] on this closed bug have been removed as they have been unresolved for 1000 days


Note You need to log in before you can comment on or make changes to this bug.