Bug 1320903 - engine service doesn't propagate signal
Summary: engine service doesn't propagate signal
Keywords:
Status: CLOSED WONTFIX
Alias: None
Product: ovirt-engine
Classification: oVirt
Component: Services
Version: 3.6.4
Hardware: Unspecified
OS: Unspecified
low
high
Target Milestone: ---
: ---
Assignee: Rafael Martins
QA Contact: Pavel Stehlik
URL:
Whiteboard:
Depends On:
Blocks:
Reported: 2016-03-24 09:46 UTC by Oved Ourfali
Modified: 2022-02-25 08:36 UTC (History)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed: 2016-06-02 08:22:56 UTC
oVirt Team: Integration


Links
System | ID | Private | Priority | Status | Summary | Last Updated
Red Hat Issue Tracker | RHV-44931 | 0 | None | None | None | 2022-02-25 08:36:29 UTC

Description Oved Ourfali 2016-03-24 09:46:32 UTC
Description of problem:
Suppose a signal sent to the engine service (a Python script) kills the script but not the engine process it started.
If you then restart the engine, the old engine is not stopped, because the service is already down, so you end up with two engines.
Only one of them can receive requests, of course, but both keep running many internal processes.

Version-Release number of selected component (if applicable):
3.6.4

How reproducible:
Happened once on a local RHEV environment. Didn't try to reproduce.

Steps to Reproduce:
Longer description - steps to reproduce can be easily derived from here.
* The engine was started on Mar 20, at 12:40:12, with PID 22749 for the
engine and PID 22747 for the service script.

* On Mar 21, around 13:01, someone tried to take a thread dump of the
engine, maybe because it wasn't working correctly. They tried to use
"kill -3", but used the wrong PID:

  # kill -3 22747

This killed the service script, but left the engine running.

* On Mar 22, around 13:33, someone restarted the engine, but this didn't
stop the engine that was already running, because the service script was
already gone. As a result, a new engine was started. This second engine
failed to listen on its ports, as they were in use by the old one, but
it still deployed the applications, connected to the database, and
started trying to manage the hosts. Note that the old engine was still
running and servicing UI and API requests.
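
The sequence above can be reproduced in miniature with a sketch like the following. It is only an illustration, not the actual engine service script: WRAPPER stands in for the service script, a sleeping child stands in for the engine, and the names pid_exists and signal_wrapper_only are hypothetical. It sends SIGQUIT (signal 3) to the wrapper alone and then checks whether the child is still around.

```python
import os
import signal
import subprocess
import sys

# Hypothetical stand-in for the service script: it starts a long-running
# child (the "engine"), prints the child's PID, and waits for it without
# installing any signal handlers.
WRAPPER = (
    "import subprocess, sys\n"
    "child = subprocess.Popen("
    "[sys.executable, '-c', 'import time; time.sleep(60)'])\n"
    "print(child.pid, flush=True)\n"
    "child.wait()\n"
)


def pid_exists(pid):
    """Return True if a process with this PID currently exists."""
    try:
        os.kill(pid, 0)  # signal 0 only checks for existence
    except ProcessLookupError:
        return False
    except PermissionError:
        return True  # exists, but belongs to another user
    return True


def signal_wrapper_only(sig=signal.SIGQUIT):
    """Signal the wrapper, then report (wrapper exit status, child alive?)."""
    with subprocess.Popen(
        [sys.executable, "-c", WRAPPER], stdout=subprocess.PIPE, text=True
    ) as wrapper:
        child_pid = int(wrapper.stdout.readline())
        wrapper.send_signal(sig)  # same mistake as "kill -3 <wrapper PID>"
        wrapper.wait()
    child_alive = pid_exists(child_pid)
    if child_alive:
        os.kill(child_pid, signal.SIGKILL)  # clean up the orphaned child
    return wrapper.returncode, child_alive
```

On a POSIX system the wrapper is expected to die from the signal while the child survives, which is the state the restart on Mar 22 started from.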

Actual results:
The signal wasn't propagated.

Expected results:
The signal should be propagated to the engine. IMO, on -3 (SIGQUIT) the service should keep running as well.
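
A minimal sketch of what propagation could look like, assuming the service script starts the engine as a direct child process. This is not the actual ovirt-engine service code; run_forwarding and FORWARDED are hypothetical names. The wrapper installs handlers that relay selected signals to the child and keeps waiting, so a "kill -3" aimed at the wrapper would reach the child and leave the wrapper running for as long as the child runs.

```python
import signal
import subprocess

# Signals the wrapper relays to the child instead of acting on them itself.
# The choice of signals is an assumption made for this sketch.
FORWARDED = (signal.SIGTERM, signal.SIGINT, signal.SIGHUP, signal.SIGQUIT)


def run_forwarding(cmd):
    """Run cmd as a child process and relay FORWARDED signals to it.

    Returns the child's exit status as reported by subprocess (a negative
    signal number if the child was terminated by a signal).
    """
    child = None

    def relay(signum, _frame):
        # Pass the signal on only if the child exists and has not exited.
        if child is not None and child.poll() is None:
            child.send_signal(signum)

    # Install the handlers before starting the child so that no signal
    # arriving in between falls back to the default action.
    previous = {sig: signal.signal(sig, relay) for sig in FORWARDED}
    try:
        child = subprocess.Popen(cmd)
        return child.wait()
    finally:
        # Put back whatever handlers were in place before.
        for sig, handler in previous.items():
            if handler is not None:
                signal.signal(sig, handler)
```

With a JVM child, relaying SIGQUIT would normally produce the thread dump while both processes stay up. Signals such as SIGTERM end the wrapper only once the child has exited, so the wrapper never disappears while the child is still running.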

Comment 1 Sandro Bonazzola 2016-05-02 10:04:11 UTC
Moving from 4.0 alpha to 4.0 beta since 4.0 alpha has already been released and the bug is not ON_QA.

Comment 2 Yaniv Lavi 2016-05-23 13:18:49 UTC
oVirt 4.0 beta has been released, moving to RC milestone.

Comment 3 Yaniv Lavi 2016-05-23 13:22:51 UTC
oVirt 4.0 beta has been released, moving to RC milestone.

Comment 4 Yaniv Lavi 2016-06-02 08:22:56 UTC
If you kill the process, it's up to you to make sure it is indeed dead. Closing as WONTFIX.

