Bug 1468875 - Synchronize hosted-engine maintenance state after power management operation
Summary: Synchronize hosted-engine maintenance state after power management operation
Keywords:
Status: CLOSED DEFERRED
Alias: None
Product: ovirt-engine
Classification: oVirt
Component: Backend.Core
Version: 4.1.3.5
Hardware: x86_64
OS: Linux
low
medium
Target Milestone: ---
: ---
Assignee: Nobody
QA Contact: meital avital
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2017-07-09 08:51 UTC by Artyom
Modified: 2021-11-25 14:55 UTC (History)
3 users (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2020-04-01 14:46:29 UTC
oVirt Team: Virt
Embargoed:
dfediuck: ovirt-4.3+


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Red Hat Issue Tracker RHV-44082 0 None None None 2021-11-25 14:55:26 UTC

Description Artyom 2017-07-09 08:51:17 UTC
Description of problem:
We need to synchronize the HE and the engine maintenance state after power management operation

Version-Release number of selected component (if applicable):
rhevm-4.1.3.5-0.1.el7.noarch

How reproducible:
Always

Steps to Reproduce:
1. Configure HE environment with at least two hosts
2. Configure power management on the host with HE VM
3. Put host with HE VM to maintenance
4. Restart host via power management

Actual results:
After host restart, host go straight forward to UP state, when the HE maintenance state still has "LocalMaintenance" state

Expected results:
Both the engine and the HE maintenance state must be synchronized after power management operation(or both will have state maintenance or both will have state up)

Additional info:
It affects our automation tests

Comment 1 Doron Fediuck 2017-07-09 10:31:45 UTC
This is actually by design.
There's a difference between having a host in maintenance to having a hosted-engine node in maintenance. The main reason is to allow maintenance of the HE parts without interrupting vdsm and other running VMs.

So just to summarize; 
A hosted engine node can be in local maintenance while the hypervisor in not in maintenance and working properly.

Comment 2 Artyom 2017-07-09 11:38:16 UTC
I can agree with it when the user himself manually set local maintenance on the host via HE CLI, but in this case, one that brake sync the engine, so the engine also must be responsible for restoring sync between maintenance states after power management operation, for example, we have power_saving with power management policy:
1) Set the policy
2) In case if the engine has more hosts in reserve than power_saving policy needed, the engine put one of the free hosts to maintenance(automatically the engine put host to local maintenance), the engine poweroff the host via power management
3) Now if the engine will need to power on the host, host will not enter into the maintenance state but will jump straight forward to the state UP, but it will stay in the local maintenance from the HE perspective

Comment 3 Martin Sivák 2018-07-04 10:56:08 UTC
Using power management on a host in local maintenance set from the hosted engine cli is a corner case and the hosted engine flags will stay as they are.

Is the webadmin based flow works like this?

1) Put a host to local maintenance
2) Reboot the host using the UI power management options
3) The host is rebooted and automatically activated

If it is then I think the activation should be done properly on the engine side or the host should return to local maintenance and leave the job of activating the host to the admin.


This is a corner case as far as SLA is concerned, but it might be a bug in the Infra area.

Comment 4 Michal Skrivanek 2020-03-10 12:25:49 UTC
change SLA team to virt, we're not tracking SLA separately anymore

Comment 5 Michal Skrivanek 2020-03-19 15:42:26 UTC
We didn't get to this bug for more than 2 years, and it's not being considered for the upcoming 4.4. It's unlikely that it will ever be addressed so I'm suggesting to close it.
If you feel this needs to be addressed and want to work on it please remove cond nack and target accordingly.

Comment 6 Michal Skrivanek 2020-04-01 14:46:29 UTC
ok, closing. Please reopen if still relevant/you want to work on it.

Comment 7 Michal Skrivanek 2020-04-01 14:50:25 UTC
ok, closing. Please reopen if still relevant/you want to work on it.


Note You need to log in before you can comment on or make changes to this bug.