Note: This bug is displayed in read-only format because the product is no longer active in Red Hat Bugzilla.

Bug 1595091

Summary: Sometimes the Hosted Engine is not taken out of local maintenance during deploy from Cockpit and vm is not booted.
Product: [oVirt] ovirt-hosted-engine-setup
Reporter: Yihui Zhao <yzhao>
Component: General
Assignee: Ido Rosenzwig <irosenzw>
Status: CLOSED DUPLICATE
QA Contact: Liran Rotenberg <lrotenbe>
Severity: medium
Docs Contact:
Priority: medium
Version: 2.2.22
CC: bugs, cshao, huzhao, phbailey, qiyuan, rbarry, sbonazzo, stirabos, weiwang, yaniwang, ycui, yturgema
Target Milestone: ovirt-4.3.0
Flags: rule-engine: ovirt-4.3?, cshao: testing_ack?
Target Release: ---
Hardware: Unspecified
OS: Unspecified
Whiteboard:
Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of:
Environment:
Last Closed: 2019-01-09 09:32:31 UTC
Type: Bug
Regression: ---
Mount Type: ---
Documentation: ---
CRM:
Verified Versions:
Category: ---
oVirt Team: Integration
RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: ---
Target Upstream Version:
Embargoed:
Attachments:
Description              Flags
cockpit                  none
engineUI                 none
agent_broker_vdsm_log    none

Description Yihui Zhao 2018-06-26 06:04:47 UTC
Created attachment 1454551: cockpit

Description of problem:
After the first deployment, the hosted engine host is left in local maintenance and the HostedEngine VM is down.

Version-Release number of selected component (if applicable):
rhvh-4.2.4.3-0.20180622.0+1
rhvm-appliance-4.2-20180620.0.el7.noarch
cockpit-ovirt-dashboard-0.11.28-1.el7ev.noarch
ovirt-hosted-engine-ha-2.2.14-1.el7ev.noarch
ovirt-hosted-engine-setup-2.2.22-1.el7ev.noarch
vdsm-4.20.31-1.el7ev.x86_64

How reproducible:
20%


Steps to Reproduce:
1. Deploy HE via Cockpit.

Actual results:
After the first deployment, the hosted engine host is left in local maintenance and the HostedEngine VM is down:
[root@ibm-x3650m5-06 ~]# hosted-engine --vm-status


--== Host 1 status ==--

conf_on_shared_storage             : True
Status up-to-date                  : True
Hostname                           : ibm-x3650m5-06.lab.eng.pek2.redhat.com
Host ID                            : 1
Engine status                      : {"reason": "bad vm status", "health": "bad", "vm": "down", "detail": "Down"}
Score                              : 0
stopped                            : False
Local maintenance                  : True
crc32                              : bf57097d
local_conf_timestamp               : 4094
Host timestamp                     : 4093
Extra metadata (valid at timestamp):
	metadata_parse_version=1
	metadata_feature_version=1
	timestamp=4093 (Mon Jun 25 13:07:24 2018)
	host-id=1
	score=0
	vm_conf_refresh_time=4094 (Mon Jun 25 13:07:24 2018)
	conf_on_shared_storage=True
	maintenance=True
	state=LocalMaintenance
	stopped=False
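
The condition reported here (host in local maintenance while the engine VM is down) can be checked from the text output of `hosted-engine --vm-status`. The following is a minimal sketch, assuming the output keeps the "field : value" layout shown above; the function names `parse_vm_status` and `stuck_in_local_maintenance` are hypothetical helpers, not part of any oVirt tool, and the sample text is trimmed from this report.

```python
import json

# Trimmed copy of the output in this report, used only as sample input.
SAMPLE = """\
--== Host 1 status ==--

Hostname                           : ibm-x3650m5-06.lab.eng.pek2.redhat.com
Engine status                      : {"reason": "bad vm status", "health": "bad", "vm": "down", "detail": "Down"}
Score                              : 0
Local maintenance                  : True
Extra metadata (valid at timestamp):
	timestamp=4093 (Mon Jun 25 13:07:24 2018)
	state=LocalMaintenance
"""


def parse_vm_status(text):
    """Return {host header: {field: value}} from the plain-text status output."""
    hosts = {}
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        if line.startswith("--==") and line.endswith("==--"):
            # Header such as "--== Host 1 status ==--" starts a new host block.
            current = hosts.setdefault(line.strip("-= "), {})
            continue
        if current is None or ":" not in line:
            continue
        key, _, value = line.partition(":")
        if "=" in key:
            # key=value metadata lines are ignored in this sketch.
            continue
        current[key.strip()] = value.strip()
    return hosts


def stuck_in_local_maintenance(fields):
    """True when the host is in local maintenance and the engine VM is down."""
    engine = json.loads(fields.get("Engine status", "{}"))
    return fields.get("Local maintenance") == "True" and engine.get("vm") == "down"


hosts = parse_vm_status(SAMPLE)
for name, fields in hosts.items():
    print(name, stuck_in_local_maintenance(fields))
```

The "Engine status" value is parsed as JSON because it appears as a JSON object in the output above; other fields are kept as plain strings.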

Expected results:
HE deploys successfully and the HE VM is up.

Additional info:
1. I have hit this issue only once, so the hit rate is low.

2. Workaround: removing the host from local maintenance brings the VM up.

3. The engine UI displays a VM named "external-HostedEngineLocal" whose status is down.
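
The workaround in item 2 can be scripted. This is a sketch only: the report does not state the exact command that was used, so it assumes the usual `hosted-engine --set-maintenance --mode=none` invocation. The wrapper name `exit_local_maintenance` and its injectable `run` parameter are hypothetical, added so the logic can be tried without a real host.

```python
import subprocess

# Assumed workaround command: clear maintenance mode on this host.
EXIT_MAINTENANCE_CMD = ["hosted-engine", "--set-maintenance", "--mode=none"]


def exit_local_maintenance(run=subprocess.run):
    """Run the workaround command and report whether it exited with status 0.

    `run` defaults to subprocess.run; pass a stand-in to exercise the
    function on a machine without hosted-engine installed.
    """
    result = run(EXIT_MAINTENANCE_CMD, check=False)
    return result.returncode == 0
```

On an affected host, calling `exit_local_maintenance()` as root and then re-running `hosted-engine --vm-status` should show whether the VM has come up.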

Comment 1 Yihui Zhao 2018-06-26 06:05:22 UTC
Created attachment 1454552: engineUI

Comment 2 Yihui Zhao 2018-06-26 06:08:52 UTC
Created attachment 1454553: agent_broker_vdsm_log

Comment 3 Simone Tiraboschi 2018-08-06 12:16:11 UTC
I think it's pretty close to, if not a duplicate of, https://bugzilla.redhat.com/show_bug.cgi?id=1599616

See https://bugzilla.redhat.com/show_bug.cgi?id=1599616#c10

Comment 4 Sandro Bonazzola 2019-01-09 09:32:31 UTC
Closing as duplicate according to comment #3

*** This bug has been marked as a duplicate of bug 1599616 ***