Created attachment 1454551[details]
cockpit
Description of problem:
The HostedEngine is in local maintenance and vm is down while in first deployment
Version-Release number of selected component (if applicable):
rhvh-4.2.4.3-0.20180622.0+1
rhvm-appliance-4.2-20180620.0.el7.noarch
cockpit-ovirt-dashboard-0.11.28-1.el7ev.noarch
ovirt-hosted-engine-ha-2.2.14-1.el7ev.noarch
ovirt-hosted-engine-setup-2.2.22-1.el7ev.noarch
vdsm-4.20.31-1.el7ev.x86_64
How reproducible:
20%
Steps to Reproduce:
1. Deploy HE via cockpit
Actual results:
The HostedEngine is in local maintenance and vm is down while in first deployment
[root@ibm-x3650m5-06 ~]# hosted-engine --vm-status
--== Host 1 status ==--
conf_on_shared_storage : True
Status up-to-date : True
Hostname : ibm-x3650m5-06.lab.eng.pek2.redhat.com
Host ID : 1
Engine status : {"reason": "bad vm status", "health": "bad", "vm": "down", "detail": "Down"}
Score : 0
stopped : False
Local maintenance : True
crc32 : bf57097d
local_conf_timestamp : 4094
Host timestamp : 4093
Extra metadata (valid at timestamp):
metadata_parse_version=1
metadata_feature_version=1
timestamp=4093 (Mon Jun 25 13:07:24 2018)
host-id=1
score=0
vm_conf_refresh_time=4094 (Mon Jun 25 13:07:24 2018)
conf_on_shared_storage=True
maintenance=True
state=LocalMaintenance
stopped=False
Expected results:
Deploy HE successfully and the HE-VM is up.
Additional info:
1. Just met this issue one time, so it is a small hit rate.
2. Removing the host from local maintenance, the VM will be up. It is the workaround.
3. From the engine UI, it displays a vm named "external-HostedEngineLocal", and its status is down.
Created attachment 1454551 [details] cockpit Description of problem: The HostedEngine is in local maintenance and vm is down while in first deployment Version-Release number of selected component (if applicable): rhvh-4.2.4.3-0.20180622.0+1 rhvm-appliance-4.2-20180620.0.el7.noarch cockpit-ovirt-dashboard-0.11.28-1.el7ev.noarch ovirt-hosted-engine-ha-2.2.14-1.el7ev.noarch ovirt-hosted-engine-setup-2.2.22-1.el7ev.noarch vdsm-4.20.31-1.el7ev.x86_64 How reproducible: 20% Steps to Reproduce: 1. Deploy HE via cockpit Actual results: The HostedEngine is in local maintenance and vm is down while in first deployment [root@ibm-x3650m5-06 ~]# hosted-engine --vm-status --== Host 1 status ==-- conf_on_shared_storage : True Status up-to-date : True Hostname : ibm-x3650m5-06.lab.eng.pek2.redhat.com Host ID : 1 Engine status : {"reason": "bad vm status", "health": "bad", "vm": "down", "detail": "Down"} Score : 0 stopped : False Local maintenance : True crc32 : bf57097d local_conf_timestamp : 4094 Host timestamp : 4093 Extra metadata (valid at timestamp): metadata_parse_version=1 metadata_feature_version=1 timestamp=4093 (Mon Jun 25 13:07:24 2018) host-id=1 score=0 vm_conf_refresh_time=4094 (Mon Jun 25 13:07:24 2018) conf_on_shared_storage=True maintenance=True state=LocalMaintenance stopped=False Expected results: Deploy HE successfully and the HE-VM is up. Additional info: 1. Just met this issue one time, so it is a small hit rate. 2. Removing the host from local maintenance, the VM will be up. It is the workaround. 3. From the engine UI, it displays a vm named "external-HostedEngineLocal", and its status is down.