Bug 1589650 - Deploy HE failed on [Initialize lockspace volume] task from cockpit
Summary: Deploy HE failed on [Initialize lockspace volume] task from cockpit
Keywords:
Status: CLOSED DUPLICATE of bug 1534197
Alias: None
Product: ovirt-hosted-engine-setup
Classification: oVirt
Component: Plugins.General
Version: ---
Hardware: Unspecified
OS: Unspecified
unspecified
urgent
Target Milestone: ---
: ---
Assignee: Simone Tiraboschi
QA Contact: meital avital
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2018-06-11 05:31 UTC by Yihui Zhao
Modified: 2018-06-19 05:34 UTC (History)
14 users (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2018-06-12 08:46:14 UTC
oVirt Team: Integration
Embargoed:
rbarry: blocker?
yzhao: testing_ack+


Attachments (Terms of Use)
create_target_vm.log (8.41 MB, text/plain)
2018-06-11 05:36 UTC, Yihui Zhao
no flags Details

Description Yihui Zhao 2018-06-11 05:31:53 UTC
Description of problem:
Deploy HE failed on [Initialize lockspace volume] task.
"""
[ INFO ] changed: [localhost]
[ INFO ] TASK [Initialize lockspace volume]
[ ERROR ] fatal: [localhost]: FAILED! => {"attempts": 5, "changed": true, "cmd": ["hosted-engine", "--reinitialize-lockspace", "--force"], "delta": "0:00:05.558402", "end": "2018-06-11 11:11:36.677061", "msg": "non-zero return code", "rc": 1, "start": "2018-06-11 11:11:31.118659", "stderr": "Traceback (most recent call last):\n File \"/usr/lib64/python2.7/runpy.py\", line 162, in _run_module_as_main\n \"__main__\", fname, loader, pkg_name)\n File \"/usr/lib64/python2.7/runpy.py\", line 72, in _run_code\n exec code in run_globals\n File \"/usr/lib/python2.7/site-packages/ovirt_hosted_engine_setup/reinitialize_lockspace.py\", line 30, in <module>\n ha_cli.reset_lockspace(force)\n File \"/usr/lib/python2.7/site-packages/ovirt_hosted_engine_ha/client/client.py\", line 283, in reset_lockspace\n stats = broker.get_stats_from_storage()\n File \"/usr/lib/python2.7/site-packages/ovirt_hosted_engine_ha/lib/brokerlink.py\", line 135, in get_stats_from_storage\n result = self._proxy.get_stats()\n File \"/usr/lib64/python2.7/xmlrpclib.py\", line 1233, in __call__\n return self.__send(self.__name, args)\n File \"/usr/lib64/python2.7/xmlrpclib.py\", line 1591, in __request\n verbose=self.__verbose\n File \"/usr/lib64/python2.7/xmlrpclib.py\", line 1273, in request\n return self.single_request(host, handler, request_body, verbose)\n File \"/usr/lib64/python2.7/xmlrpclib.py\", line 1301, in single_request\n self.send_content(h, request_body)\n File \"/usr/lib64/python2.7/xmlrpclib.py\", line 1448, in send_content\n connection.endheaders(request_body)\n File \"/usr/lib64/python2.7/httplib.py\", line 1037, in endheaders\n self._send_output(message_body)\n File \"/usr/lib64/python2.7/httplib.py\", line 881, in _send_output\n self.send(msg)\n File \"/usr/lib64/python2.7/httplib.py\", line 843, in send\n self.connect()\n File \"/usr/lib/python2.7/site-packages/ovirt_hosted_engine_ha/lib/unixrpc.py\", line 52, in connect\n self.sock.connect(base64.b16decode(self.host))\n File \"/usr/lib64/python2.7/socket.py\", line 224, in meth\n return getattr(self._sock,name)(*args)\nsocket.error: [Errno 2] No such file or directory", "stderr_lines": ["Traceback (most recent call last):", " File \"/usr/lib64/python2.7/runpy.py\", line 162, in _run_module_as_main", " \"__main__\", fname, loader, pkg_name)", " File \"/usr/lib64/python2.7/runpy.py\", line 72, in _run_code", " exec code in run_globals", " File \"/usr/lib/python2.7/site-packages/ovirt_hosted_engine_setup/reinitialize_lockspace.py\", line 30, in <module>", " ha_cli.reset_lockspace(force)", " File \"/usr/lib/python2.7/site-packages/ovirt_hosted_engine_ha/client/client.py\", line 283, in reset_lockspace", " stats = broker.get_stats_from_storage()", " File \"/usr/lib/python2.7/site-packages/ovirt_hosted_engine_ha/lib/brokerlink.py\", line 135, in get_stats_from_storage", " result = self._proxy.get_stats()", " File \"/usr/lib64/python2.7/xmlrpclib.py\", line 1233, in __call__", " return self.__send(self.__name, args)", " File \"/usr/lib64/python2.7/xmlrpclib.py\", line 1591, in __request", " verbose=self.__verbose", " File \"/usr/lib64/python2.7/xmlrpclib.py\", line 1273, in request", " return self.single_request(host, handler, request_body, verbose)", " File \"/usr/lib64/python2.7/xmlrpclib.py\", line 1301, in single_request", " self.send_content(h, request_body)", " File \"/usr/lib64/python2.7/xmlrpclib.py\", line 1448, in send_content", " connection.endheaders(request_body)", " File \"/usr/lib64/python2.7/httplib.py\", line 1037, in endheaders", " self._send_output(message_body)", " File \"/usr/lib64/python2.7/httplib.py\", line 881, in _send_output", " self.send(msg)", " File \"/usr/lib64/python2.7/httplib.py\", line 843, in send", " self.connect()", " File \"/usr/lib/python2.7/site-packages/ovirt_hosted_engine_ha/lib/unixrpc.py\", line 52, in connect", " self.sock.connect(base64.b16decode(self.host))", " File \"/usr/lib64/python2.7/socket.py\", line 224, in meth", " return getattr(self._sock,name)(*args)", "socket.error: [Errno 2] No such file or directory"], "stdout": "", "stdout_lines": []}
"""
Version-Release number of selected component (if applicable):
rhvh-4.2.4.2-0.20180608.0+1
rhvm-appliance-4.2-20180601.0.el7.noarch
ovirt-hosted-engine-setup-2.2.22-1.el7ev.noarch
ovirt-hosted-engine-ha-2.2.13-1.el7ev.noarch
cockpit-ovirt-dashboard-0.11.28-1.el7ev.noarch

How reproducible:
100%



Steps to Reproduce:
1. Deploy HE via cockpit

Actual results:
The same as the description

Expected results:
Deploy HE successfully from cockpit.

Additional info:

Comment 1 Yihui Zhao 2018-06-11 05:36:43 UTC
Created attachment 1449863 [details]
create_target_vm.log

Comment 2 Simone Tiraboschi 2018-06-11 07:23:36 UTC
Do you also have /var/log/ovirt-hosted-engine-ha/agent.log and /var/log/ovirt-hosted-engine-ha/broker.log ?

Comment 3 Yihui Zhao 2018-06-11 08:09:19 UTC
agent.log and broker.log

http://pastebin.test.redhat.com/601470

Comment 4 Simone Tiraboschi 2018-06-11 08:30:11 UTC
Sorry, also vdsm log for the relevant time frame.

Comment 5 Yihui Zhao 2018-06-11 09:44:30 UTC
(In reply to Simone Tiraboschi from comment #4)
> Sorry, also vdsm log for the relevant time frame.

http://pastebin.test.redhat.com/601517

Comment 6 Ryan Barry 2018-06-11 20:42:02 UTC
Is this only on RHVH? If so, I think this is the same bug as rhbz#1534197

Comment 7 Yihui Zhao 2018-06-12 02:06:39 UTC
(In reply to Ryan Barry from comment #6)
> Is this only on RHVH? If so, I think this is the same bug as rhbz#1534197

Yes, with same versions about ovirt-hosted-engine-setup or  ovirt-hosted-engine-setup on RHEL7.5, it works well.

Comment 8 Sandro Bonazzola 2018-06-12 08:46:14 UTC
Thanks, marking as duplicate of bug #1534197

*** This bug has been marked as a duplicate of bug 1534197 ***

Comment 9 Yihui Zhao 2018-06-15 07:31:29 UTC
Tested with rhvh-4.2.4.3-0.20180614.0+1 also met this issue:

from cockpit :
"""
[ INFO ] changed: [localhost]
[ INFO ] TASK [Initialize lockspace volume]
[ ERROR ] fatal: [localhost]: FAILED! => {"attempts": 5, "changed": true, "cmd": ["hosted-engine", "--reinitialize-lockspace", "--force"], "delta": "0:00:05.555190", "end": "2018-06-15 15:22:10.067281", "msg": "non-zero return code", "rc": 1, "start": "2018-06-15 15:22:04.512091", "stderr": "Traceback (most recent call last):\n File \"/usr/lib64/python2.7/runpy.py\", line 162, in _run_module_as_main\n \"__main__\", fname, loader, pkg_name)\n File \"/usr/lib64/python2.7/runpy.py\", line 72, in _run_code\n exec code in run_globals\n File \"/usr/lib/python2.7/site-packages/ovirt_hosted_engine_setup/reinitialize_lockspace.py\", line 30, in <module>\n ha_cli.reset_lockspace(force)\n File \"/usr/lib/python2.7/site-packages/ovirt_hosted_engine_ha/client/client.py\", line 283, in reset_lockspace\n stats = broker.get_stats_from_storage()\n File \"/usr/lib/python2.7/site-packages/ovirt_hosted_engine_ha/lib/brokerlink.py\", line 135, in get_stats_from_storage\n result = self._proxy.get_stats()\n File \"/usr/lib64/python2.7/xmlrpclib.py\", line 1233, in __call__\n return self.__send(self.__name, args)\n File \"/usr/lib64/python2.7/xmlrpclib.py\", line 1591, in __request\n verbose=self.__verbose\n File \"/usr/lib64/python2.7/xmlrpclib.py\", line 1273, in request\n return self.single_request(host, handler, request_body, verbose)\n File \"/usr/lib64/python2.7/xmlrpclib.py\", line 1301, in single_request\n self.send_content(h, request_body)\n File \"/usr/lib64/python2.7/xmlrpclib.py\", line 1448, in send_content\n connection.endheaders(request_body)\n File \"/usr/lib64/python2.7/httplib.py\", line 1037, in endheaders\n self._send_output(message_body)\n File \"/usr/lib64/python2.7/httplib.py\", line 881, in _send_output\n self.send(msg)\n File \"/usr/lib64/python2.7/httplib.py\", line 843, in send\n self.connect()\n File \"/usr/lib/python2.7/site-packages/ovirt_hosted_engine_ha/lib/unixrpc.py\", line 52, in connect\n self.sock.connect(base64.b16decode(self.host))\n File \"/usr/lib64/python2.7/socket.py\", line 224, in meth\n return getattr(self._sock,name)(*args)\nsocket.error: [Errno 2] No such file or directory", "stderr_lines": ["Traceback (most recent call last):", " File \"/usr/lib64/python2.7/runpy.py\", line 162, in _run_module_as_main", " \"__main__\", fname, loader, pkg_name)", " File \"/usr/lib64/python2.7/runpy.py\", line 72, in _run_code", " exec code in run_globals", " File \"/usr/lib/python2.7/site-packages/ovirt_hosted_engine_setup/reinitialize_lockspace.py\", line 30, in <module>", " ha_cli.reset_lockspace(force)", " File \"/usr/lib/python2.7/site-packages/ovirt_hosted_engine_ha/client/client.py\", line 283, in reset_lockspace", " stats = broker.get_stats_from_storage()", " File \"/usr/lib/python2.7/site-packages/ovirt_hosted_engine_ha/lib/brokerlink.py\", line 135, in get_stats_from_storage", " result = self._proxy.get_stats()", " File \"/usr/lib64/python2.7/xmlrpclib.py\", line 1233, in __call__", " return self.__send(self.__name, args)", " File \"/usr/lib64/python2.7/xmlrpclib.py\", line 1591, in __request", " verbose=self.__verbose", " File \"/usr/lib64/python2.7/xmlrpclib.py\", line 1273, in request", " return self.single_request(host, handler, request_body, verbose)", " File \"/usr/lib64/python2.7/xmlrpclib.py\", line 1301, in single_request", " self.send_content(h, request_body)", " File \"/usr/lib64/python2.7/xmlrpclib.py\", line 1448, in send_content", " connection.endheaders(request_body)", " File \"/usr/lib64/python2.7/httplib.py\", line 1037, in endheaders", " self._send_output(message_body)", " File \"/usr/lib64/python2.7/httplib.py\", line 881, in _send_output", " self.send(msg)", " File \"/usr/lib64/python2.7/httplib.py\", line 843, in send", " self.connect()", " File \"/usr/lib/python2.7/site-packages/ovirt_hosted_engine_ha/lib/unixrpc.py\", line 52, in connect", " self.sock.connect(base64.b16decode(self.host))", " File \"/usr/lib64/python2.7/socket.py\", line 224, in meth", " return getattr(self._sock,name)(*args)", "socket.error: [Errno 2] No such file or directory"], "stdout": "", "stdout_lines": []}
"""


Log url:
http://pastebin.test.redhat.com/604553

Comment 10 Ryan Barry 2018-06-15 12:00:51 UTC
Can you please attach vdsm.log?

Comment 11 Ryan Barry 2018-06-18 12:01:17 UTC
To update, I can't reproduce this.

Is this reproducible on the CLI?

Comment 12 Yihui Zhao 2018-06-19 02:22:37 UTC
(In reply to Ryan Barry from comment #11)
> To update, I can't reproduce this.
> 
> Is this reproducible on the CLI?

Yes, can reproduce on the CLI.

vdsm.log:
http://pastebin.test.redhat.com/605603

Comment 13 Yihui Zhao 2018-06-19 05:34:50 UTC
Works well for me with the latest build:

cockpit-ovirt-dashboard-0.11.28-1.el7ev.noarch
ovirt-hosted-engine-ha-2.2.14-1.el7ev.noarch
ovirt-hosted-engine-setup-2.2.22-1.el7ev.noarch
rhvm-appliance-4.2-20180615.0.el7.noarch
rhvh-4.2.4.3-0.20180615.0+1


[root@ibm-x3650m5-06 ~]# imgbase w
You are on rhvh-4.2.4.3-0.20180615.0+1
[root@ibm-x3650m5-06 ~]# hosted-engine --vm-status


--== Host 1 status ==--

conf_on_shared_storage             : True
Status up-to-date                  : True
Hostname                           : ibm-x3650m5-06.lab.eng.pek2.redhat.com
Host ID                            : 1
Engine status                      : {"health": "good", "vm": "up", "detail": "Up"}
Score                              : 3400
stopped                            : False
Local maintenance                  : False
crc32                              : ecd6d40e
local_conf_timestamp               : 8882
Host timestamp                     : 8882
Extra metadata (valid at timestamp):
	metadata_parse_version=1
	metadata_feature_version=1
	timestamp=8882 (Tue Jun 19 13:34:18 2018)
	host-id=1
	score=3400
	vm_conf_refresh_time=8882 (Tue Jun 19 13:34:18 2018)
	conf_on_shared_storage=True
	maintenance=False
	state=EngineUp
	stopped=False


Note You need to log in before you can comment on or make changes to this bug.