Bug 1343980

Summary: ERROR:ovirt_hosted_engine_ha.agent.agent.Agent:Error: 'Connection to storage server failed' - trying to restart agent
Product: [oVirt] ovirt-hosted-engine-ha Reporter: Nikolai Sednev <nsednev>
Component: AgentAssignee: Martin Sivák <msivak>
Status: CLOSED DUPLICATE QA Contact: Nikolai Sednev <nsednev>
Severity: urgent Docs Contact:
Priority: unspecified    
Version: 2.0.0CC: bugs, mavital, stirabos
Target Milestone: ---Flags: rule-engine: planning_ack?
rule-engine: devel_ack?
rule-engine: testing_ack?
Target Release: ---   
Hardware: x86_64   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2016-06-08 12:38:15 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: Infra RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 1337711    
Attachments:
Description Flags
sosreport from host none

Description Nikolai Sednev 2016-06-08 12:32:08 UTC
Description of problem:
ERROR:ovirt_hosted_engine_ha.agent.agent.Agent:Error: 'Connection to storage server failed' - trying to restart agent

I've followed reproduction steps described in https://bugzilla.redhat.com/show_bug.cgi?id=1337711 and after host was rebooted got this error and engine failed to get started.

Version-Release number of selected component (if applicable):
The components on host:
qemu-kvm-rhev-2.3.0-31.el7_2.15.x86_64
ovirt-vmconsole-host-1.0.2-2.el7ev.noarch
ovirt-setup-lib-1.0.1-1.el7ev.noarch
vdsm-4.17.30-1.el7ev.noarch
libvirt-client-1.2.17-13.el7_2.5.x86_64
ovirt-hosted-engine-ha-1.3.5.7-1.el7ev.noarch
ovirt-vmconsole-1.0.2-2.el7ev.noarch
ovirt-hosted-engine-setup-1.3.7.1-1.el7ev.noarch
sanlock-3.2.4-2.el7_2.x86_64
mom-0.5.3-1.el7ev.noarch
ovirt-host-deploy-1.4.1-1.el7ev.noarch
Red Hat Enterprise Linux Server release 7.2 (Maipo)
Linux version 3.10.0-327.22.1.el7.x86_64 (mockbuild.eng.bos.redhat.com) (gcc version 4.8.5 20150623 (Red Hat 4.8.5-4) (GCC) ) #1 SMP Mon May 16 13:31:48 EDT 2016
Linux 3.10.0-327.22.1.el7.x86_64 #1 SMP Mon May 16 13:31:48 EDT 2016 x86_64 x86_64 x86_64 GNU/Linux

Engine was deployed using rhevm-appliance-20160602.0-1.el7ev.noarch.

How reproducible:
100%

Steps to Reproduce:
1. Get RHEVM 3.6.7 in Hosted Engine environment.
2. Edit HE VM, aka manager, through the Admin Portal and add additional vnic to it.
3. The vnic is added successfully, the guest OS sees this and all good.
4. Reboot the host.

Actual results:
When host boots up, the engine fails to get started by the agent.

Expected results:
Engine should get started by the agent.

Additional info:
sosreport from host is attached.

Comment 1 Simone Tiraboschi 2016-06-08 12:38:15 UTC

*** This bug has been marked as a duplicate of bug 1342988 ***

Comment 2 Nikolai Sednev 2016-06-08 12:39:18 UTC
Created attachment 1165998 [details]
sosreport from host

Comment 3 Nikolai Sednev 2016-06-08 12:42:03 UTC
(In reply to Simone Tiraboschi from comment #1)
> 
> *** This bug has been marked as a duplicate of bug 1342988 ***

In my case it's not RHEVH, it's RHEL, please consider separating these two issues.

Comment 4 Simone Tiraboschi 2016-06-08 12:49:56 UTC
Nikolai, as a workaround to save the deployment you can edit /etc/ovirt-hosted-engine/hosted-engine.conf 
changing from 'mnt_options=None' to 'mnt_options='

Comment 5 Simone Tiraboschi 2016-06-08 12:51:39 UTC
(In reply to Nikolai Sednev from comment #3)
> In my case it's not RHEVH, it's RHEL, please consider separating these two
> issues.

The fix is generic on hosted-engine-setup