Bug 1093621 - After host restart, storage domain not mounted automatically
Summary: After host restart, storage domain not mounted automatically
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Enterprise Virtualization Manager
Classification: Red Hat
Component: ovirt-hosted-engine-ha
Version: 3.4.0
Hardware: x86_64
OS: Linux
unspecified
urgent
Target Milestone: ---
: 3.5.0
Assignee: Doron Fediuck
QA Contact: Artyom
URL:
Whiteboard: sla
Depends On: 1123285
Blocks: 1119702 rhev3.5beta 1156165
Reported: 2014-05-02 08:39 UTC by Artyom
Modified: 2016-02-10 20:13 UTC (History)

Fixed In Version: ovirt-3.5.0-beta2
Doc Type: Bug Fix
Doc Text:
Clone Of:
: 1119702 (view as bug list)
Environment:
Last Closed: 2015-02-11 21:08:34 UTC
oVirt Team: SLA
Target Upstream Version:
Embargoed:


Attachments
agent broker and vdsm logs from two hosts (1.90 MB, application/zip)
2014-05-02 08:39 UTC, Artyom


Links
System | ID | Private | Priority | Status | Summary | Last Updated
Red Hat Product Errata | RHBA-2015:0194 | 0 | normal | SHIPPED_LIVE | ovirt-hosted-engine-ha bug fix and enhancement update | 2015-02-12 01:35:33 UTC
oVirt gerrit | 29862 | 0 | master | MERGED | try harder when connecting the storage | Never
oVirt gerrit | 29921 | 0 | ovirt-hosted-engine-ha-1.2 | MERGED | try harder when connecting the storage | Never
oVirt gerrit | 29924 | 0 | ovirt-hosted-engine-ha-1.1 | MERGED | try harder when connecting the storage | Never

Description Artyom 2014-05-02 08:39:10 UTC
Created attachment 891747
agent broker and vdsm logs from two hosts

Description of problem:
I have two hosts with hosted-engine installed, and the engine VM runs on one of them. If I set global maintenance and restart the hosts, they do not automatically connect the hosted-engine storage domain after the restart, and the hosted-engine HA agent subsequently crashes.

Version-Release number of selected component (if applicable):
ovirt-hosted-engine-ha-1.1.2-2.el6ev.noarch

How reproducible:
Always

Steps to Reproduce:
1. Have a hosted-engine environment with two hosts and a running engine VM
2. Set global maintenance mode (on one of the hosts, run hosted-engine --set-maintenance --mode=global) and wait until hosted-engine --vm-status reflects the change
3. Restart both hosts

Actual results:
After the hosts restart, the hosted-engine storage domain is not mounted automatically, which leads to an HA agent crash.

Expected results:
The hosted-engine storage domain is mounted automatically on both hosts, the HA agent starts normally, and the engine VM is started.

Additional info:
It is also possible to mount the storage domain manually after the restart via hosted-engine --connect-storage and then start the HA agent (service ovirt-ha-agent start); after that, hosted engine continues to work fine.
Before this, I tried different cases on this environment in which the connection between the host and the storage domain was blocked, but after restoring iptables everything worked fine.
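
The manual recovery described above can be sketched as a small shell script. This is only an illustration of the two commands from this report run in order; the DRY_RUN switch and the run and recover_hosted_engine helpers are hypothetical names added here, not part of hosted-engine. By default the script only prints the commands, so it is safe to try on any machine.

```shell
#!/bin/sh
# Sketch of the manual workaround: connect the storage, then start the agent.
# DRY_RUN is a hypothetical switch for illustration: 1 prints the commands,
# 0 actually executes them (requires a hosted-engine host and root).
DRY_RUN="${DRY_RUN:-1}"

# Print or execute a command depending on DRY_RUN.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# Connect the hosted-engine storage domain; start the HA agent only
# if the storage connection succeeded.
recover_hosted_engine() {
    run hosted-engine --connect-storage || return 1
    run service ovirt-ha-agent start
}
```

Calling recover_hosted_engine with DRY_RUN=0 on an affected host would run both commands for real; the agent is not started if connecting the storage fails.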

Comment 1 Jiri Moskovcak 2014-07-07 12:42:49 UTC
I just reproduced this crash, but with a slightly different result. It seems like the agent is trying to communicate with vdsmd too soon (while it's not yet initialized) and thus fails to connect the storage. Can you please try to reproduce it, and this time just wait a while after the machine boots and then start the agent with service ovirt-ha-agent start? It should start normally.
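
If the failure is a timing issue, as suspected here, one general way to cope is to retry the storage connection a few times with a delay instead of giving up after the first failure. The helper below is a minimal sketch of that idea in plain shell; it is not the patch for this bug (the agent itself is not a shell script), and the retry function, its parameters, and the numbers in the usage comment are assumptions made for illustration.

```shell
#!/bin/sh
# retry MAX_ATTEMPTS DELAY_SECONDS COMMAND [ARGS...]
# Runs COMMAND until it succeeds or MAX_ATTEMPTS is reached,
# sleeping DELAY_SECONDS between attempts.
# Returns 0 on the first success, 1 if every attempt failed.
retry() {
    max_attempts="$1"
    delay="$2"
    shift 2
    attempt=1
    while [ "$attempt" -le "$max_attempts" ]; do
        if "$@"; then
            return 0
        fi
        echo "attempt $attempt/$max_attempts failed: $*" >&2
        # Do not sleep after the final failed attempt.
        if [ "$attempt" -lt "$max_attempts" ]; then
            sleep "$delay"
        fi
        attempt=$((attempt + 1))
    done
    return 1
}

# Illustrative usage (values are arbitrary, not executed here):
#   retry 10 30 hosted-engine --connect-storage
```

A fixed delay keeps the sketch simple; a growing delay would work the same way with a small change to the sleep line.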

Comment 2 Artyom 2014-07-11 09:18:39 UTC
Sorry for the late answer, I had some exams this week. I checked your proposal:
1) Installed a hosted-engine environment with two hosts
2) Set global maintenance mode
3) Disabled autostart for the agent and broker services on both hosts:
   chkconfig ovirt-ha-agent off && chkconfig ovirt-ha-agent --del
   chkconfig ovirt-ha-broker off && chkconfig ovirt-ha-broker --del
4) Restarted both hosts
5) Waited some time after the hosts booted
6) Started the services manually on both hosts:
   service ovirt-ha-broker start && service ovirt-ha-agent start

After some time it looks like the storage was mounted successfully, but now the engine agent starts playing ping-pong with the VM because of a score problem. That is related to another bug.

Comment 3 Jiri Moskovcak 2014-07-11 11:37:03 UTC
(In reply to Artyom from comment #2)
Thanks, Artyom, for confirming this. The proposed patch should fix the problem.

Comment 5 Artyom 2014-08-07 12:34:53 UTC
Verified on ovirt-hosted-engine-ha-1.2.1-0.2.master.20140805072346.el6.noarch

Comment 8 errata-xmlrpc 2015-02-11 21:08:34 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://rhn.redhat.com/errata/RHBA-2015-0194.html

