Bug 989216

Summary: Rhevh upgrade failed and host stuck in installation state
Product: Red Hat Enterprise Virtualization Manager Reporter: Artyom <alukiano>
Component: ovirt-engineAssignee: Alon Bar-Lev <alonbl>
Status: CLOSED DUPLICATE QA Contact:
Severity: unspecified Docs Contact:
Priority: unspecified    
Version: 3.3.0CC: acathrow, alonbl, alukiano, iheim, lpeer, Rhev-m-bugs, yeylon
Target Milestone: ---   
Target Release: ---   
Hardware: Unspecified   
OS: Linux   
Whiteboard: infra
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2013-07-29 10:40:01 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: Infra RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Attachments:
Description Flags
host-deploy.log
none
engine.log none

Description Artyom 2013-07-28 12:56:52 UTC
Created attachment 779336 [details]
host-deploy.log

Description of problem:
Have host with rhev-hypervisor6-6.4-20130702.0.el6_4 trying to upgrade to rhev-hypervisor6-6.5-20130725.0.el6, upgrade failed with message "Host <host> installation failed" and host stuck in Installing state and just ovirt-engine restart help.

Version-Release number of selected component (if applicable):
is7

How reproducible:
always

Steps to Reproduce:
1. Install rhev-hypervisor6-6.4-20130702.0.el6_4 on host and add host to rhevm
2. Install rhev-hypervisor6-6.5-20130725.0.el6 on rhevm, put host to maintenance and try to upgrade to rhev-hypervisor6-6.5-20130725.0.el6 
3.

Actual results:
Upgrade failed with message "Host <host> installation failed" and host stuck in Installing state and just ovirt-engine restart help

Expected results:
Upgrade success and host up with rhev-hypervisor6-6.5-20130725.0.el6 on it

Additional info:
see host-deploy and engine log

Comment 1 Artyom 2013-07-28 12:57:26 UTC
Created attachment 779337 [details]
engine.log

Comment 2 Alon Bar-Lev 2013-07-29 07:38:10 UTC
Hi,
Are you sure this is the right host-deploy log? I see all success.
In engine.log I do see failure at different time:

2013-07-28 11:34:20,774 ERROR [org.ovirt.engine.core.bll.InstallerMessages] (VdsDeploy) Installation aqua-vds3.qa.lab.tlv.redhat.com: Failed to execute stage 'Closing up': Command '/bin/systemctl' failed to execute
2013-07-28 11:34:20,779 INFO  [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (VdsDeploy) Correlation ID: 1bac9a6a, Call Stack: null, Custom Event ID: -1, Message: Failed to install Host aqua-vds3.qa.lab.tlv.redhat.com. Failed to execute stage 'Closing up': Command '/bin/systemctl' failed to execute.

I think that the engine setup of bridge caused this to not go into install failed, I will check.

Comment 3 Alon Bar-Lev 2013-07-29 10:40:01 UTC

*** This bug has been marked as a duplicate of bug 987891 ***