Bug 1325657

Summary: [host-deploy] host deploy fails with java.lang.NullPointerException
Product: [oVirt] ovirt-engine Reporter: Elad <ebenahar>
Component: Host-DeployAssignee: Sandro Bonazzola <sbonazzo>
Status: CLOSED CURRENTRELEASE QA Contact: Elad <ebenahar>
Severity: urgent Docs Contact:
Priority: urgent    
Version: 4.0.0CC: bugs, ebenahar, emesika, gklein, khakimi, mperina, oourfali, sbonazzo
Target Milestone: ovirt-4.0.0-betaKeywords: AutomationBlocker, Regression
Target Release: 4.0.0Flags: rule-engine: ovirt-4.0.0+
rule-engine: blocker+
rule-engine: planning_ack+
rule-engine: devel_ack+
pstehlik: testing_ack+
Hardware: x86_64   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2016-08-01 12:27:37 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: Integration RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 1274622, 1283499    
Attachments:
Description Flags
logs from engine and hypervisor
none
/var/log/ from engine and hypervisor none

Description Elad 2016-04-10 12:19:54 UTC
Created attachment 1145668 [details]
logs from engine and hypervisor

Description of problem:
Deployed hosted-engine over NFS and it failed to complete during host deploy operation in the engine:


2016-04-10 15:04:40,805 INFO  [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (VdsDeploy) [3d0b1546] Correlation ID: 3d0b1546, Call Stack: null, Custom Event ID: -1, Message: Installing Host hosted_engine_1. Logs at host located at: '/tmp/ovirt-host-deploy-20160410150439-lptgpw.log'.
2016-04-10 15:04:40,859 ERROR [org.ovirt.engine.core.bll.hostdeploy.VdsDeployBase] (VdsDeploy) [3d0b1546] Error during deploy dialog: java.lang.NullPointerException
        at org.ovirt.engine.core.bll.hostdeploy.VdsDeployVdsmUnit$1.call(VdsDeployVdsmUnit.java:31) [bll.jar:]
        at org.ovirt.engine.core.bll.hostdeploy.VdsDeployVdsmUnit$1.call(VdsDeployVdsmUnit.java:30) [bll.jar:]
        at org.ovirt.engine.core.bll.hostdeploy.VdsDeployBase.nextCustomizationEntry(VdsDeployBase.java:251) [bll.jar:]
        at org.ovirt.engine.core.bll.hostdeploy.VdsDeployBase.processEvent(VdsDeployBase.java:635) [bll.jar:]
        at org.ovirt.engine.core.bll.hostdeploy.VdsDeploy.processEvent(VdsDeploy.java:35) [bll.jar:]
        at org.ovirt.engine.core.bll.hostdeploy.VdsDeployBase.threadMain(VdsDeployBase.java:319) [bll.jar:]
        at org.ovirt.engine.core.bll.hostdeploy.VdsDeployBase.lambda$new$0(VdsDeployBase.java:383) [bll.jar:]
        at java.lang.Thread.run(Thread.java:745) [rt.jar:1.8.0_77]

2016-04-10 15:04:40,870 ERROR [org.ovirt.engine.core.bll.hostdeploy.VdsDeployBase] (org.ovirt.thread.pool-10-thread-1) [3d0b1546] Error during host blond-vdsh.qa.lab.tlv.redhat.com install: java.lang.NullPointerException
        at org.ovirt.engine.core.bll.hostdeploy.VdsDeployVdsmUnit$1.call(VdsDeployVdsmUnit.java:31) [bll.jar:]
        at org.ovirt.engine.core.bll.hostdeploy.VdsDeployVdsmUnit$1.call(VdsDeployVdsmUnit.java:30) [bll.jar:]
        at org.ovirt.engine.core.bll.hostdeploy.VdsDeployBase.nextCustomizationEntry(VdsDeployBase.java:251) [bll.jar:]
        at org.ovirt.engine.core.bll.hostdeploy.VdsDeployBase.processEvent(VdsDeployBase.java:635) [bll.jar:]
        at org.ovirt.engine.core.bll.hostdeploy.VdsDeploy.processEvent(VdsDeploy.java:35) [bll.jar:]
        at org.ovirt.engine.core.bll.hostdeploy.VdsDeployBase.threadMain(VdsDeployBase.java:319) [bll.jar:]
        at org.ovirt.engine.core.bll.hostdeploy.VdsDeployBase.lambda$new$0(VdsDeployBase.java:383) [bll.jar:]
        at java.lang.Thread.run(Thread.java:745) [rt.jar:1.8.0_77]

2016-04-10 15:04:40,885 ERROR [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (org.ovirt.thread.pool-10-thread-1) [3d0b1546] Correlation ID: 3d0b1546, Call Stack: null, Custom Event ID: -1, Message: Failed to install Host hosted_engine_1. <UNKNOWN>.


Version-Release number of selected component (if applicable):
ovirt-engine-4.0.0-0.0.master.20160407161554.git4c3b9da.el7.centos.noarch
ovirt-host-deploy-1.5.0-0.0.master.20160407112754.gitb51b27a.el7.centos.noarch
ovirt-hosted-engine-setup-2.0.0-0.0.master.20160325145108.gita9a3495.el7.centos.noarch
vdsm-4.17.999-900.giteba4b25.el7.centos.noarch

How reproducible:
Seems to be all the time

Steps to Reproduce:
1. Deploy hosted-engine over NFS

Actual results:
Hosted-engine deployment fails to complete with the mentioned exception in engine.

Expected results:
Host deploy for hosted-engine deployment should succeed.

Additional info:
logs from engine and hypervisor


hosted-engine-setup log:

2016-04-10 15:04:42 ERROR otopi.plugins.ovirt_hosted_engine_setup.engine.add_host add_host._wait_host_ready:101 The VDSM host was found in a failed state. Please check engine and bootstrap installation logs.
2016-04-10 15:04:42 ERROR otopi.plugins.ovirt_hosted_engine_setup.engine.add_host add_host._closeup:873 Unable to add hosted_engine_1 to the manager

Comment 1 Sandro Bonazzola 2016-04-11 06:52:50 UTC
Can you please provide ovirt-host-deploy logs? they are within the HE VM in /var/log/ovirt-engine/host-deploy directory.

Comment 2 Sandro Bonazzola 2016-04-11 06:54:01 UTC
Oved, can you have a look as well for the engine side part?

Comment 3 Red Hat Bugzilla Rules Engine 2016-04-11 06:54:54 UTC
This bug report has Keywords: Regression or TestBlocker.
Since no regressions or test blockers are allowed between releases, it is also being identified as a blocker for this release. Please resolve ASAP.

Comment 4 Oved Ourfali 2016-04-11 07:18:33 UTC
Can you attach all logs properly?
I can't see the host deploy logs as well.

Comment 5 Elad 2016-04-11 07:34:10 UTC
I'll upload host deploy logs along with all the others soon. I just need to redeploy HE.

Comment 6 Oved Ourfali 2016-04-11 07:35:49 UTC
Also, is the issue hosted-engine specific, or host deploy in general fails?

Comment 8 Oved Ourfali 2016-04-11 12:27:11 UTC
*** Bug 1325794 has been marked as a duplicate of this bug. ***

Comment 9 Oved Ourfali 2016-04-11 12:28:55 UTC
Martin - please take a look.
Seems like it happens also regardless of hosted engine (see the duplicate), so moving to infra.

Comment 10 Martin Perina 2016-04-11 13:00:36 UTC
Could you please provide full logs? I'd like to see host-deploy logs from /var/log/ovirt-engine/host-deploy/* and also server.log

Comment 11 Elad 2016-04-11 13:14:52 UTC
Martin, Oved, Sandro, 
I need to re-deploy hosted engine since I've erased the setup where the bug was found. ovirt infra had an issue with the latest build's rpms that got deleted, it should be fixed now. 
Re-deploying, the logs will be provided soon.

Comment 12 Elad 2016-04-11 13:32:51 UTC
Created attachment 1145998 [details]
/var/log/ from engine and hypervisor

/var/log dir from engine and hypervisor.

Host-deploy dir is empty, no logs there

engine.log:

2016-04-11 16:24:42,716 INFO  [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (VdsDeploy) [67e27ae6] Correlation ID: 67e27ae6, Call Stack: null, Custom Event ID: -1, Message: Installing Host hosted_engine_1. Logs at host located at: '/tmp/ovirt-host-deploy-20160411162441-p4atej.log'.
2016-04-11 16:24:42,787 ERROR [org.ovirt.engine.core.bll.hostdeploy.VdsDeployBase] (VdsDeploy) [67e27ae6] Error during deploy dialog: java.lang.NullPointerException

Comment 13 Eli Mesika 2016-04-12 08:02:00 UTC
Hi

It seems that you are using a new host-deploy package with changes related to NGN RHEVH while the engine part of this change [1] is still not merged.

Sandro, can you please specify the version to be used prior to the NGN patch merge.

[1] https://gerrit.ovirt.org/#/q/topic:NGN

Comment 14 Eli Mesika 2016-04-12 08:50:45 UTC
host-deploy version before NGN changes is :

 df57fe1 - (tag: ovirt-host-deploy-1.5.0_alpha1) core: Fix priorities to comply with before/after (2 settimane fa) <Yedidyah Bar David>
 http://resources.ovirt.org/pub/ovirt-4.0_alpha1/

Comment 15 Sandro Bonazzola 2016-04-13 13:02:34 UTC
Fixed on master. Will be available in tomorrow nightly.

Comment 16 Yaniv Kaul 2016-05-03 17:34:55 UTC
Can this be moved to ON_QA already?

Comment 17 Sandro Bonazzola 2016-05-04 13:36:17 UTC
This has been fixed post alpha build so it's not included in alpha.
This bug has been targeted to alpha by bugzilla upon milestone renaming.
Re-targeting to beta.

Comment 18 Elad 2016-07-07 10:46:29 UTC
Host deploy

Comment 19 Elad 2016-07-07 11:57:30 UTC
Host deploy finishes successfully. 

Used:
ovirt-host-deploy-1.5.0-1.el7ev.noarch
rhevm-4.0.2-0.2.rc1.el7ev.noarch
vdsm-4.18.5.1-1.el7ev.x86_64