Created attachment 1145668 [details]
logs from engine and hypervisor

Description of problem:
Deployed hosted-engine over NFS and it failed to complete during host deploy operation in the engine:

2016-04-10 15:04:40,805 INFO [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (VdsDeploy) [3d0b1546] Correlation ID: 3d0b1546, Call Stack: null, Custom Event ID: -1, Message: Installing Host hosted_engine_1. Logs at host located at: '/tmp/ovirt-host-deploy-20160410150439-lptgpw.log'.
2016-04-10 15:04:40,859 ERROR [org.ovirt.engine.core.bll.hostdeploy.VdsDeployBase] (VdsDeploy) [3d0b1546] Error during deploy dialog: java.lang.NullPointerException
    at org.ovirt.engine.core.bll.hostdeploy.VdsDeployVdsmUnit$1.call(VdsDeployVdsmUnit.java:31) [bll.jar:]
    at org.ovirt.engine.core.bll.hostdeploy.VdsDeployVdsmUnit$1.call(VdsDeployVdsmUnit.java:30) [bll.jar:]
    at org.ovirt.engine.core.bll.hostdeploy.VdsDeployBase.nextCustomizationEntry(VdsDeployBase.java:251) [bll.jar:]
    at org.ovirt.engine.core.bll.hostdeploy.VdsDeployBase.processEvent(VdsDeployBase.java:635) [bll.jar:]
    at org.ovirt.engine.core.bll.hostdeploy.VdsDeploy.processEvent(VdsDeploy.java:35) [bll.jar:]
    at org.ovirt.engine.core.bll.hostdeploy.VdsDeployBase.threadMain(VdsDeployBase.java:319) [bll.jar:]
    at org.ovirt.engine.core.bll.hostdeploy.VdsDeployBase.lambda$new$0(VdsDeployBase.java:383) [bll.jar:]
    at java.lang.Thread.run(Thread.java:745) [rt.jar:1.8.0_77]
2016-04-10 15:04:40,870 ERROR [org.ovirt.engine.core.bll.hostdeploy.VdsDeployBase] (org.ovirt.thread.pool-10-thread-1) [3d0b1546] Error during host blond-vdsh.qa.lab.tlv.redhat.com install: java.lang.NullPointerException
    at org.ovirt.engine.core.bll.hostdeploy.VdsDeployVdsmUnit$1.call(VdsDeployVdsmUnit.java:31) [bll.jar:]
    at org.ovirt.engine.core.bll.hostdeploy.VdsDeployVdsmUnit$1.call(VdsDeployVdsmUnit.java:30) [bll.jar:]
    at org.ovirt.engine.core.bll.hostdeploy.VdsDeployBase.nextCustomizationEntry(VdsDeployBase.java:251) [bll.jar:]
    at org.ovirt.engine.core.bll.hostdeploy.VdsDeployBase.processEvent(VdsDeployBase.java:635) [bll.jar:]
    at org.ovirt.engine.core.bll.hostdeploy.VdsDeploy.processEvent(VdsDeploy.java:35) [bll.jar:]
    at org.ovirt.engine.core.bll.hostdeploy.VdsDeployBase.threadMain(VdsDeployBase.java:319) [bll.jar:]
    at org.ovirt.engine.core.bll.hostdeploy.VdsDeployBase.lambda$new$0(VdsDeployBase.java:383) [bll.jar:]
    at java.lang.Thread.run(Thread.java:745) [rt.jar:1.8.0_77]
2016-04-10 15:04:40,885 ERROR [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (org.ovirt.thread.pool-10-thread-1) [3d0b1546] Correlation ID: 3d0b1546, Call Stack: null, Custom Event ID: -1, Message: Failed to install Host hosted_engine_1. <UNKNOWN>.

Version-Release number of selected component (if applicable):
ovirt-engine-4.0.0-0.0.master.20160407161554.git4c3b9da.el7.centos.noarch
ovirt-host-deploy-1.5.0-0.0.master.20160407112754.gitb51b27a.el7.centos.noarch
ovirt-hosted-engine-setup-2.0.0-0.0.master.20160325145108.gita9a3495.el7.centos.noarch
vdsm-4.17.999-900.giteba4b25.el7.centos.noarch

How reproducible:
Seems to be all the time

Steps to Reproduce:
1. Deploy hosted-engine over NFS

Actual results:
Hosted-engine deployment fails to complete with the mentioned exception in engine.

Expected results:
Host deploy for hosted-engine deployment should succeed.

Additional info:
logs from engine and hypervisor

hosted-engine-setup log:
2016-04-10 15:04:42 ERROR otopi.plugins.ovirt_hosted_engine_setup.engine.add_host add_host._wait_host_ready:101 The VDSM host was found in a failed state. Please check engine and bootstrap installation logs.
2016-04-10 15:04:42 ERROR otopi.plugins.ovirt_hosted_engine_setup.engine.add_host add_host._closeup:873 Unable to add hosted_engine_1 to the manager
Can you please provide the ovirt-host-deploy logs? They are inside the HE VM, in the /var/log/ovirt-engine/host-deploy directory.
Oved, can you also have a look at the engine side?
This bug report has Keywords: Regression or TestBlocker. Since no regressions or test blockers are allowed between releases, it is also being identified as a blocker for this release. Please resolve ASAP.
Can you attach all logs properly? I can't see the host deploy logs either.
I'll upload host deploy logs along with all the others soon. I just need to redeploy HE.
Also, is the issue hosted-engine specific, or does host deploy fail in general?
*** Bug 1325794 has been marked as a duplicate of this bug. ***
Martin - please take a look. It seems to happen regardless of hosted engine as well (see the duplicate), so moving to infra.
Could you please provide full logs? I'd like to see host-deploy logs from /var/log/ovirt-engine/host-deploy/* and also server.log
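For anyone gathering the files requested above, a minimal sketch of packing them into one archive follows. The helper name `collect_logs` and the output location are hypothetical, not part of any oVirt tool, and it assumes server.log sits under /var/log/ovirt-engine/ next to the host-deploy directory.

```python
import glob
import os
import tarfile


def collect_logs(patterns, archive_path):
    """Pack every regular file matching the glob patterns into a gzip tarball.

    Patterns that match nothing are skipped silently.
    Returns the sorted list of files that were added.
    """
    files = sorted({
        path
        for pattern in patterns
        for path in glob.glob(pattern)
        if os.path.isfile(path)
    })
    with tarfile.open(archive_path, "w:gz") as tar:
        for path in files:
            tar.add(path)
    return files
```

It could be called on the engine VM as `collect_logs(["/var/log/ovirt-engine/host-deploy/*", "/var/log/ovirt-engine/server.log"], "/tmp/engine-logs.tar.gz")`. An empty return value means no matching files were found, which is itself worth reporting.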
Martin, Oved, Sandro, I need to redeploy hosted engine since I've erased the setup where the bug was found. ovirt infra had an issue with the latest build's RPMs, which got deleted; it should be fixed now. I'm redeploying, and the logs will be provided soon.
Created attachment 1145998 [details]
/var/log/ from engine and hypervisor

/var/log dir from engine and hypervisor. Host-deploy dir is empty, no logs there

engine.log:
2016-04-11 16:24:42,716 INFO [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (VdsDeploy) [67e27ae6] Correlation ID: 67e27ae6, Call Stack: null, Custom Event ID: -1, Message: Installing Host hosted_engine_1. Logs at host located at: '/tmp/ovirt-host-deploy-20160411162441-p4atej.log'.
2016-04-11 16:24:42,787 ERROR [org.ovirt.engine.core.bll.hostdeploy.VdsDeployBase] (VdsDeploy) [67e27ae6] Error during deploy dialog: java.lang.NullPointerException
Hi,

It seems that you are using a new host-deploy package with changes related to NGN RHEVH, while the engine part of this change [1] is still not merged.

Sandro, can you please specify the version to be used prior to the NGN patch merge?

[1] https://gerrit.ovirt.org/#/q/topic:NGN
The host-deploy version before the NGN changes is:
df57fe1 - (tag: ovirt-host-deploy-1.5.0_alpha1) core: Fix priorities to comply with before/after (2 weeks ago) <Yedidyah Bar David>
http://resources.ovirt.org/pub/ovirt-4.0_alpha1/
Fixed on master. Will be available in tomorrow's nightly.
Can this be moved to ON_QA already?
This has been fixed post alpha build so it's not included in alpha. This bug has been targeted to alpha by bugzilla upon milestone renaming. Re-targeting to beta.
Host deploy finishes successfully.

Used:
ovirt-host-deploy-1.5.0-1.el7ev.noarch
rhevm-4.0.2-0.2.rc1.el7ev.noarch
vdsm-4.18.5.1-1.el7ev.x86_64