Created attachment 1149435 [details]
vdsm log

Description of problem:
======================
Addition of 3rd node always fails with "certification enrollment failed" error.

Version-Release number of selected component (if applicable):
=============================================================
3.6.5.3-0.1.el6

How reproducible:
=================
100%

Steps to Reproduce:
Add the 3rd node to the hosted engine cluster

RHEL rpm's installed:
[root@rhsqa5 ~]# rpm -qa |grep rhev
fence-agents-rhevm-4.0.11-27.el7_2.7.x86_64
qemu-kvm-tools-rhev-2.3.0-31.el7_2.10.x86_64
qemu-kvm-rhev-2.3.0-31.el7_2.10.x86_64
qemu-img-rhev-2.3.0-31.el7_2.10.x86_64
rhevm-sdk-python-3.6.3.0-1.el7ev.noarch
libcacard-rhev-2.3.0-31.el7_2.10.x86_64
qemu-kvm-common-rhev-2.3.0-31.el7_2.10.x86_64

Actual results:

Expected results:

Additional info:
Attaching the vdsm log.
I have tried a couple of times, but it still fails with the same error. This is blocking us from starting ROBO testing.
Can you attach engine.log from the hosted engine VM?
Created attachment 1149646 [details] engine log
As a workaround, you can copy /etc/pki/ovirt-engine/serial.txt.old to /etc/pki/ovirt-engine/serial.txt.
File '/etc/pki/ovirt-engine/serial.txt' got deleted during host addition. As a result, Sign Certificate request is failing. I can see the following error in engine log.

2016-04-22 02:14:13,576 ERROR [org.ovirt.engine.core.utils.hostinstall.OpenSslCAWrapper] (VdsDeploy) [1d49336e] Sign Certificate request failed with exit code 1
2016-04-22 02:14:13,576 ERROR [org.ovirt.engine.core.utils.hostinstall.OpenSslCAWrapper] (VdsDeploy) [1d49336e] Sign Certificate request script errors:
Using configuration from openssl.conf
unable to load number from serial.txt
error while loading serial number
140695882463048:error:0D066096:asn1 encoding routines:a2i_ASN1_INTEGER:short line:f_int.c:215:
Cannot sign certificate

We are seeing this issue frequently during host addition in hosted engine setup with gluster.
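For anyone hitting this, the workaround mentioned above (restoring serial.txt from serial.txt.old) could be scripted roughly as follows. This is only a sketch: the function name `restore_serial` and its optional directory argument are hypothetical, and it assumes a usable serial.txt.old is still present next to the missing file.

```sh
#!/bin/sh
# Hypothetical helper (not part of ovirt-engine): restore the CA serial file
# from its backup copy. The directory defaults to the path named in this bug;
# pass another directory as $1 to try it out safely.
restore_serial() {
    pki_dir="${1:-/etc/pki/ovirt-engine}"
    serial="$pki_dir/serial.txt"
    backup="$pki_dir/serial.txt.old"

    # Nothing to do if serial.txt exists and is non-empty.
    if [ -s "$serial" ]; then
        echo "serial.txt present, nothing to do"
        return 0
    fi

    # Without a non-empty backup the serial number cannot be recovered here.
    if [ ! -s "$backup" ]; then
        echo "no usable serial.txt.old in $pki_dir" >&2
        return 1
    fi

    # cp -p preserves mode and timestamps (and ownership when run as root).
    cp -p "$backup" "$serial"
    echo "restored $serial from $backup"
}
```

Calling `restore_serial` with no argument on the engine VM (as root) would act on /etc/pki/ovirt-engine. It only puts the file back and does not address why the file disappears during host addition.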
Bhaskarakiran, please provide a full sos report from the 3rd host and from the engine VM, thanks. Also, please note you're using 3.6.3 on the hosts (rhevm-sdk-python-3.6.3.0-1.el7ev.noarch) and 3.6.5 in the engine (3.6.5.3-0.1.el6). Please try to reproduce with hosts and engine aligned to the same version.
Bhaskarakiran, also, how were you trying to add this 3rd host? Running hosted-engine --deploy on it? Just adding it using the web UI as a common host? Using ansible / gdeploy?
Sandro, I used hosted-engine --deploy to add the 3rd host. The setup is out now; I will try on a fresh setup and provide the logs.
Bug tickets must have version flags set prior to targeting them to a release. Please ask maintainer to set the correct version flags and only then set the target milestone.
I see that this is hit if a single network is used for both virt and gluster. I tried with separate networks for virt and gluster and didn't see this.
Closing this, as the recommendation in case of HC is to use multiple networks to separate virt and gluster traffic.