Bug 1021152 - 3.3.1 beta, add vdsm host fails with error: Unexpected connection termination
Summary: 3.3.1 beta, add vdsm host fails with error: Unexpected connection termination
Keywords:
Status: CLOSED DUPLICATE of bug 986882
Alias: None
Product: oVirt
Classification: Retired
Component: ovirt-engine-core
Version: 3.3
Hardware: Unspecified
OS: Unspecified
unspecified
unspecified
Target Milestone: ---
: ---
Assignee: Alon Bar-Lev
QA Contact:
URL:
Whiteboard: infra
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2013-10-19 21:12 UTC by Jason Brooks
Modified: 2013-10-20 21:16 UTC (History)
4 users (show)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed: 2013-10-20 12:52:58 UTC
oVirt Team: ---
Embargoed:


Attachments (Terms of Use)
portion of engine.log from centos 6.4 / ovirt 3.3.1 beta engine (32.21 KB, text/x-log)
2013-10-19 21:12 UTC, Jason Brooks
no flags Details
engine log excerpt from attempt to add f19 host (with tar installed) to f19 engine (4.18 KB, text/x-log)
2013-10-20 19:43 UTC, Jason Brooks
no flags Details
log from f19 engine (485.62 KB, text/x-log)
2013-10-20 19:53 UTC, Jason Brooks
no flags Details

Description Jason Brooks 2013-10-19 21:12:03 UTC
Created attachment 814092 [details]
portion of engine.log from centos 6.4 / ovirt 3.3.1 beta engine

Description of problem:

Tested with F19 engine and with CentOS 6.4 engine, running 3.3.1 beta packages. On adding a new host, add operation fails due to "java.io.IOException: Unexpected connection termination." Fuller log passage attached.

I confirmed that both machines have resolvable DNS names, both machines can see each other, ssh to each other, selinux permissive, tried both w/ firewalls running & not running.



Version-Release number of selected component (if applicable):


How reproducible:


Steps to Reproduce:
1.
2.
3.

Actual results:


Expected results:


Additional info:

Comment 1 Alon Bar-Lev 2013-10-20 12:52:58 UTC
stderr: bash: tar: command not found

*** This bug has been marked as a duplicate of bug 986882 ***

Comment 2 Jason Brooks 2013-10-20 19:43:48 UTC
Created attachment 814309 [details]
engine log excerpt from attempt to add f19 host (with tar installed) to f19 engine

Comment 3 Alon Bar-Lev 2013-10-20 19:45:53 UTC
(In reply to Jason Brooks from comment #2)
> Created attachment 814309 [details]
> engine log excerpt from attempt to add f19 host (with tar installed) to f19
> engine

Please attach full log.

Have you installed tar on host or on engine?

Comment 4 Jason Brooks 2013-10-20 19:53:19 UTC
Created attachment 814310 [details]
log from f19 engine

tar is installed on the host and on the engine -- I installed ovirt-release-fedora on both machines, and tar is a dep of that pkg

Comment 5 Alon Bar-Lev 2013-10-20 20:14:51 UTC
(In reply to Jason Brooks from comment #4)
> Created attachment 814310 [details]
> log from f19 engine
> 
> tar is installed on the host and on the engine -- I installed
> ovirt-release-fedora on both machines, and tar is a dep of that pkg

This is different issue.

Please file separate bug, or better discuss first at user mailing list...

What do you have at /var/log/messages at host side?

Do you have valid tar at /var/cache/ovirt-engine/ovirt-host-deploy.tar, should be non empty with lots of files.

What do you get while executing the following command at engine:

( cat /var/cache/ovirt-engine/ovirt-host-deploy.tar && cat ) | ssh root.171.201 'umask 0077; MYTMP="$(mktemp -t ovirt-XXXXXXXXXX)"; trap "chmod -R u+rwX \"${MYTMP}\" > /dev/null 2>&1; rm -fr \"${MYTMP}\" > /dev/null 2>&1" 0; rm -fr "${MYTMP}" && mkdir "${MYTMP}" && tar --warning=no-timestamp -C "${MYTMP}" -x &&  "${MYTMP}"/setup DIALOG/dialect=str:machine DIALOG/customization=bool:True'

Comment 6 Jason Brooks 2013-10-20 21:04:34 UTC
All right, this report wasn't really about tar -- after having this problem on F19, I tried on CentOS, and forgot to install the ovirt-release pkg. That pkg pulls in tar, so that's not really a problem.

Host side, /var/log/messages has NetworkManager activity -- NetworkManager appears to request a new dhcp address when the add host operation begins. I thought maybe could have been disrupting the communications, so I stopped the NetworkManager service, tried add again, no more NetworkManager activity, but the same error occurred engine-side, and on the host side, in /var/log/messages:

Oct 20 16:27:22 dhcp-10-171-201 systemd-logind[450]: New session 55 of user root.
Oct 20 16:27:23 dhcp-10-171-201 systemd-logind[450]: Removed session 55.
Oct 20 16:27:23 dhcp-10-171-201 systemd-logind[450]: New session 56 of user root.
Oct 20 16:27:23 dhcp-10-171-201 systemd-logind[450]: Removed session 56.

Engine side, ovirt-host-deploy.tar is full of files.

Running that command on the engine prompts me for the root pw of the host, then asks me to answer yes/no, and then fails with "ERROR Failed to execute stage 'Initializing': need more than 1 value to unpack"

Comment 7 Alon Bar-Lev 2013-10-20 21:16:54 UTC
please move discussion to users, bugzilla is not the place for that, especially this bug which is duplicate per your original report.

please also be consistent in the environment you are using, from this thread I do not understand what OS you are using for engine and what OS you are using for host.


Note You need to log in before you can comment on or make changes to this bug.