Bug 1147116

Summary: oo-admin-move failed if the NIC eth0 has no ip address
Product: OKD Reporter: Yudhi Widyatama <yudhi.widyatama>
Component: PodAssignee: Abhishek Gupta <abhgupta>
Status: CLOSED CURRENTRELEASE QA Contact: libra bugs <libra-bugs>
Severity: medium Docs Contact:
Priority: unspecified    
Version: 2.xCC: akostadi, lmeyer, mmasters, mmccomas, pep, pruan
Target Milestone: ---   
Target Release: 2.x   
Hardware: All   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of:
: 1182982 (view as bug list) Environment:
Last Closed: 2015-09-23 08:04:14 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 1182982    

Description Yudhi Widyatama 2014-09-27 02:04:37 UTC
Description of problem:
oo-admin-move failed if the network interface eth0 in the node has no ip address.


Version-Release number of selected component (if applicable):
Openshift Origins v4
rubygem-openshift-origin-msg-broker-mcollective-1.26.1.1-1.el6

How reproducible:
Always on the nodes where eth0 is inactive

Steps to Reproduce:
1. Install openshift origins v4 with 2 or more node hosts with eth1 as the active interface
2. create application 
3. move application using oo-admin-move

Actual results:
Move failed with this error :

DEBUG: Moving content for app '<redacted>', gear '<redacted>' to <redacted>
Identity added: /etc/openshift/rsync_id_rsa (/etc/openshift/rsync_id_rsa)
ssh: Could not resolve hostname : Name or service not known
Agent pid 31683
unset SSH_AUTH_SOCK;
unset SSH_AGENT_PID;
echo Agent pid 31683 killed;
DEBUG: Moving failed.  Rolling back gear '<redacted>' in '<redacted>' with delete on '<redacted>'
Error moving app '<redacted>',platform 'linux', gear '<redacted>' from <redacted> to <redacted>


Expected results:
Gear moved successfully.

Additional info:
Ref : 
https://github.com/openshift/origin-server/blob/master/plugins/msg-broker/mcollective/lib/openshift/mcollective_application_container_proxy.rb#L703

The function get_ip_address hard-codes eth0 as the NIC.

Comment 1 Josep 'Pep' Turro Mauri 2015-01-16 12:28:39 UTC
Submitted https://github.com/openshift/origin-server/pull/6050

Comment 2 Josep 'Pep' Turro Mauri 2015-02-10 09:03:20 UTC
Patch has been merged, commit c3286e087c015258f111bbf73ef6a08257013298

Comment 3 Luke Meyer 2015-02-16 19:53:46 UTC
Being merged, the bug can be tested by QE.

Comment 4 Peter Ruan 2015-02-17 22:26:35 UTC
Does anybody know how to change the eth0 in a devenv?

Comment 5 Luke Meyer 2015-02-18 18:27:52 UTC
I think this should work (in reverse) :
http://minimallinux.blogspot.com/2013/06/centos-6rhel-change-eth1-back-to-eth0.html

Basically:
1. Change eth0 => eth1 in /etc/udev/rules.d/70-persistent-net.rules 
2. mv /etc/sysconfig/network-scripts/ifcfg-eth{0,1}
3. Change eth0 => eth1 in same
4. Reboot (probably cannot just restart network)

Comment 6 Peter Ruan 2015-02-19 06:49:22 UTC
I followed your suggestion, but unfortunately, after reboot, I can't ssh into the devenv.

Comment 7 Aleksandar Kostadinov 2015-02-23 21:24:42 UTC
I managed to get such an instance by executing steps #2 and #3. And the running this one command in root shell:
> ( ifconfig eth0 down; ip link set eth0 name eth1; service network restart; ) &

Comment 8 Miciah Dashiel Butler Masters 2015-09-22 21:42:53 UTC
The proposed fix was shipped in rubygem-openshift-origin-msg-broker-mcollective-1.32.0.1-1.el6op and openshift-origin-msg-node-mcollective-1.29.0.1-1.el6op in RHBA-2015:0220 "Red Hat OpenShift Enterprise 2.2.4 bug fix and enhancement update".  If the fix can be verified, I would like to mark this Bugzilla report CLOSED CURRENTRELEASE.

Comment 9 Josep 'Pep' Turro Mauri 2015-09-23 08:04:14 UTC
(In reply to Miciah Dashiel Butler Masters from comment #8)
> The proposed fix was shipped in
> rubygem-openshift-origin-msg-broker-mcollective-1.32.0.1-1.el6op and
> openshift-origin-msg-node-mcollective-1.29.0.1-1.el6op in RHBA-2015:0220
> "Red Hat OpenShift Enterprise 2.2.4 bug fix and enhancement update". 

These packages were tested as part of the equivalent Enterprise bug: https://bugzilla.redhat.com/show_bug.cgi?id=1182982#c9

> If the
> fix can be verified, I would like to mark this Bugzilla report CLOSED
> CURRENTRELEASE.

closing then.