Bug 1229174

Summary: Nova computes can't resolve each other because the hostnames in /etc/hosts don't include the ".novalocal" suffix
Product: [Community] RDO
Reporter: Udi Kalifon <ukalifon>
Component: openstack-tripleo
Assignee: James Slagle <jslagle>
Status: CLOSED EOL
QA Contact: Shai Revivo <srevivo>
Severity: high
Docs Contact:
Priority: unspecified
Version: trunk
Target Milestone: ---
Target Release: Kilo
Hardware: Unspecified
OS: Unspecified
Whiteboard:
Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of:
Environment:
Last Closed: 2016-05-19 15:57:54 UTC
Type: Bug
Regression: ---
Mount Type: ---
Documentation: ---
CRM:
Verified Versions:
Category: ---
oVirt Team: ---
RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: ---
Target Upstream Version:
Embargoed:

Description Udi Kalifon 2015-06-08 08:43:43 UTC
Description of problem:
In a virt setup with 7 nodes (the deployment includes Ceph and 3 compute nodes), live-migrating an instance fails with the following error in nova-compute.log on the host that is currently running the instance:

Live Migration failure: operation failed: Failed to connect to remote libvirt URI qemu+tcp://ov-uugkso3o5k-2-exzf5z7st6mi-novacompute-pvxogl3jlxyu.novalocal/system: Unable to resolve address 'ov-uugkso3o5k-2-exzf5z7st6mi-novacompute-pvxogl3jlxyu.novalocal' service '16509': Name or service not known
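
The error is a plain name-resolution failure, so it can be checked without involving libvirt. The following is a minimal sketch, assuming Python is available on the compute node; the function name `unresolvable` and its `resolve` parameter are illustrative and not part of nova or libvirt. It reports which of the given names fail to resolve for port 16509, the port shown in the error above.

```python
import socket


def unresolvable(names, port=16509, resolve=socket.getaddrinfo):
    """Return the names that cannot be resolved for the given port.

    `resolve` defaults to socket.getaddrinfo; it is a parameter only so the
    check can be exercised without real DNS or /etc/hosts entries.
    """
    failed = []
    for name in names:
        try:
            resolve(name, port)
        except socket.gaierror:
            # Same class of failure as "Name or service not known".
            failed.append(name)
    return failed
```

Calling it with both the short hostname and the ".novalocal" form of each compute node should show which of the two the node can actually resolve.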

Looking in /etc/hosts, the hostnames are mapped to IPs, but none of the entries include the ".novalocal" suffix:

# HEAT_HOSTS_START - Do not edit manually within this section!
192.0.2.9 ov-uugkso3o5k-0-liuc7fbnfti6-novacompute-dh7mq5oubb2b
192.0.2.10 ov-uugkso3o5k-1-cdd5mzvqgcjc-novacompute-pqrwgvszrofu
192.0.2.11 ov-uugkso3o5k-2-exzf5z7st6mi-novacompute-pvxogl3jlxyu

When you run the "hostname" command on the nodes, the result has ".novalocal" at the end. I added the hostnames with the suffix to each of the above lines in /etc/hosts and was then able to live migrate.
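
The manual workaround above can be sketched as a small text transformation. This is only an illustration of the edit described, not the fix that belongs in the deployment tooling: the function name `add_suffix_aliases` is hypothetical, and because the section is marked "Do not edit manually", it is an assumption that such changes would survive a later regeneration of the file.

```python
def add_suffix_aliases(hosts_text, suffix=".novalocal"):
    """Append '<name><suffix>' as an alias to each host entry lacking it."""
    out = []
    for line in hosts_text.splitlines():
        stripped = line.strip()
        # Leave blank lines and comments (including the HEAT marker) alone.
        if not stripped or stripped.startswith("#"):
            out.append(line)
            continue
        fields = stripped.split()
        ip, names = fields[0], fields[1:]
        # Leave loopback entries and malformed lines untouched.
        if not names or ip.startswith("127.") or ip == "::1":
            out.append(line)
            continue
        aliases = [n + suffix for n in names
                   if not n.endswith(suffix) and n + suffix not in names]
        out.append(" ".join([ip] + names + aliases))
    # Preserve the trailing newline if the input had one.
    return "\n".join(out) + ("\n" if hosts_text.endswith("\n") else "")
```

Running the function a second time on its own output leaves the text unchanged, so it is safe to apply repeatedly.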


Version-Release number of selected component (if applicable):
openstack-tripleo-0.0.6-dev1717.el7.centos.noarch
openstack-nova-api-2015.1.0-4.el7ost.noarch
openstack-nova-common-2015.1.0-4.el7ost.noarch


How reproducible:
100%


Steps to Reproduce:
1. Deploy a virt setup with several compute nodes
2. Launch an instance in the overcloud
3. Determine the host the instance is running on with "nova show" and find a host to migrate it to with "nova host-list"
4. Live migrate: nova live-migration 1578a16e-009c-472c-bd1f-caf639be10b0 ov-uugkso3o5k-2-exzf5z7st6mi-novacompute-pvxogl3jlxyu.novalocal
5. Verify that it migrated with "nova show" and "nova host-describe"
6. Monitor /var/log/nova/nova-compute.log to see if there are errors
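
Step 6 can be done by filtering the log for the failure text quoted in the description. A minimal sketch, assuming the error lines contain the string "Live Migration failure" as in the message above; the function name `migration_failures` is hypothetical.

```python
def migration_failures(log_lines, marker="Live Migration failure"):
    """Return the log lines that contain the failure marker."""
    return [line.rstrip("\n") for line in log_lines if marker in line]
```

It accepts any iterable of lines, so an open file object for /var/log/nova/nova-compute.log can be passed directly.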

Comment 2 Chandan Kumar 2016-05-19 15:57:54 UTC
This bug is against a Version which has reached End of Life.
If it's still present in a supported release (http://releases.openstack.org), please update Version and reopen.