Bug 1308562 - Ceph monitor doesn't use the correct address for binding in IPv6 deployment
Ceph monitor doesn't use the correct address for binding in IPv6 deployment
Product: Red Hat OpenStack
Classification: Red Hat
Component: openstack-puppet-modules (Show other bugs)
7.0 (Kilo)
Unspecified Unspecified
high Severity high
: ga
: 8.0 (Liberty)
Assigned To: Giulio Fidente
Yogev Rabl
Depends On:
Blocks: 1299613 1302593 1308910 1309816 1309822
  Show dependency treegraph
Reported: 2016-02-15 09:00 EST by Marius Cornea
Modified: 2016-04-07 17:28 EDT (History)
11 users (show)

See Also:
Fixed In Version: openstack-puppet-modules-7.0.14-1.el7ost
Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of:
: 1308910 (view as bug list)
Last Closed: 2016-04-07 17:28:27 EDT
Type: Bug
Regression: ---
Mount Type: ---
Documentation: ---
Verified Versions:
Category: ---
oVirt Team: ---
RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: ---

Attachments (Terms of Use)

External Trackers
Tracker ID Priority Status Summary Last Updated
OpenStack gerrit 280351 None None None 2016-02-16 07:48 EST
Red Hat Product Errata RHEA-2016:0603 normal SHIPPED_LIVE Red Hat OpenStack Platform 8 Enhancement Advisory 2016-04-07 20:53:53 EDT

  None (edit)
Description Marius Cornea 2016-02-15 09:00:46 EST
Description of problem:
IPv6 deployment with 1 ctrl, 3 ceph nodes times out. 

Version-Release number of selected component (if applicable):

How reproducible:

Steps to Reproduce:
1. Deploy IPv6 environment with 1 x ctrl, 1 x compute, 3 ceph nodes 

Actual results:
Deployment gets stuck because the client is trying to reach the ceph mon on the local address while the ceph mon binds on the storage vip address. 

Expected results:
Deployments proceeds.
Comment 1 Giulio Fidente 2016-02-15 09:06:19 EST
it looks like ceph-mon does not find the initial members list and binds on a local ip address from the public network instead of what is configured as mon_host in ceph.conf ... could need a require in the .pp forcing ::mon to be executed only after the config is dumped, investigating.
Comment 2 Giulio Fidente 2016-02-15 11:20:45 EST
the problem is that ceph-mon won't find itself in the list of hosts given by 'mon host' and it will bind on the first local ip on the public network instead of what we pass in 'mon host'

this will be found in the ceph-mon logs

  0 ceph version 0.94.1 (e4bfad3a3c51054df7e537a724c8d0bf9be972ff), process ceph-mon, pid 6053
  0 mon.overcloud-controller-0 does not exist in monmap, will attempt to join an existing cluster
  0 using public_addr [fd00:fd00:fd00:3000::10]:0/0 -> [fd00:fd00:fd00:3000::10]:6789/0

while 'mon_host' in ceph.conf had:

mon_host = [fd00:fd00:fd00:3000::14]

this could be a clone of https://bugzilla.redhat.com/show_bug.cgi?id=1301701
Comment 3 Giulio Fidente 2016-02-15 12:46:38 EST
we might be able to override the standard binding mechanism using 'public addr' but this requires a change in puppet-ceph ; working on it
Comment 4 Jaromir Coufal 2016-02-15 16:20:33 EST
Not a blocker for 7.3. Moving to 8GA.
Comment 9 Jason Guiditta 2016-03-10 11:11:20 EST
Merged upstream, built.
Comment 12 Yogev Rabl 2016-03-29 04:34:57 EDT
Verified by automation
version openstack-puppet-modules-7.0.16-1.el7ost.noarch
Comment 13 errata-xmlrpc 2016-04-07 17:28:27 EDT
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.


Note You need to log in before you can comment on or make changes to this bug.