Note: This bug is displayed in read-only format because the product is no longer active in Red Hat Bugzilla.
This project is now read‑only. Starting Monday, February 2, please use Jira Cloud for all bug tracking management.

Bug 1484189

Summary: rhosp-director: HA overcloud with ceph and IPV6 fails
Product: [Red Hat Storage] Red Hat Ceph Storage Reporter: Alexander Chuzhoy <sasha>
Component: Ceph-AnsibleAssignee: Sébastien Han <shan>
Status: CLOSED ERRATA QA Contact: ceph-qe-bugs <ceph-qe-bugs>
Severity: high Docs Contact:
Priority: high    
Version: 3.0CC: adeza, aschoen, bengland, ceph-eng-bugs, dbecker, dwilson, gfidente, gmeno, jefbrown, johfulto, kdreyer, mburns, mcornea, morazi, nthomas, rhel-osp-director-maint, sankarshan, seb, twilkins
Target Milestone: rcKeywords: Triaged
Target Release: 3.0   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: RHEL: ceph-ansible-3.0.0-0.1.rc5.el7cp Ubuntu: ceph-ansible_3.0.0~rc5-2redhat1 Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2017-12-05 23:39:47 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Attachments:
Description Flags
ansible_run.log none

Description Alexander Chuzhoy 2017-08-23 01:14:09 UTC
rhosp-director: HA overcloud with ceph and IPV6 fails

Environment:
puppet-ceph-2.3.1-0.20170805094345.868e6d6.el7ost.noarch
instack-undercloud-7.2.1-0.20170729010706.el7ost.noarch
ceph-ansible-3.0.0-0.1.rc3.el7cp.noarch
openstack-tripleo-heat-templates-7.0.0-0.20170805163048.el7ost.noarch
openstack-puppet-modules-10.0.0-0.20170315222135.0333c73.el7.1.noarch

Steps to reproduce:
Attempt to deploy overcloud with IPV6:

openstack overcloud deploy --templates \
--libvirt-type kvm \
-e /usr/share/openstack-tripleo-heat-templates/environments/docker.yaml \
-e /usr/share/openstack-tripleo-heat-templates/environments/docker-ha.yaml \
-e /home/stack/templates/nodes_data.yaml \
-e  /usr/share/openstack-tripleo-heat-templates/environments/ceph-ansible/ceph-ansible.yaml \
-e /usr/share/openstack-tripleo-heat-templates/environments/network-isolation-v6.yaml \
-e /home/stack/virt/network/network-environment-v6.yaml \
-e /home/stack/rhos12.yaml

Result:
(undercloud) [stack@undercloud-0 ~]$ openstack stack failures list overcloud
overcloud.AllNodesDeploySteps.WorkflowTasks_Step2_Execution:
  resource_type: OS::Mistral::ExternalResource
  physical_resource_id: e652d2b4-acc5-408f-9dbe-3339a5bc98a7
  status: CREATE_FAILED
  status_reason: |
    resources.WorkflowTasks_Step2_Execution: ERROR



(undercloud) [stack@undercloud-0 ~]$ heat resource-list -n5 overcloud |grep -v COMPLE
WARNING (shell) "heat resource-list" is deprecated, please use "openstack stack resource list" instead
+----------------------------------------------+--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+--------------------------------------------------------------------------------------------------------------------------------+-----------------+----------------------+----------------------------------------------------------------------------------------------------------------------------------------------------------+
| resource_name                                | physical_resource_id                                                                                                                                                                 | resource_type                                                                                                                  | resource_status | updated_time         | stack_name                                                                                                                                               |
+----------------------------------------------+--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+--------------------------------------------------------------------------------------------------------------------------------+-----------------+----------------------+----------------------------------------------------------------------------------------------------------------------------------------------------------+
| AllNodesDeploySteps                          | c56e3d5f-0316-4aac-914d-2f2411043482                                                                                                                                                 | OS::TripleO::PostDeploySteps                                                                                                   | CREATE_FAILED   | 2017-08-23T00:08:51Z | overcloud                                                                                                                                                |
| WorkflowTasks_Step2_Execution                | e652d2b4-acc5-408f-9dbe-3339a5bc98a7                                                                                                                                                 | OS::Mistral::ExternalResource                                                                                                  | CREATE_FAILED   | 2017-08-23T00:21:56Z | overcloud-AllNodesDeploySteps-yf77rtka7ojp                                                                                                               |
+----------------------------------------------+--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+--------------------------------------------------------------------------------------------------------------------------------+-----------------+----------------------+----------------------------------------------------------------------------------------------------------------------------------------------------------+

Comment 2 Giulio Fidente 2017-08-23 15:06:22 UTC
Created attachment 1317142 [details]
ansible_run.log

Looks like a legit error happening during a task in ceph-ansible when using ipv6 (probably hardcoded references to the ipv4 addressing). I am attaching the ansible execution log.

Comment 4 seb 2017-08-23 17:11:31 UTC
Fix here: https://github.com/ceph/ceph-ansible/pull/1798

Comment 5 Alexander Chuzhoy 2017-08-24 03:00:10 UTC
Was able to deploy using the fix in comment #4

Comment 6 seb 2017-08-24 09:38:20 UTC
Thanks Alex for the quick test.
Moving this to POST.

Comment 7 Ken Dreyer (Red Hat) 2017-08-31 21:41:19 UTC
Sébastien would you please tag and announce a new upstream ceph-ansible version that includes this change? Then we can rebase to that downstream.

Comment 9 Giulio Fidente 2017-09-20 12:53:28 UTC
Ken, looks like this can be moved to ON_QA?

Comment 11 Alexander Chuzhoy 2017-09-20 16:22:22 UTC
The issue doesn't reproduce for me using ceph-ansible-3.0.0-0.1.rc4.el7cp.noarch

I notice that this version of RPM already includes this patch:
https://github.com/ceph/ceph-ansible/pull/1798


Successfully deployed and populated OC with IPV6 (docker{,-ha}.yaml are included by default now :

openstack overcloud deploy --templates \
--libvirt-type kvm \
-e /home/stack/templates/nodes_data.yaml \
-e  /usr/share/openstack-tripleo-heat-templates/environments/ceph-ansible/ceph-ansible.yaml \
-e /usr/share/openstack-tripleo-heat-templates/environments/network-isolation-v6.yaml \
-e /home/stack/virt/network/network-environment-v6.yaml \
-e /usr/share/openstack-tripleo-heat-templates/environments/ssl/enable-tls.yaml \
-e /home/stack/virt/public_vip.yaml \
-e /usr/share/openstack-tripleo-heat-templates/environments/ssl/tls-endpoints-public-ip.yaml \
-e /home/stack/inject-trust-anchor-hiera.yaml \
-e /home/stack/rhos12.yaml

Verifying.

Comment 12 Tim Wilkinson 2017-10-11 14:12:07 UTC
I am seeing the same error with ceph-ansible-3.0.0-0.1.rc4.el7cp.noarch. Am I missing something else?

 ...
2017-10-11 14:02:25Z [overcloud.AllNodesDeploySteps.WorkflowTasks_Step2]: CREATE_COMPLETE  state changed
2017-10-11 14:02:25Z [overcloud.AllNodesDeploySteps.WorkflowTasks_Step2_Execution]: CREATE_IN_PROGRESS  state changed
2017-10-11 14:05:20Z [overcloud.AllNodesDeploySteps.WorkflowTasks_Step2_Execution]: CREATE_FAILED  resources.WorkflowTasks_Step2_Execution: ERROR
2017-10-11 14:05:21Z [overcloud.AllNodesDeploySteps]: CREATE_FAILED  Resource CREATE failed: resources.WorkflowTasks_Step2_Execution: ERROR
2017-10-11 14:05:22Z [overcloud.AllNodesDeploySteps]: CREATE_FAILED  resources.AllNodesDeploySteps: Resource CREATE failed: resources.WorkflowTasks_Step2_Execution: ERROR
2017-10-11 14:05:22Z [overcloud]: CREATE_FAILED  Resource CREATE failed: resources.AllNodesDeploySteps: Resource CREATE failed: resources.WorkflowTasks_Step2_Execution: ERROR

 Stack overcloud CREATE_FAILED 

overcloud.AllNodesDeploySteps.WorkflowTasks_Step2_Execution:
  resource_type: OS::Mistral::ExternalResource
  physical_resource_id: 42021a2f-93c5-4643-a620-2dcae0165fe2
  status: CREATE_FAILED
  status_reason: |
    resources.WorkflowTasks_Step2_Execution: ERROR



$ rpm -qa|grep ansible
ansible-2.3.2.0-2.el7.noarch
ceph-ansible-3.0.0-0.1.rc4.el7cp.noarch

Comment 13 Tim Wilkinson 2017-10-11 14:15:34 UTC
From the ceph-install-workflow.log ...

2017-10-11 10:05:15,981 p=29317 u=mistral |  skipping: [192.168.24.53]
2017-10-11 10:05:15,992 p=29317 u=mistral |  TASK [ceph-mon : delete populate-kv-store docker] ******************************
2017-10-11 10:05:16,006 p=29317 u=mistral |  skipping: [192.168.24.53]
2017-10-11 10:05:16,018 p=29317 u=mistral |  TASK [ceph-mon : generate systemd unit file] ***********************************
2017-10-11 10:05:16,345 p=29317 u=mistral |  fatal: [192.168.24.53]: FAILED! => {"changed": false, "failed": true, "msg": "AnsibleUndefinedVariable: 'dict object' has no attribute u'ipv4'"}
2017-10-11 10:05:16,345 p=29317 u=mistral |  RUNNING HANDLER [ceph-defaults : copy mon restart script] **********************
2017-10-11 10:05:16,346 p=29317 u=mistral |  RUNNING HANDLER [ceph-defaults : restart ceph mon daemon(s)] *******************
2017-10-11 10:05:16,346 p=29317 u=mistral |  RUNNING HANDLER [ceph-defaults : copy osd restart script] **********************
2017-10-11 10:05:16,346 p=29317 u=mistral |  RUNNING HANDLER [ceph-defaults : restart containerized ceph osds daemon(s)] ****
2017-10-11 10:05:16,347 p=29317 u=mistral |  RUNNING HANDLER [ceph-defaults : restart non-containerized ceph osds daemon(s)] ***
2017-10-11 10:05:16,347 p=29317 u=mistral |  RUNNING HANDLER [ceph-defaults : restart ceph mdss] ****************************
2017-10-11 10:05:16,347 p=29317 u=mistral |  RUNNING HANDLER [ceph-defaults : restart ceph rgws] ****************************
2017-10-11 10:05:16,348 p=29317 u=mistral |  PLAY RECAP *********************************************************************
2017-10-11 10:05:16,348 p=29317 u=mistral |  192.168.24.53              : ok=27   changed=2    unreachable=0    failed=1
2017-10-11 10:05:16,348 p=29317 u=mistral |  192.168.24.57              : ok=1    changed=0    unreachable=0    failed=0
2017-10-11 10:05:16,348 p=29317 u=mistral |  192.168.24.58              : ok=1    changed=0    unreachable=0    failed=0

Comment 14 Giulio Fidente 2017-10-11 14:42:50 UTC
Tim, this was fixed in version ceph-ansible-3.0.0-0.1.rc5.el7cp

Comment 29 errata-xmlrpc 2017-12-05 23:39:47 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2017:3387