Bug 1382704

Summary: OSPD failed to deploy OSD storage nodes with multiple disks
Product: Red Hat OpenStack
Reporter: Yogev Rabl <yrabl>
Component: openstack-tripleo-heat-templates
Assignee: Giulio Fidente <gfidente>
Status: CLOSED NOTABUG
QA Contact: Yogev Rabl <yrabl>
Severity: high
Docs Contact:
Priority: urgent
Version: 10.0 (Newton)
CC: gfidente, mburns, mcornea, rhel-osp-director-maint, yrabl
Target Milestone: ---
Keywords: Regression
Target Release: 10.0 (Newton)
Hardware: x86_64
OS: Linux
Last Closed: 2016-10-13 07:57:30 UTC
Type: Bug
Attachments:
- openstack-tripleo-heat-templates dir (flags: none)
- heat-engine.log (flags: none)

Description Yogev Rabl 2016-10-07 13:01:09 UTC
Description of problem:
The goal of the deployment is to install an overcloud HA environment with:
- 3 controllers 
- 2 computes 
- 3 Ceph storage nodes (1 disk as root, 10 for OSDs and 1 for journaling)

The deployment failed while deploying the Ceph storage nodes.

Attached:
- templates directory
- heat-engine.log


Version-Release number of selected component (if applicable):
openstack-tripleo-heat-templates-5.0.0-0.20160929150845.4cdc4fc.el7ost.noarch

How reproducible:
100% (tried twice)

Steps to Reproduce:
Deploy the overcloud with the following command (see the attached templates dir):
openstack overcloud deploy \
--templates \
-e /usr/share/openstack-tripleo-heat-templates/environments/puppet-pacemaker.yaml \
-e /usr/share/openstack-tripleo-heat-templates/environments/network-isolation.yaml \
-e /usr/share/openstack-tripleo-heat-templates/environments/network-environment.yaml \
-e /usr/share/openstack-tripleo-heat-templates/environments/net-two-nic-with-vlans.yaml \
-e /usr/share/openstack-tripleo-heat-templates/environments/ceph-internal.yaml \
--control-flavor control \
--control-scale 3 \
--ceph-storage-flavor ceph-storage \
--ceph-storage-scale 3 \
--compute-flavor compute \
--compute-scale 3 \
--libvirt-type qemu \
--ntp-server clock.redhat.com

Actual results:
The deployment failed. The output of "ceph osd tree" shows only one OSD per storage node:
ID WEIGHT  TYPE NAME                        UP/DOWN REWEIGHT PRIMARY-AFFINITY
-1 0.29279 root default
-2 0.09760     host overcloud-cephstorage-1
 0 0.09760         osd.0                         up  1.00000          1.00000
-3 0.09760     host overcloud-cephstorage-0
 1 0.09760         osd.1                         up  1.00000          1.00000
-4 0.09760     host overcloud-cephstorage-2
 2 0.09760         osd.2                         up  1.00000          1.00000

Expected results:
The deployment should succeed, with 10 OSDs on each storage node.
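
The gap between the actual and expected results can be checked mechanically. Below is a minimal sketch, assuming the plain-text column layout of the "ceph osd tree" output shown above (bucket type or OSD name in the third column, host name in the fourth). The function name count_osds_per_host is hypothetical and not part of Ceph.

```shell
# Count OSDs per host from plain-text `ceph osd tree` output read on stdin.
# Assumes the column layout shown in this report: ID, WEIGHT, then either
# "host <name>" or "osd.<n>" starting in the third column.
count_osds_per_host() {
    awk '
        # A host bucket line starts a new group; remember the order seen.
        $3 == "host" { host = $4; count[host] = 0; order[++n] = host; next }
        # An OSD line belongs to the most recently seen host.
        $3 ~ /^osd\.[0-9]+$/ && host != "" { count[host]++ }
        END { for (i = 1; i <= n; i++) print order[i], count[order[i]] }
    '
}

# Usage (on a node with Ceph admin access):
#   sudo ceph osd tree | count_osds_per_host
```

For the output in this report, each of the three hosts would be listed with a count of 1, against the expected 10 per host.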

Additional info:

Comment 1 Yogev Rabl 2016-10-07 13:02:23 UTC
Created attachment 1208149
openstack-tripleo-heat-templates dir

Comment 2 Yogev Rabl 2016-10-07 13:03:19 UTC
Created attachment 1208150
heat-engine.log

Comment 4 Giulio Fidente 2016-10-12 11:48:41 UTC
This sounds like a duplicate of https://bugzilla.redhat.com/show_bug.cgi?id=1304401#c9

Could you attach the output of:

# sudo ceph-disk list

from a ceph-storage node?