Bug 1564183

Summary: deployment of overcloud with Controller-no-ceph and Hci-all-ceph failed
Product: Red Hat OpenStack Reporter: Yogev Rabl <yrabl>
Component: openstack-tripleo-validationsAssignee: RHOS Maint <rhos-maint>
Status: CLOSED NOTABUG QA Contact: nlevinki <nlevinki>
Severity: unspecified Docs Contact:
Priority: unspecified    
Version: 13.0 (Queens)CC: jjoyce, johfulto, jschluet, mburns, rhel-osp-director-maint, slinaber, tvignaud
Target Milestone: ---   
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2018-04-18 14:14:21 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:

Description Yogev Rabl 2018-04-05 15:03:47 UTC
Description of problem:
The deployment of an overcloud with the roles controller-no-ceph and Hci-all-ceph failed with the error:
overcloud.HciCephAllAllNodesValidationDeployment.1:
  resource_type: OS::Heat::StructuredDeployment
  physical_resource_id: 41dd2d31-1d40-4897-8bbe-e0a635aa8d89
  status: CREATE_FAILED
  status_reason: |
    Error: resources[1]: Deployment to server failed: deploy_status_code : Deployment exited with non-zero status code: 1
  deploy_stdout: |
    Trying to ping 10.0.0.102 for local network 10.0.0.0/24.
    Ping to 10.0.0.102 failed. Retrying...
    Ping to 10.0.0.102 failed. Retrying...
    Ping to 10.0.0.102 failed. Retrying...
    Ping to 10.0.0.102 failed. Retrying...
    Ping to 10.0.0.102 failed. Retrying...
    Ping to 10.0.0.102 failed. Retrying...
    Ping to 10.0.0.102 failed. Retrying...
    Ping to 10.0.0.102 failed. Retrying...
    Ping to 10.0.0.102 failed. Retrying...
    Ping to 10.0.0.102 failed. Retrying...
    FAILURE
  deploy_stderr: |
    10.0.0.102 is not pingable. Local Network: 10.0.0.0/24
overcloud.ControllerNoCephAllNodesValidationDeployment.0:
  resource_type: OS::Heat::StructuredDeployment
  physical_resource_id: f7cfe822-8488-4e93-8f6a-ee7777330438
  status: CREATE_FAILED
  status_reason: |
    Error: resources[0]: Deployment to server failed: deploy_status_code : Deployment exited with non-zero status code: 1
  deploy_stdout: |
    Trying to ping 10.0.0.102 for local network 10.0.0.0/24.
    Ping to 10.0.0.102 failed. Retrying...
    Ping to 10.0.0.102 failed. Retrying...
    Ping to 10.0.0.102 failed. Retrying...
    Ping to 10.0.0.102 failed. Retrying...
    Ping to 10.0.0.102 failed. Retrying...
    Ping to 10.0.0.102 failed. Retrying...
    Ping to 10.0.0.102 failed. Retrying...
    Ping to 10.0.0.102 failed. Retrying...
    Ping to 10.0.0.102 failed. Retrying...
    Ping to 10.0.0.102 failed. Retrying...
    FAILURE
  deploy_stderr: |
    10.0.0.102 is not pingable. Local Network: 10.0.0.0/24

The deployment topology is
  - 3 controller nodes with no ceph
  - 3 Hci nodes with all of the ceph roles in it
all of the nodes had IP addresses in the 10.0.0.0/24 local network and all of them were reachable from the undercloud

Version-Release number of selected component (if applicable):
openstack-tripleo-validations-8.4.0-1.el7ost.noarch

How reproducible:
unknown

Steps to Reproduce:
1. Create role_data.yaml file from the roles ControllerNoCeph and HciAllCeph
2. Set the deployment configuration: 
parameter_defaults:
    ControllerNoCephCount: 3
    HciCephAllCount: 3
3. Deploy the overcloud

Actual results:
The deployment failed with the validation attempting to ping a non existing IP address in the 10.0.0.0/24 local network

Expected results:
The validation pings the right IP addresses

Additional info: