Bug 1250654
Summary: | rhel-osp-director: overcloud deployment fails on "CephStorageDeployment_Step1", Error: /Stage[main]/Ceph::Osds/Ceph::Osd[/srv/data]/Exec[ceph-osd-activate-/srv/data]/returns: change from notrun to 0 failed: Command exceeded timeout. | ||||||||
---|---|---|---|---|---|---|---|---|---|
Product: | Red Hat OpenStack | Reporter: | Alexander Chuzhoy <sasha> | ||||||
Component: | rhosp-director | Assignee: | Jiri Stransky <jstransk> | ||||||
Status: | CLOSED DUPLICATE | QA Contact: | yeylon <yeylon> | ||||||
Severity: | high | Docs Contact: | |||||||
Priority: | high | ||||||||
Version: | unspecified | CC: | djuran, jdonohue, jstransk, mburns, morazi, rhel-osp-director-maint, rnishtal, sasha, srevivo | ||||||
Target Milestone: | y2 | Keywords: | Reopened, ZStream | ||||||
Target Release: | 7.0 (Kilo) | ||||||||
Hardware: | Unspecified | ||||||||
OS: | Unspecified | ||||||||
Whiteboard: | |||||||||
Fixed In Version: | Doc Type: | Bug Fix | |||||||
Doc Text: | Story Points: | --- | |||||||
Clone Of: | Environment: | ||||||||
Last Closed: | 2015-11-04 17:20:26 UTC | Type: | Bug | ||||||
Regression: | --- | Mount Type: | --- | ||||||
Documentation: | --- | CRM: | |||||||
Verified Versions: | Category: | --- | |||||||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||||
Cloudforms Team: | --- | Target Upstream Version: | |||||||
Embargoed: | |||||||||
Bug Depends On: | 1245737 | ||||||||
Bug Blocks: | 1191185, 1243520 | ||||||||
Attachments: |
|
Description
Alexander Chuzhoy
2015-08-05 16:53:37 UTC
Created attachment 1059571
heat-engine from the undercloud
Created attachment 1059572
messages file from ceph and heat logs from the undercloud.
I think this is related to the CLI/template changes that jistr put in for bug 1247585.

@mburns yeah it could be. @sasha what was the command line you used to deploy? Please try passing the environment file as described here: https://bugzilla.redhat.com/show_bug.cgi?id=1247585#c6

Here's the command I use (same as on the last puddle):

    openstack overcloud deploy --plan overcloud --control-scale 1 --compute-scale 1 --ceph-storage-scale 1 --block-storage-scale 0 --swift-storage-scale 0 -e /home/stack/network-environment.yaml --ntp-server [IP] --timeout 90

No yaml file for cinder.

@sasha -- can you try passing -e /usr/share/openstack-tripleo-heat-templates/environments/storage-environment.yaml as well and see if that works?

Environment:
openstack-tripleo-heat-templates-0.8.6-45.el7ost.noarch

The file /usr/share/openstack-tripleo-heat-templates/environments/storage-environment.yaml doesn't exist.

(In reply to Alexander Chuzhoy from comment #9)
> Environment:
> openstack-tripleo-heat-templates-0.8.6-45.el7ost.noarch
>
> The file
> /usr/share/openstack-tripleo-heat-templates/environments/storage-environment.yaml
> doesn't exist.

Note: this was resolved in a conversation. The fix requires 0.8.6-46, not -45.

Was able to deploy the overcloud using this command:

    openstack overcloud deploy --templates --control-scale 1 --compute-scale 1 --ceph-storage-scale 1 --block-storage-scale 0 --swift-storage-scale 0 -e /home/stack/network-environment.yaml --ntp-server [IP] --timeout 90 -e /usr/share/openstack-tripleo-heat-templates/environments/storage-environment.yaml

Using this THT build:
openstack-tripleo-heat-templates-0.8.6-46.el7ost.noarch

Based on comment 11, this is NOTABUG.

Re-opening this bug. As discussed on IRC, if a user chooses to install a Ceph node, we should provide a reasonable default, or at least point out that the template is needed. Failing the deployment with a non-obvious error message is not a good option.

We already had a smart default, but it wasn't overridable, which made a number of storage configurations impossible (see bug 1247585). We had to remove the smart default in favor of configurability. Re-adding that smart default should be possible once we have parameter overridability on the CLI (bug 1245737).

We are planning on providing this functionality via the param override functionality in https://bugzilla.redhat.com/show_bug.cgi?id=1245737 and we should track it there. If this solution is insufficient, please feel free to reopen this bug so we can track it distinctly.

*** This bug has been marked as a duplicate of bug 1245737 ***

The following files in puppet/manifests were given a hardcoded Exec timeout so that the Ceph installation could go through.

overcloud_cephstorage.pp

    Exec {
      timeout => 9000,
    }

    if str2bool(hiera('ceph_osd_selinux_permissive', true)) {

overcloud_controller.pp

    Exec {
      timeout => 9000,
    }

overcloud_controller_pacemaker.pp

    Exec {
      timeout => 9000,
    }

    if hiera('step') >= 1 {

The timeout has been increased to 9000.
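
Regarding the request earlier in this thread to at least point out that the storage environment file is needed: until a default is restored, a small pre-flight check on the undercloud could catch the missing file before the deployment starts. The following is only a sketch. The function name check_storage_env and its arguments are hypothetical and not part of the director tooling; the file path and package versions are the ones reported in this bug.

```shell
#!/bin/sh
# Hypothetical pre-flight helper, not part of rhosp-director.
# Usage: check_storage_env <ceph-storage-scale> [environment-file]
# Prints the extra "-e <file>" argument when Ceph nodes are requested and
# the environment file exists; fails with a readable message otherwise.

# Path taken from this bug report.
STORAGE_ENV_DEFAULT=/usr/share/openstack-tripleo-heat-templates/environments/storage-environment.yaml

check_storage_env() {
    cse_scale="$1"
    cse_file="${2:-$STORAGE_ENV_DEFAULT}"

    # No Ceph storage nodes requested: nothing to add, nothing to check.
    if [ "$cse_scale" -eq 0 ]; then
        return 0
    fi

    if [ ! -f "$cse_file" ]; then
        # In this bug the file was absent in THT 0.8.6-45 and present in 0.8.6-46.
        echo "ERROR: $cse_file not found; check the installed openstack-tripleo-heat-templates build" >&2
        return 1
    fi

    echo "-e $cse_file"
}
```

With the helper sourced, the working command from this thread could be guarded as follows (again an illustration only):

    extra=$(check_storage_env 1) && openstack overcloud deploy --templates --control-scale 1 --compute-scale 1 --ceph-storage-scale 1 --block-storage-scale 0 --swift-storage-scale 0 -e /home/stack/network-environment.yaml --ntp-server [IP] --timeout 90 $extra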