Bug 1783299 - Ceph deployment failed when attempted to create customized pools
Summary: Ceph deployment failed when attempted to create customized pools
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat OpenStack
Classification: Red Hat
Component: tripleo-ansible
Version: 16.0 (Train)
Hardware: Unspecified
OS: Unspecified
high
high
Target Milestone: rc
: 16.0 (Train on RHEL 8.1)
Assignee: Giulio Fidente
QA Contact: Yogev Rabl
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2019-12-13 14:30 UTC by Yogev Rabl
Modified: 2023-09-14 05:48 UTC (History)
9 users (show)

Fixed In Version: tripleo-ansible-0.4.2-0.20200110023759.ee731ba.el8ost
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2020-02-06 14:44:12 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)
environment files and logs (3.00 MB, application/gzip)
2019-12-13 14:30 UTC, Yogev Rabl
no flags Details
overcloud deployment log file (7.28 MB, application/gzip)
2020-01-09 20:16 UTC, Yogev Rabl
no flags Details


Links
System ID Private Priority Status Summary Last Updated
Launchpad 1854209 0 None None None 2020-01-10 17:29:42 UTC
OpenStack gerrit 702015 0 None MERGED Remove extra backslash in awk for tripleo-ceph-uuid role 2020-01-23 19:22:57 UTC
Red Hat Product Errata RHEA-2020:0283 0 None None None 2020-02-06 14:44:39 UTC

Description Yogev Rabl 2019-12-13 14:30:13 UTC
Created attachment 1644872 [details]
environment files and logs

Description of problem:
An overcloud deployment with Ceph with customized crush maps, pools and an additional pool for Cinder as a second back end failed with the errors:
"failed: [ceph-5 -> 192.168.24.46] (item=[{'application': 'rbd', 'name': 'tier2', 'pg_num': 64, 'rule_name': 'fast'}, {'msg': 'non-zero return code', 
'cmd': ['podman', 'exec', 'ceph-mon-controller-0', 'ceph', '--cluster', 'ceph', 'osd', 'pool', 'get', 'tier2', 'size'], 'stdout': '', 'stderr': \"Error ENOENT
: unrecognized pool 'tier2'\\nError: exit status 2\", 'rc': 2, 'start': '2019-12-12 17:36:14.986390', 'end': '2019-12-12 17:36:15.728381', 'delta': '0:00:00.7
41991', 'changed': False, 'failed': False, 'invocation': {'module_args': {'_raw_params': 'podman exec ceph-mon-controller-0 ceph --cluster ceph osd pool get t
ier2 size\\n', 'warn': True, '_uses_shell': False, 'stdin_add_newline': True, 'strip_empty_ends': True, 'argv': None, 'chdir': None, 'executable': None, 'crea
tes': None, 'removes': None, 'stdin': None}}, 'stdout_lines': [], 'stderr_lines': [\"Error ENOENT: unrecognized pool 'tier2'\", 'Error: exit status 2'], 'fail
ed_when_result': False, 'item': {'application': 'rbd', 'name': 'tier2', 'pg_num': 64, 'rule_name': 'fast'}, 'ansible_loop_var': 'item'}]) => changed=false ",


Version-Release number of selected component (if applicable):
ceph-ansible-4.0.5-1.el8cp.noarch


How reproducible:
100% (happened 3 times) 

Steps to Reproduce:
1. Deploy an overcloud with customized pools and crush map


Actual results:
the deployment fails during the Ceph cluster deployment

Expected results:
The ceph cluster is deployed with the customized parameters

Additional info:

Comment 5 Yogev Rabl 2020-01-09 20:16:19 UTC
Created attachment 1651066 [details]
overcloud deployment log file

Comment 6 Yogev Rabl 2020-01-09 20:17:12 UTC
Tried to add the flag and it didn't work, I have attached the logs to the bug

Comment 7 Giulio Fidente 2020-01-10 16:19:23 UTC
host_vars are empty but nodes_uuid_data.json isn't , we need to make sure the uuids match

Comment 12 Yogev Rabl 2020-01-14 18:10:43 UTC
The fix is not in the latest compose

Comment 15 Giulio Fidente 2020-01-15 12:45:57 UTC
Moving to VERIFIED and tracking the new ceph-ansible issue via BZ#1791283

Comment 19 errata-xmlrpc 2020-02-06 14:44:12 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHEA-2020:0283

Comment 20 Red Hat Bugzilla 2023-09-14 05:48:41 UTC
The needinfo request[s] on this closed bug have been removed as they have been unresolved for 1000 days


Note You need to log in before you can comment on or make changes to this bug.