Bug 1644713 - Ceph-ansible fails in handler with missing data for ceph_osd_container_stat
Summary: Ceph-ansible fails in handler with missing data for ceph_osd_container_stat
Keywords:
Status: CLOSED CURRENTRELEASE
Alias: None
Product: Red Hat Ceph Storage
Classification: Red Hat Storage
Component: Ceph-Ansible
Version: 3.1
Hardware: Unspecified
OS: Unspecified
unspecified
high
Target Milestone: rc
: 3.*
Assignee: Sébastien Han
QA Contact: ceph-qe-bugs
URL:
Whiteboard:
Depends On:
Blocks: 1578730
TreeView+ depends on / blocked
 
Reported: 2018-10-31 13:08 UTC by Tim Rozet
Modified: 2018-11-22 11:08 UTC (History)
7 users (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2018-11-22 11:08:08 UTC
Embargoed:


Attachments (Terms of Use)
ceph log (329.66 KB, text/plain)
2018-10-31 13:10 UTC, Tim Rozet
no flags Details
ceph hieradata (3.67 KB, text/plain)
2018-10-31 13:18 UTC, Tim Rozet
no flags Details

Description Tim Rozet 2018-10-31 13:08:29 UTC
Description of problem:
Deployment fails during ceph-ansible deployment. Only Controller/Compute roles are used in deployment to 5 nodes (3 control, 2 compute). All nodes have ceph osd docker service on them. The ceph-ansible deployment fails with:


The error was: error while evaluating conditional (hostvars[item]['ceph_osd_container_stat'].get('rc') == 0): 'dict object' has no attribute 'ceph_osd_container_stat'

Version-Release number of selected component (if applicable):
ceph-ansible-3.1.9-1.el7.noarch

How reproducible:
Seems to happen about 50% of the time.

Additional info:
The corresponding code that is failing is here:
https://github.com/ceph/ceph-ansible/blob/60bc1e38db0e797ad6553584927f86486ae09c19/roles/ceph-handler/handlers/main.yml#L109

Comment 1 Tim Rozet 2018-10-31 13:10:24 UTC
Created attachment 1499379 [details]
ceph log

Comment 2 Tim Rozet 2018-10-31 13:18:14 UTC
Created attachment 1499383 [details]
ceph hieradata

Comment 4 Tim Rozet 2018-10-31 15:46:26 UTC
Preliminary testing shows that the error does not happen with ceph-ansible-3.1.6. Running more tests to confirm it.

Comment 5 Tim Rozet 2018-11-09 16:37:37 UTC
Confirmed the problem does not happen in ceph-ansible-3.1.6


Note You need to log in before you can comment on or make changes to this bug.