Description of problem:

I have the following playbook:
  2 monitors
  1 MDS
  2 OSD servers (2x3 devices)

Full logs: http://magna002.ceph.redhat.com/vasu-2016-05-11_11:33:39-smoke:ceph-ansible-master---basic-clara/230938/teuthology.log

When running it, it fails during the "get ceph version" task:

TASK: [ceph.ceph-common | install rados gateway vhost] ************************
skipping: [clara006.ceph.redhat.com]

TASK: [ceph.ceph-common | enable rados gateway vhost and disable default site] ***
skipping: [clara006.ceph.redhat.com] => (item=a2ensite rgw.conf)
skipping: [clara006.ceph.redhat.com] => (item=a2dissite *default)

TASK: [ceph.ceph-common | install s3gw.fcgi script] ***************************
skipping: [clara006.ceph.redhat.com]

TASK: [ceph.ceph-common | get ceph version] ***********************************
failed: [clara006.ceph.redhat.com] => {"changed": false, "cmd": "ceph --version", "failed": true, "rc": 2}
msg: [Errno 2] No such file or directory

FATAL: all hosts have already failed -- aborting

PLAY RECAP ********************************************************************
           to retry, use: --limit @/home/ubuntu/site.retry

clara006.ceph.redhat.com   : ok=22   changed=1    unreachable=0    failed=1
clara007.ceph.redhat.com   : ok=85   changed=13   unreachable=0    failed=0
clara008.ceph.redhat.com   : ok=157  changed=10   unreachable=0    failed=0
clara010.ceph.redhat.com   : ok=157  changed=10   unreachable=0    failed=0
clara011.ceph.redhat.com   : ok=85   changed=13   unreachable=0    failed=0

2016-05-11T16:15:45.123 ERROR:teuthology.task.ceph_ansible:Failed during ansible execution
I logged into clara006, and no ceph packages are installed there, so that's why ceph --version fails.
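For anyone who wants to reproduce the symptom on a node, here is a minimal Python sketch of the same check. It is not what ceph-ansible runs; the helper name probe_version is made up for illustration. It only shows that running a binary that is not installed raises an OS error with errno 2 (ENOENT), which matches the "[Errno 2] No such file or directory" message in the log above.

```python
import errno
import subprocess


def probe_version(binary):
    """Run `<binary> --version`.

    Returns (output, returncode) when the binary exists, or
    (None, errno) when the executable cannot be found.
    probe_version is a hypothetical helper, not part of ceph-ansible.
    """
    try:
        result = subprocess.run(
            [binary, "--version"],
            capture_output=True,
            text=True,
            check=False,
        )
    except FileNotFoundError as exc:
        # The executable is missing from PATH, e.g. the package is not installed.
        return None, exc.errno
    return result.stdout.strip(), result.returncode


if __name__ == "__main__":
    output, code = probe_version("ceph")
    if output is None and code == errno.ENOENT:
        print("ceph binary not found; the ceph packages are probably not installed")
    else:
        print(output, code)
```

On a host in the same state as clara006, I would expect this to take the "not found" branch.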
I think we need the commits from this upstream PR, because we're not installing the ceph-mds package: https://github.com/ceph/ceph-ansible/pull/763
*** This bug has been marked as a duplicate of bug 1336571 ***