Description of problem: ======================= Ceph Intsallation using ceph-anisble got hung at task 'collect admin and bootstrap keys' Version-Release number of selected component (if applicable): ============================================================= ceph-ansible-1.0.5-3.el7.noarch ceph-10.1.1-1.el7cp.x86_64 How reproducible: ================= 3 times out of 4 Steps to Reproduce: ================== 1. prepare node for ceph installation.(follow prerequisites steps.) 2. create host file having only mon and osd roles. 3.run ansible command with below values ansible-playbook site.yml -vv -i /etc/ansible/hosts_harish --extra-vars '{"ceph_stable": true, "ceph_origin": "distro", "ceph_stable_rh_storage": true, "monitor_interface": "eno1", "journal_collocation": true, "devices": ["/dev/sdb", "/dev/sdc", "/dev/sdd"], "journal_size": 100, "public_network": "10.8.128.0/21", "cephx": true, "fetch_directory": "~/ceph-ansible-keys"}' -u root Actual results: =============== Installation is stuck at below task TASK: [ceph-mon | collect admin and bootstrap keys] *************************** <magna051> REMOTE_MODULE command ceph-create-keys --cluster ceph --id magna051 <magna063> REMOTE_MODULE command ceph-create-keys --cluster ceph --id magna063 <magna067> REMOTE_MODULE command ceph-create-keys --cluster ceph --id magna067 ok: [magna067] => {"changed": false, "cmd": ["ceph-create-keys", "--cluster", "ceph", "--id", "magna067"], "delta": "0:00:01.251416", "end": "2016-04-20 16:13:46.546810", "failed": false, "failed_when_result": false, "rc": 0, "start": "2016-04-20 16:13:45.295394", "stderr": "INFO:ceph-create-keys:Talking to monitor...\nINFO:ceph-create-keys:Talking to monitor...\n2016-04-20 16:13:45.860068 7fe341032700 -1 asok(0x7fe33c001680) AdminSocketConfigObs::init: failed: AdminSocket::bind_and_listen: failed to bind the UNIX domain socket to '/var/run/ceph/rbd-clients/ceph-client.admin.18881.140613940942768.asok': (2) No such file or directory\nINFO:ceph-create-keys:Talking to monitor...\n2016-04-20 16:13:46.103704 7f6725e22700 -1 asok(0x7f6720001680) AdminSocketConfigObs::init: failed: AdminSocket::bind_and_listen: failed to bind the UNIX domain socket to '/var/run/ceph/rbd-clients/ceph-client.admin.18906.140080895236016.asok': (2) No such file or directory\nINFO:ceph-create-keys:Talking to monitor...\n2016-04-20 16:13:46.372301 7febb3864700 -1 asok(0x7febac001680) AdminSocketConfigObs::init: failed: AdminSocket::bind_and_listen: failed to bind the UNIX domain socket to '/var/run/ceph/rbd-clients/ceph-client.admin.18945.140650179729328.asok': (2) No such file or directory", "stdout": "", "warnings": []}
I think this might be a duplicate of https://bugzilla.redhat.com/show_bug.cgi?id=1329013 Rachana, When you see this issue, does the 'mon addr' section of ceph.conf for each mon contain '0.0.0.0'? I think the problem here is a failure to communicate to the monitors. You're only seeing the warning about the rbd client admin socket because of unneeded config in ceph.conf. This BZ address that: https://bugzilla.redhat.com/show_bug.cgi?id=1328594
Andrew, ceph.conf has '0.0.0.0' in mon section.
Andrew is is a duplicate?
(In reply to Gregory Meno from comment #5) > Andrew is is a duplicate? Yes, I believe it is caused by the same problem as https://bugzilla.redhat.com/show_bug.cgi?id=1329013 It should be fixed in ceph-ansible-1.0.5-4.el7 Rachana, can you confirm that this is no longer an issue with the latest version of ceph-ansible?
Andrew, yes it is not coming with latest version.
Rachana, this sounds like a candidate for moving to Verified. Would you please qa_ack, clear needinfo and move to Verified state?
working with latest build ceph-ansible-1.0.5-4.el7. Hence moving to verified.
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHEA-2016:1754