Created attachment 1601912 [details] File contains playbook log Description of problem: Rolling update from ceph-ansible 2.x to 3.x failed in task "ceph-osd : run the systemd-device-to-id.sh script" saying "No such file or directory" It seems like task which copies the script has not been delegated to other nodes. Version-Release number of selected component (if applicable): ceph-ansible-3.2.22-1.el7cp.noarch How reproducible: Always (1/1) Steps to Reproduce: 1. Get a RHCS 2.x containerized cluster (With OSDs having device name in their service name) 2. Try to upgrade it to 3.3 Actual results: "bash: /tmp/systemd-device-to-id.sh: No such file or directory" Expected results: rolling-update must complete successfully Additional info:
Created attachment 1601932 [details] File contains playbook log I think following lines from start_osds.yml needs to be removed https://github.com/ceph/ceph-ansible/blob/stable-3.2/roles/ceph-osd/tasks/start_osds.yml#L131-L132 It seemed to be be working for me, cluster got updated and all new OSD services are up. But old services (service with device name) were present and flapping on nodes on which script was not run first time when I had initiated (Logs of run 1 is at previous attachment). Regards, Vasishta Shastry QE, Ceph
Working fine with ceph-ansible-3.2.24-1.el7cp.noarch Moving to VERIFIED state
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHSA-2019:2538