Bug 1548357 - [ceph-ansible] [ceph-container] : during rolling update playbook failing trying to restart mgr without copying restart script to mgr
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Ceph Storage
Classification: Red Hat Storage
Component: Ceph-Ansible
Version: 3.0
Hardware: Unspecified
OS: Unspecified
Priority: unspecified
Severity: high
Target Milestone: z1
Target Release: 3.0
Assignee: Sébastien Han
QA Contact: Vasishta
URL:
Whiteboard:
Depends On:
Blocks:
Reported: 2018-02-23 10:03 UTC by Vasishta
Modified: 2018-03-08 15:54 UTC (History)
CC List: 11 users

Fixed In Version: RHEL: ceph-ansible-3.0.27-1.el7cp Ubuntu: ceph-ansible_3.0.27-2redhat1
Doc Type: Bug Fix
Doc Text:
.A required script is copied when doing a rolling upgrade with Ansible
Previously, when running the `ceph-ansible` rolling update playbook, a required restart script was not copied to the Ceph Manager node if the active Ceph Manager node was not the first node to be upgraded. This caused the rolling update to fail. In this release, the required script is copied to the Ceph Manager node.
Clone Of:
Environment:
Last Closed: 2018-03-08 15:54:03 UTC
Embargoed:


Attachments
File contains contents of inventory file, ansible-playbook log (481.51 KB, text/plain)
2018-02-23 10:03 UTC, Vasishta


Links
System | ID | Private | Priority | Status | Summary | Last Updated
Github | ceph ceph-ansible pull 2400 | 0 | None | None | None | 2018-02-23 10:12:19 UTC
Red Hat Product Errata | RHBA-2018:0474 | 0 | normal | SHIPPED_LIVE | Red Hat Ceph Storage 3.0 bug fix update | 2018-03-08 20:51:53 UTC

Description Vasishta 2018-02-23 10:03:36 UTC
Created attachment 1399797
File contains contents of inventory file, ansible-playbook log

Description of problem:
During a rolling update, the playbook fails when the handler tries to restart the mgr, because the restart script has not been copied to the mgr node.

From what was observed, this happens when the first node the playbook processes is not the active mgr.

Version-Release number of selected component (if applicable):
ceph-ansible-3.0.26-1.el7cp.noarch

How reproducible:
Always (2/2)

Steps to Reproduce:
1. Configure a containerized cluster at the (latest-1) version.
2. Upgrade to the latest version by running rolling_update.


Actual results:
RUNNING HANDLER [ceph-defaults : restart ceph mgr daemon(s) - container] **
failed: [magna035 -> magna043] (item=magna043) => {"changed": false, "cmd": "/tmp/restart_mgr_daemon.sh", "item": "magna043", "msg": "[Errno 2] No such file or directory", "rc": 2}
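For triaging the attached log, a line like the one above can be split into its parts with a short Python sketch. The helper name `parse_failed_line` and the regular expression are illustrative assumptions, not part of ceph-ansible, and they only cover the `failed: [host -> delegate] ... => {json}` shape shown here.

```python
import json
import re

# Matches lines of the form seen above:
#   failed: [host -> delegate] (item=...) => {json payload}
# The "-> delegate" part is optional.
FAILED_RE = re.compile(
    r"^failed: \[(?P<host>[^\s\]]+)(?: -> (?P<delegate>[^\s\]]+))?\]"
    r" .*?=> (?P<payload>\{.*\})\s*$"
)


def parse_failed_line(line):
    """Return host, delegated host, and key result fields, or None if no match."""
    match = FAILED_RE.match(line)
    if match is None:
        return None
    result = json.loads(match.group("payload"))
    return {
        "host": match.group("host"),
        "delegated_to": match.group("delegate"),
        "cmd": result.get("cmd"),
        "msg": result.get("msg"),
        "rc": result.get("rc"),
    }


if __name__ == "__main__":
    line = (
        'failed: [magna035 -> magna043] (item=magna043) => '
        '{"changed": false, "cmd": "/tmp/restart_mgr_daemon.sh", '
        '"item": "magna043", "msg": "[Errno 2] No such file or directory", "rc": 2}'
    )
    print(parse_failed_line(line))
```

Read this way, the handler ran for magna035 but was delegated to magna043, where /tmp/restart_mgr_daemon.sh did not exist.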


Expected results:
The playbook must not try to restart a daemon without first copying the corresponding restart script to its node.

Additional info:

Comment 3 Guillaume Abrioux 2018-02-23 10:19:21 UTC
will be included in v3.0.27

Comment 4 Vasishta 2018-02-23 13:40:35 UTC
Works fine with the following workaround:

List the active mgr's host at the top of the mon group (when monitors and mgrs are collocated) in the inventory file, then run rolling_update again.
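The reordering in this workaround could be sketched in Python roughly as below. This is an illustration under assumptions: the host names (`mgr-a`, `mgr-b`, `mgr-c`, `osd-1`), the `[mons]` and `[osds]` group names, and the helper `move_active_first` are hypothetical and not taken from the attached inventory, and the name of the active mgr has to be looked up separately.

```python
def move_active_first(inventory_text, group, active_host):
    """Return the inventory with active_host moved to the top of [group].

    Only handles a simple INI-style inventory with one host per line.
    """
    lines = inventory_text.splitlines()
    header = f"[{group}]"

    # Find the line index just after the group header.
    start = None
    for index, line in enumerate(lines):
        if line.strip() == header:
            start = index + 1
            break
    if start is None:
        raise ValueError(f"group {header} not found")

    # The group runs until the next section header or the end of the file.
    end = start
    while end < len(lines) and not lines[end].lstrip().startswith("["):
        end += 1

    body = lines[start:end]
    # Compare on the first token so any host variables on the line are kept.
    active = [l for l in body if l.split() and l.split()[0] == active_host]
    if not active:
        raise ValueError(f"{active_host} is not listed in {header}")
    rest = [l for l in body if l not in active]

    return "\n".join(lines[:start] + active + rest + lines[end:])


if __name__ == "__main__":
    inventory = "[mons]\nmgr-a\nmgr-b\nmgr-c\n\n[osds]\nosd-1\n"
    # Hypothetical case: mgr-b is the active mgr and is collocated with a mon.
    print(move_active_first(inventory, "mons", "mgr-b"))
```

Editing the inventory by hand achieves the same thing; the sketch only makes the intended ordering explicit.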

Comment 5 Ken Dreyer (Red Hat) 2018-02-23 21:52:00 UTC
Based on the workaround, retargeting to z2

Comment 12 Vasishta 2018-03-01 18:10:41 UTC
1) Used ceph-ansible-3.0.27-1.el7cp.noarch.

2) Ensured that the active mgr was not listed first in either the mon group or the mgr group.

3) The rolling update worked fine, upgrading the cluster from 3.0 live to ceph-3.0-rhel-7-docker-candidate-99411-20180228192608.


Moving to VERIFIED state.

Regards,
Vasishta Shastry
AQE, Ceph

Comment 16 errata-xmlrpc 2018-03-08 15:54:03 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2018:0474

