Bug 1373919

Summary: [ceph-ansible] : rolling update will fail if cluster name is other than 'ceph'
Product: [Red Hat Storage] Red Hat Storage Console Reporter: Rachana Patel <racpatel>
Component: ceph-ansibleAssignee: Sébastien Han <shan>
Status: CLOSED ERRATA QA Contact: Rachana Patel <racpatel>
Severity: urgent Docs Contact:
Priority: unspecified    
Version: 2CC: adeza, aschoen, ceph-eng-bugs, gmeno, kdreyer, nthomas, racpatel, rghatvis, sankarshan, seb, vsarmila
Target Milestone: ---   
Target Release: 2   
Hardware: x86_64   
OS: Linux   
Whiteboard:
Fixed In Version: ceph-ansible-1.0.5-34.el7scon Doc Type: Bug Fix
Doc Text:
Rolling upgrade fails when a custom cluster name other than "ceph" is used as it causes the ceph-ansible play to abort. To overcome this behavior, include the flags to indicate the cluster name, defaulting to 'ceph' when unspecified. As a result, the Ansible playbook succeeds with custom cluster names
Story Points: ---
Clone Of: Environment:
Last Closed: 2016-10-19 15:22:20 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 1357777    

Description Rachana Patel 2016-09-07 12:28:55 UTC
Description of problem:
======================
rolling_update.yml will fail at multiple step if cluster name is other than ceph.

e.g.
TASK: [set osd flags] ********************************************************* 
failed: [magna100 -> magna095] => (item=noout) => {"changed": true, "cmd": ["ceph", "osd", "set", "noout"], "delta": "0:00:00.067159", "end": "2016-09-01 21:15:46.597727", "item": "noout", "rc": 1, "start": "2016-09-01 21:15:46.530568", "warnings": []}
stderr: Error initializing cluster client: Error('error calling conf_read_file: error code 22',)
failed: [magna100 -> magna095] => (item=noscrub) => {"changed": true, "cmd": ["ceph", "osd", "set", "noscrub"], "delta": "0:00:00.067466", "end": "2016-09-01 21:15:46.833746", "item": "noscrub", "rc": 1, "start": "2016-09-01 21:15:46.766280", "warnings": []}
stderr: Error initializing cluster client: Error('error calling conf_read_file: error code 22',)
failed: [magna100 -> magna095] => (item=nodeep-scrub) => {"changed": true, "cmd": ["ceph", "osd", "set", "nodeep-scrub"], "delta": "0:00:00.067018", "end": "2016-09-01 21:15:47.069549", "item": "nodeep-scrub", "rc": 1, "start": "2016-09-01 21:15:47.002531", "warnings": []}
stderr: Error initializing cluster client: Error('error calling conf_read_file: error code 22',)

FATAL: all hosts have already failed -- aborting
 	




Version-Release number of selected component (if applicable):
============================================================
update from 10.2.2-38.el7cp.x86_64 to 10.2.2-39.el7cp.x86_64


How reproducible:
=================
always



Steps to Reproduce:
===================
1. Create a cluster via ceph-ansible having 3 MON, 3 OSD and 1 RGW node (10.2.2-38.el7cp.x86_64). make sure cluster name is other than 'ceph'


2. create repo fie on all nodes which points to 10.2.2-39.el7cp.x86_64 bits
3. Change the value of 'serial:' to adjust the number of server to be updated.
4. use rolling_update.yml to update all nodes

Actual results:
===============
TASK: [set osd flags] ********************************************************* 
failed: [magna100 -> magna095] => (item=noout) => {"changed": true, "cmd": ["ceph", "osd", "set", "noout"], "delta": "0:00:00.067159", "end": "2016-09-01 21:15:46.597727", "item": "noout", "rc": 1, "start": "2016-09-01 21:15:46.530568", "warnings": []}
stderr: Error initializing cluster client: Error('error calling conf_read_file: error code 22',)
failed: [magna100 -> magna095] => (item=noscrub) => {"changed": true, "cmd": ["ceph", "osd", "set", "noscrub"], "delta": "0:00:00.067466", "end": "2016-09-01 21:15:46.833746", "item": "noscrub", "rc": 1, "start": "2016-09-01 21:15:46.766280", "warnings": []}
stderr: Error initializing cluster client: Error('error calling conf_read_file: error code 22',)
failed: [magna100 -> magna095] => (item=nodeep-scrub) => {"changed": true, "cmd": ["ceph", "osd", "set", "nodeep-scrub"], "delta": "0:00:00.067018", "end": "2016-09-01 21:15:47.069549", "item": "nodeep-scrub", "rc": 1, "start": "2016-09-01 21:15:47.002531", "warnings": []}
stderr: Error initializing cluster client: Error('error calling conf_read_file: error code 22',)

FATAL: all hosts have already failed -- aborting


Expected results:
=================
update should work with each cluster name


Additional info:

Comment 4 seb 2016-09-13 09:44:43 UTC
Fix upstream https://github.com/ceph/ceph-ansible/pull/972

Comment 9 Rachana Patel 2016-10-06 13:38:26 UTC
rolling update is working for cluster name other than ceph hence moving to verified

Comment 13 errata-xmlrpc 2016-10-19 15:22:20 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2016:2082