Bug 1984880 - rolling upgrade to rhcs 4.2 z2 failing due to wrong mon host name
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Ceph Storage
Classification: Red Hat Storage
Component: Ceph-Ansible
Version: 4.2
Hardware: Unspecified
OS: Unspecified
Priority: unspecified
Severity: urgent
Target Milestone: ---
Target Release: 4.2z3
Assignee: Dimitri Savineau
QA Contact: Vasishta
Docs Contact: Aron Gunn
URL:
Whiteboard:
Depends On:
Blocks: 1890121
Reported: 2021-07-22 11:50 UTC by Avi Avraham
Modified: 2024-10-01 19:03 UTC (History)

Fixed In Version: ceph-ansible-4.0.62-1.el8cp, ceph-ansible-4.0.62-1.el7cp
Doc Type: Bug Fix
Doc Text:
.Rolling upgrade fails when Ceph containers are collocated
The `rolling_update.yml` Ansible playbook failed when the Ceph Monitor and Ceph Object Gateway daemons were collocated in containers and the multi-site Ceph Object Gateway was enabled. The failure occurred because the `radosgw-admin` commands could not execute while the Ceph Monitor container was stopped during the upgrade process. With this release, the multi-site Ceph Object Gateway code within the `ceph-handler` role is skipped during the upgrade process. As a result, the `rolling_update.yml` Ansible playbook runs successfully.
Clone Of:
Environment:
Last Closed: 2021-09-27 18:26:56 UTC
Embargoed:


Links
System | ID | Private | Priority | Status | Summary | Last Updated
Github | ceph ceph-ansible pull 6582 | 0 | None | closed | rolling_update: fix mon+rgw/multisite collocation (backport #6577) | 2021-07-27 21:04:02 UTC
Red Hat Issue Tracker | RHCEPH-620 | 0 | None | None | None | 2021-09-09 18:59:36 UTC
Red Hat Product Errata | RHBA-2021:3670 | 0 | None | None | None | 2021-09-27 18:27:31 UTC

Description Avi Avraham 2021-07-22 11:50:18 UTC
Description of problem:

We are attempting an upgrade from RHCS 4.2z1 to RHCS 4.2z2 in a multi-site environment, and we are facing an issue where the playbook fails.

The issue appears to be in the container_exec_cmd variable.
In a rolling update, container_exec_cmd takes its value from mon_host.

However, mon_host is set to a different mon than the one the task is running on, so container_exec_cmd gets the wrong value.

If my inventory is

[mons] 
Mon1 
Mon2 
Mon3

Then Ansible delegates to Mon1, but mon_host is equal to Mon3. The playbook fails when it tries to run Ceph commands.
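
To make the mismatch concrete, here is a minimal Python sketch of the situation described above. It is a model only, not ceph-ansible code: the helper names are hypothetical, and the `ceph-mon-<hostname>` container naming and the `podman` binary are assumptions made for illustration.

```python
# Hypothetical model of the mismatch in this report; not ceph-ansible code.
MONS = ["Mon1", "Mon2", "Mon3"]  # inventory order from the report


def container_exec_cmd(mon_host: str, container_binary: str = "podman") -> str:
    """Build the prefix used to run Ceph commands inside a mon container.

    Assumption: the mon container is named ceph-mon-<hostname>.
    """
    return f"{container_binary} exec ceph-mon-{mon_host}"


def exec_target_exists(delegate_host: str, mon_host: str) -> bool:
    """A container named ceph-mon-<mon_host> only runs on <mon_host> itself."""
    return delegate_host == mon_host


delegate_host = MONS[0]  # the task is delegated to Mon1
mon_host = MONS[2]       # but mon_host resolved to Mon3

cmd = container_exec_cmd(mon_host)
print(cmd)                                          # podman exec ceph-mon-Mon3
print(exec_target_exists(delegate_host, mon_host))  # False
```

With these values the command is executed on Mon1 but names the Mon3 container, which does not exist on that host, so any Ceph command run through it would fail.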

Version-Release number of selected component (if applicable):
RHCS 4.2z1 

How reproducible:
Run a rolling upgrade.

Steps to Reproduce:
1.
2.
3.

Actual results:
Upgrade aborts with an error in the task "add endpoints to their zone groups (s)."
 

Expected results:
The rolling update completes successfully.

Additional info:

Comment 22 errata-xmlrpc 2021-09-27 18:26:56 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Red Hat Ceph Storage 4.2 Bug Fix update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2021:3670

