Bug 2249690 - [FFU][DCN] ceph-ansible package is removed at the end of ceph adopt" stage during ceph upgrade
Summary: [FFU][DCN] ceph-ansible package is removed at the end of ceph adopt" stage du...
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat OpenStack
Classification: Red Hat
Component: openstack-tripleo-heat-templates
Version: 17.1 (Wallaby)
Hardware: Unspecified
OS: Unspecified
medium
medium
Target Milestone: z3
: 17.1
Assignee: Manoj Katari
QA Contact: Marian Krcmarik
URL:
Whiteboard:
Depends On:
Blocks: 1997638
TreeView+ depends on / blocked
 
Reported: 2023-11-14 21:11 UTC by Marian Krcmarik
Modified: 2024-05-22 20:42 UTC (History)
12 users (show)

Fixed In Version: openstack-tripleo-heat-templates-14.3.1-17.1.20231103010833.el9ost
Doc Type: Bug Fix
Doc Text:
Before this update, after every successful Red Hat Ceph Storage adoption during upgrade, the `ceph-ansible` package was removed by default. + This update introduces the tag, `cleanup_cephansible`, to the task that removes `ceph-ansible`. You can use this tag with `--skip-tags` while running the adoption playbook to avoid removal.
Clone Of:
Environment:
Last Closed: 2024-05-22 20:42:25 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
OpenStack gerrit 900653 0 None MERGED Add tag to skip ceph-ansible removal 2023-11-17 00:44:39 UTC
Red Hat Issue Tracker OSP-30440 0 None None None 2023-11-14 21:14:05 UTC
Red Hat Issue Tracker RHOSPDOC-1554 0 None None None 2024-04-09 09:18:59 UTC
Red Hat Product Errata RHSA-2024:2736 0 None None None 2024-05-22 20:42:31 UTC

Description Marian Krcmarik 2023-11-14 21:11:04 UTC
Description of problem:
The ceph-ansible rpm is always removed from the undercloud during the ceph adopt stage of FFU ceph upgrade procedure when executing following command:
openstack overcloud external-upgrade run ${EXTERNAL_ANSWER} \
    --stack ${STACK} \
    --skip-tags "ceph_health,opendev-validation,ceph_ansible_remote_tmp" \
    --tags cephadm_adopt  2>&1

DCN with storage consists of multiple ceph clusters which are upgrade one by one by running the upgrade procedure for each of the ceph cluster so there should be a way how to disable the removal of the ceph-ansible rpm always when ceph adopt is executed.

The workaround is to reinstall the ceph-ansible but one would not realize and ceph upgrade would fail at first.

The relevant THT code:
https://github.com/openstack/tripleo-heat-templates/blob/stable/wallaby/deployment/ceph-ansible/ceph-base.yaml#L733

Version-Release number of selected component (if applicable):
openstack-tripleo-heat-templates-14.3.1-17.1.20231103003743.e7c7ce3.el8ost.noarch

How reproducible:
Always

Steps to Reproduce:
1. Perform FFU of DCN env with storage with mulitple ceph clusters.
2. Execute ceph adopt as the last step.

Actual results:
The upgrade fails since relevant playbooks are not found.

Comment 1 John Fulton 2023-11-14 22:14:10 UTC
Please add a tag to the task which removes the ceph-ansible pacakge and then we'll update the DCN upgrade docs to pass that tag so that it's not removed.

Comment 6 Manoj Katari 2023-12-14 10:42:44 UTC
Hi Gareth, updated doc text and known issue looks good to me.

Comment 12 Manoj Katari 2024-05-02 07:39:44 UTC
Thanks Erin,

Updated Doc text looks good to me.

Comment 19 errata-xmlrpc 2024-05-22 20:42:25 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Moderate: openstack-tripleo-heat-templates and tripleo-ansible update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2024:2736


Note You need to log in before you can comment on or make changes to this bug.