Note: This bug is displayed in read-only format because the product is no longer active in Red Hat Bugzilla.

Bug 2249690

Summary: [FFU][DCN] ceph-ansible package is removed at the end of ceph adopt" stage during ceph upgrade
Product: Red Hat OpenStack Reporter: Marian Krcmarik <mkrcmari>
Component: openstack-tripleo-heat-templatesAssignee: Manoj Katari <mkatari>
Status: CLOSED ERRATA QA Contact: Marian Krcmarik <mkrcmari>
Severity: medium Docs Contact:
Priority: medium    
Version: 17.1 (Wallaby)CC: dhughes, erpeters, fpantano, gbrinn, gcharot, gregraka, jamsmith, joflynn, johfulto, mariel, mburns, mkatari
Target Milestone: z3Keywords: Triaged
Target Release: 17.1   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: openstack-tripleo-heat-templates-14.3.1-17.1.20231103010833.el9ost Doc Type: Bug Fix
Doc Text:
Before this update, after every successful Red Hat Ceph Storage adoption during upgrade, the `ceph-ansible` package was removed by default. + This update introduces the tag, `cleanup_cephansible`, to the task that removes `ceph-ansible`. You can use this tag with `--skip-tags` while running the adoption playbook to avoid removal.
Story Points: ---
Clone Of: Environment:
Last Closed: 2024-05-22 20:42:25 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 1997638    

Description Marian Krcmarik 2023-11-14 21:11:04 UTC
Description of problem:
The ceph-ansible rpm is always removed from the undercloud during the ceph adopt stage of FFU ceph upgrade procedure when executing following command:
openstack overcloud external-upgrade run ${EXTERNAL_ANSWER} \
    --stack ${STACK} \
    --skip-tags "ceph_health,opendev-validation,ceph_ansible_remote_tmp" \
    --tags cephadm_adopt  2>&1

DCN with storage consists of multiple ceph clusters which are upgrade one by one by running the upgrade procedure for each of the ceph cluster so there should be a way how to disable the removal of the ceph-ansible rpm always when ceph adopt is executed.

The workaround is to reinstall the ceph-ansible but one would not realize and ceph upgrade would fail at first.

The relevant THT code:
https://github.com/openstack/tripleo-heat-templates/blob/stable/wallaby/deployment/ceph-ansible/ceph-base.yaml#L733

Version-Release number of selected component (if applicable):
openstack-tripleo-heat-templates-14.3.1-17.1.20231103003743.e7c7ce3.el8ost.noarch

How reproducible:
Always

Steps to Reproduce:
1. Perform FFU of DCN env with storage with mulitple ceph clusters.
2. Execute ceph adopt as the last step.

Actual results:
The upgrade fails since relevant playbooks are not found.

Comment 1 John Fulton 2023-11-14 22:14:10 UTC
Please add a tag to the task which removes the ceph-ansible pacakge and then we'll update the DCN upgrade docs to pass that tag so that it's not removed.

Comment 6 Manoj Katari 2023-12-14 10:42:44 UTC
Hi Gareth, updated doc text and known issue looks good to me.

Comment 12 Manoj Katari 2024-05-02 07:39:44 UTC
Thanks Erin,

Updated Doc text looks good to me.

Comment 19 errata-xmlrpc 2024-05-22 20:42:25 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Moderate: openstack-tripleo-heat-templates and tripleo-ansible update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2024:2736