Bug 2257738

Summary: Upgrade from 16.2 to 17.1 fails on Ceph nodes due to missing repo
Product: Red Hat OpenStack Reporter: nalmond
Component: distributionAssignee: OSP Team <rhos-maint>
Status: CLOSED ERRATA QA Contact: Lon Hohberger <lhh>
Severity: high Docs Contact:
Priority: urgent    
Version: 17.1 (Wallaby)CC: alfrgarc, dhill, dhughes, eharney, fpantano, jelle.hoylaerts.ext, jjoyce, johfulto, jschluet, jsoliman, jveiraca, ktordeur, lbezdick, madgupta, mariel, mburns, mflusche, mgarciac, morazi, rhos-maint, shrjoshi, tpetr
Target Milestone: z3Keywords: Triaged
Target Release: 17.1   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2024-05-22 21:04:11 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description nalmond 2024-01-10 16:51:50 UTC
Description of problem:
Following https://access.redhat.com/documentation/en-us/red_hat_openstack_platform/17.1/html-single/framework_for_upgrades_16.2_to_17.1/index#upgrading-to-ceph-storage-5-upgrading-ceph, section 5.2 step 5 fails on all non-hci ceph nodes trying to ensure the python3-heat-agent* rpms are installed:

2023-12-21 23:47:43,995 p=140032 u=stack n=ansible | 2023-12-21 23:47:43.995372 | d08e79c1-ae3a-8215-0682-0000000000e6 |      FATAL | Ensure TripleO prerequisite packages are installed | os-ceph-srv-01 | error={"changed": false, "failures": ["python3-heat-agent* All matches were filtered out by exclude filtering for argument: python3-heat-agent*"], "msg": "Failed to install some of the specified packages", "rc": 1, "results": []}

These packages are provided by openstack-17.1-for-rhel-8-x86_64-rpms, but these ceph nodes were deployed with the minimal overcloud images and do not have access to this repo.

How reproducible:
Consistently on director-deployed Ceph nodes.

Steps to Reproduce:
1. Deploy RHOSP 16.2 with ceph using minimal overcloud images
2. Attempt to upgrade to RHOSP 17.1
3.

Actual results:
Ceph playbook fails

Expected results:
Ceph playbook completes and upgrade is able to continue

Additional info:
The error does not show this but I suspect these packages are also impacted:

          - openstack-selinux
          - os-net-config
          - puppet-tripleo
          - python3-heat-agent*

Comment 6 Lukas Bezdicka 2024-01-19 16:08:38 UTC
I think docs are clear on this - first you upgrade to RHCEPH5 https://access.redhat.com/documentation/en-us/red_hat_openstack_platform/17.1/html-single/framework_for_upgrades_16.2_to_17.1/index#upgrading-an-overcloud-with-director-deployed-ceph-deployments_preparing-overcloud
After the whole upgrade there is separate procedure to get to RHCEPH6 https://access.redhat.com/documentation/en-us/red_hat_openstack_platform/17.1/html-single/framework_for_upgrades_16.2_to_17.1/index#assembly_ceph-5-to-6_upgrade_post-upgrade 

Only issue is openstack tools repo was not available on EL8 which is worked around by manually providing the packages to the nodes. Having RHCEP6 on EL8 is not supported.

Comment 13 John Fulton 2024-01-23 13:04:14 UTC
The documentation has been updated so that chapter 4.1 [1] step 8iii builds a containers-prepare-parameter.yaml with the correct ceph_image

[1] https://access.redhat.com/documentation/en-us/red_hat_openstack_platform/17.1/html/framework_for_upgrades_16.2_to_17.1/performing-the-overcloud-adoption_overcloud-adoption#running-the-overcloud-upgrade-preparation_overcloud-adoption

Comment 49 errata-xmlrpc 2024-05-22 21:04:11 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Red Hat OpenStack Platform 17.1.3 bug fix and enhancement advisory), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2024:2742

Comment 50 Red Hat Bugzilla 2024-09-20 04:25:18 UTC
The needinfo request[s] on this closed bug have been removed as they have been unresolved for 120 days