Bug 2278832

Summary: check for an nfs-container fails during ceph upgrade
Product: Red Hat OpenStack
Reporter: Kenny Tordeurs <ktordeur>
Component: openstack-tripleo-heat-templates
Assignee: Manoj Katari <mkatari>
Status: CLOSED ERRATA
QA Contact: Joe H. Rahme <jhakimra>
Severity: medium
Docs Contact:
Priority: medium
Version: 17.1 (Wallaby)
CC: astupnik, fpantano, gfidente, gouthamr, jelle.hoylaerts.ext, jelynch, johfulto, jslagle, kgilliga, madgupta, mariel, mburns, mkatari
Target Milestone: z4
Keywords: Triaged
Target Release: 17.1
Hardware: x86_64
OS: Linux
Whiteboard:
Fixed In Version: openstack-tripleo-heat-templates-14.3.1-17.1.20240826190755.e7c7ce3.el9ost openstack-tripleo-validations-14.3.2-17.1.20240828162215.2b526f8.el9ost
Doc Type: Enhancement
Doc Text:
This enhancement introduces a validation to verify that the appropriate CephFS-NFS resources are enabled when preparing the overcloud upgrade from RHOSP 16.2. This validation only runs when the use of the Shared File Systems service (manila) with CephFS-NFS is detected in the environment. This validation prevents the CephFS-NFS service from being inadvertently omitted from the upgrade.
Story Points: ---
Clone Of:
Environment:
Last Closed: 2024-11-21 09:40:16 UTC
Type: Bug
Regression: ---
Mount Type: ---
Documentation: ---
CRM:
Verified Versions:
Category: ---
oVirt Team: ---
RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: ---
Target Upstream Version:
Embargoed:
Attachments:
Description Flags
tripleo_hf.tar none

Description Kenny Tordeurs 2024-05-03 09:25:26 UTC
Description of problem:
The ansible run fails at the following task:

2024-04-22 15:59:04,667 p=376556 u=root n=ansible | TASK [ceph-container-common : inspect ceph nfs container] **********************

2024-04-22 15:59:04,668 p=376556 u=root n=ansible | Monday 22 April 2024  15:59:04 +0200 (0:00:00.122)       0:21:36.507 **********

2024-04-22 15:59:05,455 p=376556 u=root n=ansible | fatal: [openstack]: FAILED! =>

{"changed": false, "cmd": ["podman", "inspect", "220b9108465a"], "delta": "0:00:00.288657", "end": "2024-04-22 15:59:05.411620", "msg": "non-zero return code", "rc": 125, "start": "2024-04-22 15:59:05.122963", "stderr": "Error: error inspecting object: no such object: \"220b9108465a\"", "stderr_lines": ["Error: error inspecting object: no such object: \"220b9108465a\""], "stdout": "[] ", "stdout_lines": ["[]"]}

Version-Release number of selected component (if applicable):
OSP 16.2

How reproducible:
uncertain
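
For triage, the failed task result above can be classified programmatically. The following is a minimal sketch, not part of the fix: it assumes the JSON result has been copied out of the Ansible log, and the helper names `is_missing_container` and `container_id` are hypothetical. Because podman uses exit code 125 for general errors, the sketch also checks the `stderr` text rather than relying on `rc` alone.

```python
import json

# Abbreviated copy of the failed task result from the log in this report.
RAW_RESULT = r'''
{"changed": false,
 "cmd": ["podman", "inspect", "220b9108465a"],
 "msg": "non-zero return code",
 "rc": 125,
 "stderr": "Error: error inspecting object: no such object: \"220b9108465a\"",
 "stdout": "[]"}
'''


def is_missing_container(result: dict) -> bool:
    """Return True if the result looks like 'podman inspect' on a container that no longer exists."""
    # rc 125 alone is not specific enough, so the stderr text is checked as well.
    return result.get("rc") == 125 and "no such object" in result.get("stderr", "")


def container_id(result: dict) -> str:
    """Return the last argument of the recorded command, assumed here to be the container ID."""
    return result["cmd"][-1]


result = json.loads(RAW_RESULT)
if is_missing_container(result):
    print(f"container {container_id(result)} was not found by podman inspect")
```

A result with `rc` 0, or with a different error in `stderr`, is not treated as a missing container.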

Comment 2 Kenny Tordeurs 2024-05-03 09:25:51 UTC
To work around the issue, you can run the following command, but this is not ideal for production because it causes downtime:
~~~
pcs resource disable ceph-nfs
~~~

Comment 12 John Fulton 2024-05-08 14:05:08 UTC
*** Bug 2278873 has been marked as a duplicate of this bug. ***

Comment 13 Manoj Katari 2024-05-14 06:22:37 UTC
@fpantano Thanks for the clarification. I will use this bug to add the validation.
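
The logic of such a validation, as described in the Doc Text of this bug, can be sketched roughly as follows. This is an illustration under stated assumptions, not the code that shipped in openstack-tripleo-validations: the function name `validate_ceph_nfs` is hypothetical, the resource name `OS::TripleO::Services::CephNfs` and the `OS::Heat::None` "disabled" mapping are assumptions about how the environment is inspected, and the detection of manila with CephFS-NFS is reduced to a boolean input.

```python
from typing import List, Mapping

# Assumption: the CephFS-NFS service is enabled through this resource_registry entry.
CEPH_NFS_RESOURCE = "OS::TripleO::Services::CephNfs"
# Assumption: a service mapped to this value counts as disabled.
DISABLED = "OS::Heat::None"


def validate_ceph_nfs(resource_registry: Mapping[str, str],
                      manila_cephfs_nfs_in_use: bool) -> List[str]:
    """Return a list of error messages; an empty list means passed or skipped."""
    # The validation only applies when manila with CephFS-NFS is detected.
    if not manila_cephfs_nfs_in_use:
        return []
    mapped = resource_registry.get(CEPH_NFS_RESOURCE, DISABLED)
    if mapped == DISABLED:
        return [f"{CEPH_NFS_RESOURCE} is not enabled, but manila with CephFS-NFS is in use"]
    return []


# Example: CephFS-NFS is in use but the resource is missing from the registry.
for error in validate_ceph_nfs({}, manila_cephfs_nfs_in_use=True):
    print(error)
```

Returning a list of messages instead of raising keeps the sketch easy to call from a larger pre-upgrade check that aggregates several results.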

Comment 21 Manoj Katari 2024-10-16 12:34:58 UTC
Hi Katie,

Updated doc text looks good to me.

Comment 22 Alex Stupnikov 2024-11-11 13:15:09 UTC
Created attachment 2057016
tripleo_hf.tar

Comment 27 errata-xmlrpc 2024-11-21 09:40:16 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (RHOSP 17.1.4 bug fix and enhancement advisory), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2024:9974