Bug 2145269

Summary: controller node replacement fails with internal ceph
Product: Red Hat OpenStack
Component: tripleo-ansible
Version: 17.0 (Wallaby)
Status: CLOSED ERRATA
Severity: medium
Priority: medium
Hardware: Unspecified
OS: Unspecified
Reporter: John Fulton <johfulto>
Assignee: John Fulton <johfulto>
QA Contact: David Rosenfeld <drosenfe>
CC: jschluet
Keywords: Triaged
Fixed In Version: tripleo-ansible-3.3.1-0.20221123230736.fa5422f.el9ost
Type: Bug
Last Closed: 2023-01-25 12:29:58 UTC

Description John Fulton 2022-11-23 16:59:51 UTC
When replacing a controller node which uses internal ceph, the stack update fails. The ansible log shows a command failing with the following message:

"Error: statfs /home/ceph-admin/specs/ceph_spec.yaml: no such file or directory"

version used: tripleo-ansible-3.3.1-0.20220720020866.fa5422f.el9ost.noarch
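
To check quickly whether an installed build predates the fix, the build stamps in the two package strings can be compared. The sketch below is only an illustration: the helper names are hypothetical, and it assumes the 14-digit field in the release string sorts in build order. It is not a full RPM version comparison.

```python
import re


def build_stamp(package):
    """Extract the 14-digit build stamp from a tripleo-ansible package string.

    Assumption: the release field looks like '0.<14 digits>.<git hash>...'.
    """
    match = re.search(r"-\d+\.(\d{14})\.", package)
    if match is None:
        raise ValueError("no build stamp found in %r" % package)
    return int(match.group(1))


def predates_fix(installed, fixed_in):
    """Return True when the installed build stamp is older than the fixed one."""
    return build_stamp(installed) < build_stamp(fixed_in)


if __name__ == "__main__":
    installed = "tripleo-ansible-3.3.1-0.20220720020866.fa5422f.el9ost.noarch"
    fixed_in = "tripleo-ansible-3.3.1-0.20221123230736.fa5422f.el9ost"
    print(predates_fix(installed, fixed_in))
```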

Comment 1 John Fulton 2022-11-23 17:05:59 UTC
root cause:

tripleo-ansible-3.3.1-0.20220720020866.fa5422f.el9ost.noarch is missing this patch:

  https://review.opendev.org/c/openstack/tripleo-ansible/+/858637/4/tripleo_ansible/roles/tripleo_cephadm/tasks/post.yaml

In our testing scenario controller-3 replaces controller-0, and the spec file it's trying to mount exists only on the missing controller-0, not on controller-3. However, we don't need to mount that spec file, so we can remove the directive to do so.
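
The failure mode can be sketched in a few lines: a mount source that exists on one host is absent on its replacement. The helper below is hypothetical and is not the tripleo-ansible code. It only shows a pre-flight check that lists which mount sources are missing on the current host. The actual fix removes the mount rather than guarding it.

```python
import os
import tempfile


def missing_mount_sources(paths):
    """Return the mount source paths that do not exist on this host."""
    return [path for path in paths if not os.path.exists(path)]


if __name__ == "__main__":
    # Simulate a replacement node: one file is present, the spec file is not.
    with tempfile.TemporaryDirectory() as workdir:
        present = os.path.join(workdir, "ceph.conf")
        open(present, "w").close()
        absent = os.path.join(workdir, "specs", "ceph_spec.yaml")
        for path in missing_mount_sources([present, absent]):
            print("mount source missing on this host:", path)
```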

The above patch was written to help with upgrades in 17.1, but it's also relevant for controller replacement in 17.0. Thus, I'll cherry-pick it to 17.0.

Comment 13 errata-xmlrpc 2023-01-25 12:29:58 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Red Hat OpenStack Platform 17.0.1 bug fix and enhancement advisory), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2023:0271