Description of problem: This was found in bz 2136393 initially. Updating overcloud by running the deployment command fails because of the following workflow error if octavia or ceph is enabled. ~~~ Waiting for messages on queue 'tripleo' with no timeout. The action raised an exception [action_ex_id=xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx, msg='[Errno 13] Permission denied: 'local_dir'', action_cls='<class 'mistral.actions.action_factory.DownloadConfigAction'>', attributes='{}', params='{'work_dir': '/var/lib/mistral/overcloud', 'container_config': 'overcloud-config'}'] ~~~ This is the regression caused by the fix for bz 2125078 . The change introduced the step to purge files in /var/lib/mistral/<stack name> to enforce the proper permission but the deployment tasks for octavia/ceph create files/directories owned by tripleo-admin in the directory and the cleanup process fails with permission error. Version-Release number of selected component (if applicable): How reproducible: Always Steps to Reproduce: 1. Deploy overcloud with Octavia enabled 2. Run the same deployment command Actual results: The 2nd deployment fails because of the workflow error Expected results: The 2nd deployment should not fail. Additional info:
*** Bug 2139138 has been marked as a duplicate of this bug. ***
Stack update is successful in a ceph deployment using: RHOS-16.2-RHEL-8-20221111.n.1
We deployed this fix [1] for Octavia in our lab with 16.2.3 and we were able to get a successful deployment. This was a fresh deployment and not an upgrade. However, running the deployment again (i.e. running the exact same overcloud deploy command after the successful deployment) resulted in Permission Denied errors on the ceph-ansible directory immediately after the stack deploy and before the ansible deploy: The action raised an exception [action_ex_id=0f0dc7ed-5415-4fca-a9bf-5828612ba391, msg='[Errno 13] Permission denied: '/var/lib/mistral/overcloud/ceph-ansible'', action_cls='<class 'mistral.actions.action_factory.DownloadConfigAction'>', attributes='{}', params='{'work_dir': '/var/lib/mistral/overcloud', 'container_config': 'overcloud-config'}']Overcloud Endpoint: https://10.74.169.211:13000 I assume this would be a problem during an upgrade, too. [1] https://review.opendev.org/c/openstack/tripleo-heat-templates/+/861945/1/deployment/octavia/octavia-deployment-config.j2.yaml#310
@Matthew That error is definitely what we are fixing in this bug. The package including the fix is not yet shipped in CDN. Note that the octavia error you earlier mentioned is also fixed in bug 2136393 , which is mentioned in the problem description of this bug.
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Release of components for Red Hat OpenStack Platform 16.2.4), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2022:8794
*** Bug 2149283 has been marked as a duplicate of this bug. ***