There is validation framework that can be used to validate the stack. So let's use it. - Use the validation framework to ensure that the environment is in a good shape to back it up. - Use the validation framework once control-plane or the undercloud were restored to validate that everything is ok.
This feature groups the following existing validations under the 'backup-and-restore' validation group: tripleo-validations: * ceph-health * container-status * healthcheck-service-status * image-serve * neutron-sanity-check * nova-status * pacemaker-status * undercloud-service-status validations-common: * check-cpu * service-status * validate-selinux
Polarian Testcase: https://polarion.engineering.redhat.com/polarion/#/project/RHELOpenStackPlatform/workitem?id=RHELOSP-148687
Backup and restore validation group executes. The one failed validation is a problem with the validator and is tracked by: https://bugzilla.redhat.com/show_bug.cgi?id=2187307 validation run --group backup-and-restore -i /home/stack/config-download/overcloud/tripleo-ansible-inventory.yaml Validation config file found: /etc/validation.cfg +--------------------------------------+----------------------------+--------+--------------------------+----------------------------------------------------------------------------------------------------+-------------------+-------------+------------------------------------+ | UUID | Validations | Status | Host_Group | Status_by_Host | Unreachable_Hosts | Duration | Reasons | +--------------------------------------+----------------------------+--------+--------------------------+----------------------------------------------------------------------------------------------------+-------------------+-------------+------------------------------------+ | d76e974a-ccb3-42ff-a722-a2baa065e867 | undercloud-service-status | PASSED | undercloud | undercloud | | 0:00:00.493 | | | 17e265d0-bf0d-4e0a-8b64-cc16017a260b | neutron-sanity-check | PASSED | Controller | controller-0, controller-1, controller-2 | | 0:00:02.359 | | | 235a997d-1d1f-4601-8787-c9c3f0573668 | healthcheck-service-status | PASSED | undercloud, allovercloud | ceph-0, ceph-1, ceph-2, compute-0, compute-1, controller-0, controller-1, controller-2, undercloud | | 0:00:02.053 | | | feff035c-1e88-45da-b23b-a559fba50e5a | nova-status | PASSED | nova_api | controller-0, controller-1, controller-2 | | 0:00:03.419 | | | 52440320-0610-486d-a78b-b41e5f0abecc | ceph-health | FAILED | ceph_mon | controller-0, controller-1, controller-2 | | 0:00:02.606 | controller-0: non-zero return code | | | | | | | | | , | | | | | | | | | controller-1: non-zero return code | | | | | | | | | , | | | | | | | | | controller-2: non-zero return code | | | | | | | | | | | 11f90c48-6233-47b9-89a3-ac6ce5d36bea | check-cpu | PASSED | localhost | localhost | | 0:00:01.000 | | | 52065a8a-f92c-4f75-af85-1acf696bd88d | service-status | PASSED | all | ceph-0, ceph-1, ceph-2, compute-0, compute-1, controller-0, controller-1, controller-2, undercloud | | 0:00:09.558 | | | 6f01e6bf-7ff1-429c-b11a-62785ceec6b4 | image-serve | PASSED | undercloud | undercloud | | 0:00:01.776 | | | 2f5fec69-4dcf-4ddd-9b2f-dd2db4818f5a | pacemaker-status | PASSED | Controller | controller-0, controller-1, controller-2 | | 0:00:02.038 | | | 82464b6a-749a-429f-aa45-f28af8916892 | validate-selinux | PASSED | all | ceph-0, ceph-1, ceph-2, compute-0, compute-1, controller-0, controller-1, controller-2, undercloud | | 0:00:06.499 | | | 57faad95-7dd3-4665-9263-67fd03c101f5 | container-status | PASSED | undercloud, allovercloud | ceph-0, ceph-1, ceph-2, compute-0, compute-1, controller-0, controller-1, controller-2, undercloud | | 0:00:02.704 | | +--------------------------------------+----------------------------+--------+--------------------------+----------------------------------------------------------------------------------------------------+-------------------+-------------+------------------------------------+ One or more validations have failed.
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Release of components for Red Hat OpenStack Platform 17.1 (Wallaby)), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHEA-2023:4577