Description of problem: Reported by Florella fyanac. Every time we try to restart galera resource with "pcs resource restart ..." it fails and we see this in the logs: http://pastebin.test.redhat.com/1069412 It happens on one galera node. Version-Release number of selected component (if applicable): How reproducible: Steps to Reproduce: 1. 2. 3. Actual results: galera is failed on one node: * galera-bundle-2 (ocf:heartbeat:galera): FAILED Promoted controller-0 Expected results: Galera cluster is healthy - all galera resources are in 'Promoted' state. Additional info:
This has been tracked down to a corruption of galera gcache file, preventing one of the nodes from starting. We will try to figure out a way to deal with this condition, maybe in the pacemaker resource agent.
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (Red Hat OpenStack Platform 17.0.1 bug fix and enhancement advisory), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2023:0271