Bug 2156886 - OSPdO 17.0 After a controller reset , Galera is an unhealthy state [NEEDINFO]
Summary: OSPdO 17.0 After a controller reset , Galera is an unhealthy state
Keywords:
Status: NEW
Alias: None
Product: Red Hat OpenStack
Classification: Red Hat
Component: osp-director-operator-container
Version: 17.0 (Wallaby)
Hardware: x86_64
OS: Linux
medium
medium
Target Milestone: ---
: ---
Assignee: Damien Ciabrini
QA Contact: pkomarov
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2022-12-29 08:36 UTC by pkomarov
Modified: 2023-08-03 15:46 UTC (History)
3 users (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed:
Target Upstream Version:
Embargoed:
pkomarov: needinfo? (dciabrin)
ifrangs: needinfo? (dciabrin)


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Red Hat Issue Tracker OSP-21059 0 None None None 2022-12-29 08:39:26 UTC

Description pkomarov 2022-12-29 08:36:17 UTC
Description of problem:
after a fresh OSPdO 17.0 deployment Galera is an unhealthy state: 

  * Container bundle set: galera-bundle [cluster.common.tag/mariadb:pcmklatest]:
    * galera-bundle-0	(ocf:heartbeat:galera):	 Unpromoted controller-1
    * galera-bundle-1	(ocf:heartbeat:galera):	 FAILED Promoted controller-2 (blocked)
    * galera-bundle-2	(ocf:heartbeat:galera):	 Unpromoted controller-0
 

Version-Release number of selected component (if applicable):


 DIRECTOR_OPERATOR_CSV_VERSION=17.0.35-17.0

osp_release_auto_version: 17.0-RHEL-9

osp_release_defaults:
  base_image_url: http://download.devel.redhat.com/brewroot/packages/rhel-guest-image/9.0/20221216.0/images/rhel-guest-image-9.0-20221216.0.x86_64.qcow2

Additional info:
sosreports,all overcloud_nodes /var/log, are at :  http://file.tlv.redhat.com/~pkomarov/sos_reports_2126730

Comment 2 pkomarov 2022-12-29 13:51:28 UTC
I did a pcs cluster restart on all controllers , and reran the HA controller reboot tests 
Now it did not reproduce : 
    * galera-bundle-0   (ocf:heartbeat:galera):  Promoted controller-1
    * galera-bundle-1   (ocf:heartbeat:galera):  Promoted controller-2
    * galera-bundle-2   (ocf:heartbeat:galera):  Promoted controller-0
Not sure what is the root cause though..


Note You need to log in before you can comment on or make changes to this bug.