Description of problem: HAProxy should use clustercheck for galera nodes health checks in order to get valid status of the db servers. Version-Release number of selected component (if applicable): openstack-puppet-modules-2015.1.7-2.el7ost.noarch Additional info: https://review.openstack.org/#/c/194960/2
On HA environment: galera_start failed after rebooting of controller_0 : pcs status : ------------- Failed actions: openstack-cinder-volume_start_0 on overcloud-controller-2 'not running' (7): call=314, status=complete, exit-reason='none', last-rc-change='Wed Jun 24 15:50:47 2015', queued=2001ms, exec=4ms galera_start_0 on overcloud-controller-0 'unknown error' (1): call=216, status=Timed Out, exit-reason='none', last-rc-change='Wed Jun 24 15:47:30 2015', queued=0ms, exec=120003ms redis_start_0 on overcloud-controller-0 'unknown error' (1): call=219, status=complete, exit-reason='none', last-rc-change='Wed Jun 24 15:49:35 2015', queued=0ms, exec=21910ms openstack-nova-scheduler_start_0 on overcloud-controller-0 'not running' (7): call=236, status=complete, exit-reason='none', last-rc-change='Wed Jun 24 15:50:30 2015', queued=2001ms, exec=2ms openstack-nova-consoleauth_start_0 on overcloud-controller-0 'not running' (7): call=238, status=complete, exit-reason='none', last-rc-change='Wed Jun 24 15:50:34 2015', queued=2001ms, exec=5ms openstack-cinder-api_start_0 on overcloud-controller-0 'not running' (7): call=242, status=complete, exit-reason='none', last-rc-change='Wed Jun 24 15:50:39 2015', queued=2002ms, exec=5ms neutron-server_start_0 on overcloud-controller-0 'not running' (7): call=246, status=complete, exit-reason='none', last-rc-change='Wed Jun 24 15:50:46 2015', queued=2001ms, exec=3ms openstack-cinder-volume_start_0 on overcloud-controller-1 'not running' (7): call=325, status=complete, exit-reason='none', last-rc-change='Wed Jun 24 15:50:41 2015', queued=2001ms, exec=2ms PCSD Status: overcloud-controller-0: Online overcloud-controller-1: Online overcloud-controller-2: Online
The puppet-tripleo change should be included in openstack-puppet-modules-2015.1.7-5.el7ost
Verified: openstack-tripleo-heat-templates-0.8.6-19.el7ost.noarch from sudo vi /etc/haproxy/haproxy.cfg listen cinder bind 192.168.0.6:8776 option httpchk GET / server overcloud-controller-0 192.168.0.11:8776 check fall 5 inter 2000 rise 2 server overcloud-controller-1 192.168.0.12:8776 check fall 5 inter 2000 rise 2 server overcloud-controller-2 192.168.0.10:8776 check fall 5 inter 2000 rise 2 [heat-admin@overcloud-controller-1 ~]$ sudo grep httpchk /etc/haproxy/haproxy.cfg option httpchk GET / option httpchk GET / option httpchk GET / option httpchk GET / option httpchk GET / option httpchk GET / option httpchk GET / option httpchk GET / option httpchk option httpchk GET / option httpchk GET / option httpchk GET / option httpchk GET / option httpchk GET / option httpchk GET /info
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHEA-2015:1549