1311005 – rabbitmq-cluster resource instance may not be able to rejoin the rabbitmq cluster.

Keywords:
Status:	CLOSED ERRATA
Alias:	None
Product:	Red Hat OpenStack
Classification:	Red Hat
Component:	openstack-tripleo-heat-templates
Sub Component:
Version:	8.0 (Liberty)
Hardware:	All
OS:	All
Priority:	urgent
Severity:	urgent
Target Milestone:	---
Target Release:	8.0 (Liberty)
Assignee:	Jiri Stransky
QA Contact:	Leonid Natapov
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:	1311025

Reported:	2016-02-23 07:41 UTC by Michele Baldessari
Modified:	2016-04-07 21:48 UTC
CC List:	3 users
Fixed In Version:	openstack-tripleo-heat-templates-0.8.10-1.el7ost
Doc Type:	Bug Fix
Doc Text:	The notify=true parameter was previously missing from the RabbitMQ Pacemaker resource. Consequently, RabbitMQ instances were sometimes unable to rejoin the RabbitMQ cluster. This update adds support for notify=true to the Pacemaker resource agent for RabbitMQ, and adds notify=true to OpenStack director. As a result, RabbitMQ instances are now able to rejoin the RabbitMQ cluster.
Clone Of:
Clones:	1311025
Environment:
Last Closed:	2016-04-07 21:48:01 UTC
Target Upstream Version:
Embargoed:

Links
System	ID	Private	Priority	Status	Summary	Last Updated
OpenStack gerrit	283461	0	None	None	None	2016-02-23 09:24:05 UTC
Red Hat Product Errata	RHEA-2016:0604	0	normal	SHIPPED_LIVE	Red Hat OpenStack Platform 8 director Enhancement Advisory	2016-04-08 01:03:56 UTC

Description Michele Baldessari 2016-02-23 07:41:59 UTC

Description of problem:
Sometimes, when a controller is rebooted, the node is unable to rejoin the
rabbitmq cluster. Fixing this issue requires both of the following:

1) The resource agent needs the fix tracked in BZ https://bugzilla.redhat.com/show_bug.cgi?id=1247303 (see that BZ for more history on this issue).

2) The meta parameter "notify=true" needs to be added to the rabbitmq resource on fresh installs and on updates.


Note that if the fix to THT (openstack-tripleo-heat-templates) lands before the
fix to the resource agent, nothing changes (i.e. things keep behaving as before).
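
For an already-deployed cluster, the meta parameter from item 2 could in principle be set by hand with pcs. The following is a minimal sketch, not the change made in the templates: the function name enable_notify and the PCS override variable are hypothetical, and the resource name rabbitmq is taken from the pcs output in the verification comment of this report.

```shell
#!/bin/sh
# Sketch only: set notify=true on an existing Pacemaker resource.
# PCS is a hypothetical override so the command can be previewed,
# e.g. PCS=echo prints the arguments instead of running pcs.
PCS="${PCS:-pcs}"

enable_notify() {
    # Resource name defaults to "rabbitmq" (assumption based on this report).
    resource="${1:-rabbitmq}"
    # "pcs resource update <id> meta <name>=<value>" sets a meta attribute.
    $PCS resource update "$resource" meta notify=true
}

# Usage (on a controller node, as root):
#   enable_notify rabbitmq
```

As noted above, setting the meta parameter alone is expected to change nothing until the resource agent fix is also installed.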

Comment 5 Leonid Natapov 2016-03-16 12:24:27 UTC

openstack-tripleo-heat-templates-0.8.10-2.el7ost

[root@overcloud-controller-0 heat-admin]# pcs resource show rabbitmq
 Resource: rabbitmq (class=ocf provider=heartbeat type=rabbitmq-cluster)
  Attributes: set_policy="ha-all ^(?!amq\.).* {"ha-mode":"all"}" 
  Meta Attrs: notify=true 
  Operations: start interval=0s timeout=100 (rabbitmq-start-interval-0s)
              stop interval=0s timeout=90 (rabbitmq-stop-interval-0s)
              monitor interval=10 timeout=40 (rabbitmq-monitor-interval-10)
[root@overcloud-controller-0 heat-admin]#
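
The same check can be scripted. This is a minimal sketch that assumes the "Meta Attrs:" output format shown above; the function name has_notify_true is hypothetical.

```shell
#!/bin/sh
# Sketch only: read "pcs resource show <name>" output on stdin and
# succeed (exit 0) if the "Meta Attrs:" line lists notify=true.
has_notify_true() {
    grep -Eq 'Meta Attrs:.*[[:space:]]notify=true([[:space:]]|$)'
}

# Usage (on a controller node, as root):
#   pcs resource show rabbitmq | has_notify_true && echo "notify=true is set"
```

The pattern requires whitespace before notify=true and whitespace or end of line after it, so a value such as notify=false or a longer attribute name ending in "notify" does not match.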

Comment 7 errata-xmlrpc 2016-04-07 21:48:01 UTC

Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://rhn.redhat.com/errata/RHEA-2016-0604.html
