1845331 – Message collection size is too large for Zaqar

Bug 1845331 - Message collection size is too large for Zaqar

Summary: Message collection size is too large for Zaqar

Keywords:
Status:	CLOSED ERRATA
Alias:	None
Product:	Red Hat OpenStack
Classification:	Red Hat
Component:	instack-undercloud
Sub Component:
Version:	13.0 (Queens)
Hardware:	Unspecified
OS:	Unspecified
Priority:	medium
Severity:	medium
Target Milestone:	---
Target Release:	---
Assignee:	Adriano Petrich
QA Contact:	David Rosenfeld
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+	depends on / blocked

Reported:	2020-06-08 22:05 UTC by Brendan Shephard
Modified:	2023-12-15 18:06 UTC (History)
CC List:	8 users (show)
Fixed In Version:	instack-undercloud-8.4.9-10.el7ost
Doc Type:	If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed:	2020-10-28 18:23:50 UTC
Target Upstream Version:
Embargoed:

Attachments	(Terms of Use)

Links
System	ID	Private	Priority	Status	Summary	Last Updated
Red Hat Issue Tracker	OSP-29467	0	None	None	None	2023-10-06 20:32:26 UTC
Red Hat Product Errata	RHBA-2020:4388	0	None	None	None	2020-10-28 18:24:09 UTC

Description Brendan Shephard 2020-06-08 22:05:49 UTC

Description of problem:
After applying the patch from this BZ:
https://bugzilla.redhat.com/show_bug.cgi?id=1712278
https://review.opendev.org/#/c/680688/
https://review.opendev.org/#/c/663688/
https://access.redhat.com/errata/RHBA-2019:3794

We're still hitting this issue and it required us to increase the following:
sudo crudini --set /etc/zaqar/zaqar.conf transport max_messages_post_size 2097152
sudo crudini --set /etc/zaqar/zaqar.conf oslo_messaging_kafka producer_batch_size 32768
sudo crudini --set /etc/mistral/mistral.conf engine execution_field_size_limit_kb 32768

We hit this during the update converge step for a minor update:
https://bugzilla.redhat.com/show_bug.cgi?id=1712278#c11


Version-Release number of selected component (if applicable):
openstack-tripleo-common-8.7.1-15.el7ost.noarch.rpm 

How reproducible:
Difficult to reproduce. 

Possibly Mistral reporting messages from the ceph-ansible deployment and exceeding the message size?

Actual results:
The overcloud converge fails without much info to tell us why. Until we look in the Mistral logs and can see the error message:
ActionException: ZaqarAction.queue_post failed: Error response from Zaqar. Code: 400. Title: Invalid API request. Description: Message collection size is too large. Max size 1048576.


Expected results:
Either we avoid posting large messages to Zaqar, or increase the sizes here by default to cover such scenarios where the messages coming in might be quite large.

Additional info:

Comment 18 errata-xmlrpc 2020-10-28 18:23:50 UTC

Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Red Hat OpenStack Platform 13.0 director bug fix advisory), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2020:4388

Note You need to log in before you can comment on or make changes to this bug.