Description of problem: All existing amphora instances are unexpectedly deleted when RabbitMQ is accidentally down. Unfortunately, delete requests will fail because RabbitMQ is down, and then all amphora instances cannot be failed over. Recovery of amphorae cannot is not possible without recreating all load balancers. This is a critical bug and we have to avoid this critical situation even when RabbitMQ is not operational. How reproducible: 100% Steps to Reproduce: 1. configure octavia to use a queue_event_streamer 2. create load balancers 3. shutdown RabbitMQ 4. wait a few seconds/minutes Actual results: all amphorae get deleted Expected results: all amphorae would continue running Additional info: this bug was first raised upstream: https://storyboard.openstack.org/#!/story/2002937
Patch merged upstream in master. Backport to stable/queens under review now.
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHSA-2019:0567