This bug has been migrated to another issue tracking site. It has been closed here and may no longer be monitored.

If you would like to get updates for this issue, or to participate in it, you may do so at the Red Hat Issue Tracker.
Bug 1812379 - [RFE] Octavia failover circuit breaker
Summary: [RFE] Octavia failover circuit breaker
Keywords:
Status: CLOSED MIGRATED
Alias: None
Product: Red Hat OpenStack
Classification: Red Hat
Component: openstack-octavia
Version: 17.0 (Wallaby)
Hardware: Unspecified
OS: Unspecified
Priority: high
Severity: high
Target Milestone: ---
Target Release: ---
Assignee: Tom Weininger
QA Contact: Bruna Bonguardo
URL:
Whiteboard:
Depends On:
Blocks:
Reported: 2020-03-11 08:11 UTC by Gregory Thiemonge
Modified: 2023-09-26 17:52 UTC
CC List: 10 users

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2023-09-26 17:51:42 UTC
Target Upstream Version:
Embargoed:


Links
System                 ID         Private  Priority  Status  Summary                                           Last Updated
OpenStack Storyboard   2005604    0        None      None    None                                              2020-03-11 16:32:22 UTC
OpenStack gerrit       656811     0        None      MERGED  Failover stop threshold / circuit breaker         2022-09-20 09:33:59 UTC
OpenStack gerrit       835049     0        None      MERGED  Add new amphora failover stopped status           2022-09-20 09:34:02 UTC
OpenStack gerrit       837878     0        None      NEW     Add scenario tests for failover circuit breaker   2022-12-15 12:17:09 UTC
OpenStack gerrit       843285     0        None      NEW     Add support for Octavia failover circuit breaker  2022-12-15 12:17:12 UTC
Red Hat Issue Tracker  OSP-29003  0        None      None    None                                              2023-09-26 17:52:04 UTC
Red Hat Issue Tracker  OSP-6568   0        None      None    None                                              2023-09-26 17:51:41 UTC

Description Gregory Thiemonge 2020-03-11 08:11:56 UTC
This bug was initially created as a copy of Bug #1774459



The customer requests that RHOSP13 include the "Octavia failover circuit breaker"
feature, as follows:

~~~
  When a disaster (e.g. loss of amphora boot network reachability) happens,
  the Octavia health manager tries to execute a failover for every amphora, and these failovers fail.

  We need to prepare for disaster situations before they actually happen.

  As discussed in this story https://storyboard.openstack.org/#!/story/2005604,
  a failover circuit breaker is required for a production Octavia service.
  
  RFE: https://storyboard.openstack.org/#!/story/2005604
  Patch: https://review.opendev.org/#/c/656811/
  
  Steps to Reproduce:
    For example, drop all heartbeats from all amphorae on the controller nodes.

  Actual Results:
    The health manager tries to execute a failover for every existing amphora.

  Expected Results:
    The health manager stops some failovers based on the number of simultaneous failovers.

  Version-Release number of selected component:
    Red Hat OpenStack Platform Version Number: OSP13 
    Release Number: 
    Related Package Version:
      openstack-octavia-*-2.1.0-1.el7ost.noarch.rpm
    Related Middleware/Application: 
    Underlying RHEL Release Number: 7.6 
    Underlying RHEL Architecture: x86_64
    Underlying RHEL Kernel Version: 3.10.0-957.1.3.el7.x86_64

~~~
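
The expected behaviour described above can be illustrated with a minimal sketch. This is not the upstream Octavia implementation (see the linked patch for that). The names `plan_failovers`, `FailoverPlan`, and `threshold` are hypothetical, and the policy shown, which stops every failover once the number of simultaneously stale amphorae reaches the threshold, is only one possible reading of the request.

```python
from dataclasses import dataclass, field
from typing import Iterable, List


@dataclass
class FailoverPlan:
    """Hypothetical result type: which amphorae to fail over and which to skip."""
    to_failover: List[str] = field(default_factory=list)
    stopped: List[str] = field(default_factory=list)


def plan_failovers(stale_amphorae: Iterable[str], threshold: int) -> FailoverPlan:
    """Decide what to do with amphorae whose heartbeats have gone stale.

    Assumption: a threshold of 0 disables the circuit breaker. When the
    number of simultaneously stale amphorae reaches the threshold, the
    outage is treated as a control-plane problem (for example a lost
    heartbeat network) and no failover is started.
    """
    stale = list(stale_amphorae)
    if threshold > 0 and len(stale) >= threshold:
        # Circuit breaker tripped: skip all failovers for this cycle.
        return FailoverPlan(stopped=stale)
    # Below the threshold: fail over every stale amphora as usual.
    return FailoverPlan(to_failover=stale)
```

With a threshold of 3, two stale amphorae would still be failed over, while three or more at once would all be left alone until an operator investigates.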

Comment 1 Brian Haley 2021-01-14 19:27:32 UTC
We had a discussion upstream on this and I updated the code based on that, so I'll take ownership of this BZ.

Comment 3 Tom Weininger 2022-09-20 09:33:05 UTC
This feature has been integrated upstream for the Zed release.

