Bug 1550923
| Summary: | [RFE] Implement a rebalancing mechanism for dhcp/l3 agents to spread the load among all controllers | ||
|---|---|---|---|
| Product: | Red Hat OpenStack | Reporter: | David Manchado <dmanchad> |
| Component: | openstack-neutron | Assignee: | OSP Team <rhos-maint> |
| Status: | NEW --- | QA Contact: | Toni Freger <tfreger> |
| Severity: | low | Docs Contact: | |
| Priority: | low | ||
| Version: | unspecified | CC: | bcafarel, bsawyers, chrisw, kforde, srevivo |
| Target Milestone: | Upstream M2 | Keywords: | FutureFeature |
| Target Release: | --- | ||
| Hardware: | x86_64 | ||
| OS: | Linux | ||
| Whiteboard: | |||
| Fixed In Version: | Doc Type: | If docs needed, set a value | |
| Doc Text: | Story Points: | --- | |
| Clone Of: | Environment: | ||
| Last Closed: | Type: | Bug | |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | Category: | --- | |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | |||
|
Description
David Manchado
2018-03-02 09:53:00 UTC
I'll jot down some initial thoughts: The DHCP case isn't as urgent as the majority (84% according to UP) use A/A/A, so rebalancing is not relevant. Rebalancing L3 HA routers invokes data plane downtime. Therefor I don't think automatic rebalancing is desirable. I think in the case of downtime it's something operators would like to control. With that in mind I think we could provide a script that would use the API to rebalance routers (and DHCP if needed). If this boils down to a CLI driven script shipped in a Neutron or TripleO RPM that operators would invoke manually on demand, would that solve the issue as you see it? Assaf, Some tooling like the script you mentioned would work for me, not sure if this should have to sorted by any PM. I filed the RFE based on the internal discussion about if we felt that current behavior was the best way to manage HA setups. Or at least raise awareness that operators will tend to do staged reboots and, as result of it, consolidate most of the agents in one of the controllers. Approving for OSP 14, under the understanding that we're treating this as a low priority RFE. According to email discussion, Networking DFG will be responsible to run regression testing when, David Manchado, will perform functional testing. Pushing this out of RHOSP 14, given other priorities. |