Bug 1042964 - [RFE][oslo]: Automatic recovery from transient db connection failures
Keywords:
Status: CLOSED UPSTREAM
Alias: None
Product: Red Hat OpenStack
Classification: Red Hat
Component: RFEs
Version: unspecified
Hardware: Unspecified
OS: Unspecified
Priority: unspecified
Severity: medium
Target Milestone: Upstream M3
Target Release: 5.0 (RHEL 7)
Assignee: RHOS Maint
QA Contact:
URL: https://blueprints.launchpad.net/oslo...
Whiteboard: upstream_milestone_icehouse-3 upstrea...
Depends On:
Blocks:
Reported: 2013-12-13 16:48 UTC by RHOS Integration
Modified: 2014-09-08 05:42 UTC (History)

Fixed In Version:
Doc Type: Enhancement
Doc Text:
Clone Of:
Environment:
Last Closed: 2014-03-27 13:49:43 UTC
Target Upstream Version:



Description RHOS Integration 2013-12-13 16:48:35 UTC
Cloned from launchpad blueprint https://blueprints.launchpad.net/oslo/+spec/db-reconnect.

Description:

A variety of circumstances can cause a transient failure in database connections, for example:
- restart or upgrade of the database,
- migration of a VIP between the nodes of an HA pair,
- a network failure.

Every project that connects to a database would benefit from the db/api layer catching these "db-has-gone-away" errors, automatically reconnecting, and retrying the last operation, so that the caller can continue whatever operation was in progress.
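
As a rough illustration of the catch, reconnect and retry behaviour requested above, here is a minimal Python sketch. It is not the oslo implementation: the names `DBConnectionError`, `retry_on_disconnect`, `max_retries`, `retry_interval` and `reconnect` are all hypothetical and chosen only for this example, and a real db/api layer would first have to translate driver-specific errors into such an exception.

```python
import functools
import time


class DBConnectionError(Exception):
    """Hypothetical marker for a "db-has-gone-away" failure."""


def retry_on_disconnect(max_retries=3, retry_interval=0.1, reconnect=None):
    """Retry the wrapped call when it raises DBConnectionError.

    All names and defaults are assumptions made for this sketch.
    """
    def decorator(func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            attempt = 0
            while True:
                try:
                    return func(*args, **kwargs)
                except DBConnectionError:
                    attempt += 1
                    if attempt > max_retries:
                        raise  # the failure does not look transient
                    if reconnect is not None:
                        reconnect()  # e.g. discard stale connections
                    time.sleep(retry_interval)
        return wrapper
    return decorator


# Usage: a fake db/api call that fails twice, then succeeds.
calls = {"count": 0}


@retry_on_disconnect(max_retries=3, retry_interval=0)
def get_instance(instance_id):
    calls["count"] += 1
    if calls["count"] < 3:
        raise DBConnectionError("server has gone away")
    return {"id": instance_id}


result = get_instance(42)
```

The caller of `get_instance` never sees the two simulated disconnects. Note that blindly repeating the last operation is only safe when that operation is idempotent or runs inside a transaction that was rolled back by the disconnect.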

It is not necessary to abort long-running operations (such as nova boot or glance image-create) just because of a momentary interruption in db connectivity.

A (slightly brute-force) patch was previously proposed to Nova: https://review.openstack.org/#/c/10797/

This blueprint is similar to the Nova blueprint proposed by Devananda van der Veen; see https://blueprints.launchpad.net/nova/+spec/db-reconnect

Specification URL (additional information):

None

Comment 2 Stephen Gordon 2014-02-06 14:08:45 UTC
Updating based on BP milestone

