Bug 1085006
Summary: | Internal Error from python-qpid can cause qpid connection to never recover | |||
---|---|---|---|---|
Product: | Red Hat OpenStack | Reporter: | Russell Bryant <rbryant> | |
Component: | openstack-nova | Assignee: | Russell Bryant <rbryant> | |
Status: | CLOSED ERRATA | QA Contact: | Toure Dunnon <tdunnon> | |
Severity: | high | Docs Contact: | ||
Priority: | unspecified | |||
Version: | 4.0 | CC: | ajeain, breeler, cpelland, ebarrera, ggillies, iwienand, ndipanov, sgordon, stoner, tdunnon, vpopovic, yeylon | |
Target Milestone: | async | Keywords: | OtherQA, ZStream | |
Target Release: | 4.0 | |||
Hardware: | Unspecified | |||
OS: | Unspecified | |||
Whiteboard: | ||||
Fixed In Version: | openstack-nova-2013.2.3-9.el6ost | Doc Type: | Bug Fix | |
Doc Text: |
There was an internal error in the python-qpid library which Compute would fail to handle gracefully and Qpid communication would be broken.
This has been fixed so that Compute gracefully handles the failure and restarts Qpid communication.
Now, Compute services recover after an internal error in the python-qpid library.
|
Story Points: | --- | |
Clone Of: | ||||
: | 1085994 1085995 1085996 1085997 1086006 (view as bug list) | Environment: | ||
Last Closed: | 2014-08-21 00:40:06 UTC | Type: | Bug | |
Regression: | --- | Mount Type: | --- | |
Documentation: | --- | CRM: | ||
Verified Versions: | Category: | --- | ||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | ||
Cloudforms Team: | --- | Target Upstream Version: | ||
Embargoed: | ||||
Bug Depends On: | ||||
Bug Blocks: | 1040649, 1086006 |
Description
Russell Bryant
2014-04-07 13:59:55 UTC
This patch has been merged into both oslo.messaging and the rpc library in oslo-incubator. In RHOS 4.0, nothing had been converted to oslo.messaging, so this fix needs to be backported to all of the projects that include rpc from oslo-incubator. I will be cloning this bug to all affected projects. *** Bug 1098827 has been marked as a duplicate of this bug. *** *** Bug 1108959 has been marked as a duplicate of this bug. *** How do we repro this condition? Is there a way to determine which of the qpid child threads is the connection thread? If so, I could try to kill the connection thread, and see if the problem persists. Actually, as I am writing this, I see Russel posted this as an internal error in the python-qpid library. Is there a version of python-qpid with a fix for this? (In reply to Sean Toner from comment #6) > How do we repro this condition? Is there a way to determine which of the > qpid child threads is the connection thread? If so, I could try to kill the > connection thread, and see if the problem persists. Reproducing this will be very difficult. I honestly wouldn't bother. Time is better spent elsewhere. > Actually, as I am writing this, I see Russel posted this as an internal > error in the python-qpid library. Is there a version of python-qpid with a > fix for this? Yes, this was triggered when we hit this bug: https://issues.apache.org/jira/browse/QPID-5700 https://bugzilla.redhat.com/show_bug.cgi?id=1088004 The fix was in python-qpid-0.18-10.el7 Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. http://rhn.redhat.com/errata/RHSA-2014-1084.html |