Bug 1584269 - [RFE] Utilize active call monitoring feature from oslo.messaging
Summary: [RFE] Utilize active call monitoring feature from oslo.messaging
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat OpenStack
Classification: Red Hat
Component: openstack-nova
Version: 14.0 (Rocky)
Hardware: Unspecified
OS: Unspecified
medium
medium
Target Milestone: Upstream M2
: 14.0 (Rocky)
Assignee: Dan Smith
QA Contact: OSP DFG:Compute
URL:
Whiteboard:
: 1584268 (view as bug list)
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2018-05-30 15:22 UTC by Dan Smith
Modified: 2023-03-21 18:51 UTC (History)
10 users (show)

Fixed In Version: openstack-nova-18.0.0-0.20180710150340.8469fa7
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2019-01-11 11:49:59 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
OpenStack gerrit 566696 0 'None' MERGED Use oslo.messaging per-call monitoring 2021-02-17 00:13:26 UTC
Red Hat Bugzilla 1536146 1 None None None 2024-06-13 20:56:05 UTC
Red Hat Product Errata RHEA-2019:0045 0 None None None 2019-01-11 11:50:16 UTC

Internal Links: 1536146

Description Dan Smith 2018-05-30 15:22:35 UTC
The oslo.messaging package now supports active call monitoring which allows tolerating much longer timeout values for RPC calls without sacrificing down-service detection time. Nova should use this feature, especially for known to be long-running activities, such as live migration.

Comment 2 Dan Smith 2018-06-01 14:03:49 UTC
*** Bug 1584268 has been marked as a duplicate of this bug. ***

Comment 3 Dan Smith 2018-06-18 15:04:32 UTC
This is merged upstream in rocky.

I'm not really sure how best to test this in an automated way, because it requires an environment where things are happening abnormally slowly, which will be hard to synthesize. We have tested this with a one-off patch that introduces an artificial delay during setup, which would have failed before the introduction of this feature. Perhaps a one-time manual verification of this is the best we can do at the moment.

The upstream hack to make the pre-migration setup call take longer is here:

https://review.openstack.org/#/c/574482/2/nova/compute/manager.py

Likely a longer delay would be useful for manual testing. Also, overriding the oslo.messaging log level during the run will generate "received heartbeat, reset timeout" messages which can be used to verify that the heartbeat functionality is working.

Comment 9 errata-xmlrpc 2019-01-11 11:49:59 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHEA-2019:0045


Note You need to log in before you can comment on or make changes to this bug.