
Bug 1584269

Summary: [RFE] Utilize active call monitoring feature from oslo.messaging
Product: Red Hat OpenStack
Component: openstack-nova
Version: 14.0 (Rocky)
Target Milestone: Upstream M2
Target Release: 14.0 (Rocky)
Hardware: Unspecified
OS: Unspecified
Status: CLOSED ERRATA
Severity: medium
Priority: medium
Keywords: FutureFeature, Triaged
Reporter: Dan Smith <dasmith>
Assignee: Dan Smith <dasmith>
QA Contact: OSP DFG:Compute <osp-dfg-compute>
CC: berrange, dasmith, eglynn, jhakimra, kchamart, lyarwood, sbauza, sgordon, srevivo, vromanso
Fixed In Version: openstack-nova-18.0.0-0.20180710150340.8469fa7
Last Closed: 2019-01-11 11:49:59 UTC
Type: Bug

Description Dan Smith 2018-05-30 15:22:35 UTC
The oslo.messaging package now supports active call monitoring, which allows much longer timeout values for RPC calls without sacrificing down-service detection time. Nova should use this feature, especially for operations known to be long-running, such as live migration.
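
To illustrate the idea, here is a minimal standard-library simulation of a monitored call. It is only a sketch, not oslo.messaging's implementation: the queue stands in for the reply channel, and `slow_server`, `monitored_call`, `run_call`, and `CallTimeout` are hypothetical names made up for this example. The `timeout` and `call_monitor_timeout` arguments are meant to echo the long overall limit and the short heartbeat window described above.

```python
import queue
import threading
import time

HEARTBEAT = object()  # sentinel standing in for a "still working" message


class CallTimeout(Exception):
    """Raised when the heartbeat window or the overall deadline expires."""


def slow_server(replies, work_seconds, heartbeat_interval, alive=True):
    """Simulate a long-running handler that heartbeats while it works."""
    if not alive:
        return  # a dead service sends neither heartbeats nor a reply
    deadline = time.monotonic() + work_seconds
    while time.monotonic() < deadline:
        time.sleep(heartbeat_interval)
        replies.put(HEARTBEAT)
    replies.put("done")


def monitored_call(replies, timeout, call_monitor_timeout):
    """Wait for a reply, giving up early if heartbeats stop arriving."""
    hard_deadline = time.monotonic() + timeout
    heartbeats = 0
    while True:
        remaining = hard_deadline - time.monotonic()
        if remaining <= 0:
            raise CallTimeout("overall timeout expired")
        try:
            # Each wait is bounded by the short window, so silence is
            # noticed quickly even though the overall limit is long.
            msg = replies.get(timeout=min(call_monitor_timeout, remaining))
        except queue.Empty:
            raise CallTimeout("no reply or heartbeat in time") from None
        if msg is HEARTBEAT:
            heartbeats += 1  # the next loop iteration starts a fresh window
            continue
        return msg, heartbeats


def run_call(work_seconds, alive=True):
    """Start the simulated server and make one monitored call to it."""
    replies = queue.Queue()
    threading.Thread(
        target=slow_server,
        args=(replies, work_seconds, 0.05, alive),
        daemon=True,
    ).start()
    return monitored_call(replies, timeout=5.0, call_monitor_timeout=0.3)


if __name__ == "__main__":
    print(run_call(work_seconds=0.6))  # work outlasts the 0.3 s window
    try:
        run_call(work_seconds=0.6, alive=False)
    except CallTimeout as exc:
        print("dead service detected:", exc)
```

In this sketch the live call succeeds even though the work takes longer than the heartbeat window, while the dead service is reported after roughly one window rather than after the full overall timeout.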

Comment 2 Dan Smith 2018-06-01 14:03:49 UTC
*** Bug 1584268 has been marked as a duplicate of this bug. ***

Comment 3 Dan Smith 2018-06-18 15:04:32 UTC
This is merged upstream in Rocky.

I'm not really sure how best to test this in an automated way, because it requires an environment where things are happening abnormally slowly, which will be hard to synthesize. We have tested this with a one-off patch that introduces an artificial delay during setup; with that delay, the call would have failed before this feature was introduced. Perhaps a one-time manual verification is the best we can do at the moment.

The upstream hack to make the pre-migration setup call take longer is here:

https://review.openstack.org/#/c/574482/2/nova/compute/manager.py

A longer delay would likely be more useful for manual testing. Also, overriding the oslo.messaging log level during the run will generate "received heartbeat, reset timeout" messages, which can be used to verify that the heartbeat functionality is working.
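
For the manual check, a small helper along these lines could count the heartbeat messages in a captured log. It is a sketch only: the message text is the one quoted above, but `count_heartbeats` and the sample log lines are hypothetical, and the real log format will differ by deployment, so the helper matches on the substring alone.

```python
HEARTBEAT_MARKER = "received heartbeat, reset timeout"


def count_heartbeats(lines):
    """Count log lines that contain the heartbeat message."""
    return sum(1 for line in lines if HEARTBEAT_MARKER in line)


if __name__ == "__main__":
    # Hypothetical sample lines, for illustration only.
    sample = [
        "DEBUG received heartbeat, reset timeout",
        "INFO pre-migration setup still running",
        "DEBUG received heartbeat, reset timeout",
    ]
    print(count_heartbeats(sample))
```

A nonzero count during a deliberately slowed migration would suggest that heartbeats are arriving and resetting the timeout; a count of zero would suggest either that the log level was not overridden or that monitoring is not active.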

Comment 9 errata-xmlrpc 2019-01-11 11:49:59 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHEA-2019:0045