Bug 2085588 - [RHOS 16.2] ovs reconnects can block the nova-compute agent
Summary: [RHOS 16.2] ovs reconnects can block the nova-compute agent
Keywords:
Status: ON_QA
Alias: None
Product: Red Hat OpenStack
Classification: Red Hat
Component: python-os-vif
Version: 16.2 (Train)
Hardware: x86_64
OS: Linux
Priority: medium
Severity: high
Target Milestone: z6
Target Release: 16.2 (Train on RHEL 8.4)
Assignee: smooney
QA Contact: Nobody
URL:
Whiteboard:
Depends On: 2085583
Blocks:
Reported: 2022-05-13 16:34 UTC by smooney
Modified: 2023-07-27 16:09 UTC (History)
CC List: 7 users

Fixed In Version: python-os-vif-1.17.0-2.20230717165051.3a08cc4.el8ost
Doc Type: If docs needed, set a value
Doc Text:
Clone Of: 2085583
Environment:
Last Closed:
Target Upstream Version:
Embargoed:


Links
System                 ID         Private  Priority  Status  Summary                                   Last Updated
Launchpad              1929446    0        None      None    None                                      2022-05-13 16:36:00 UTC
OpenStack gerrit       841779     0        None      NEW     Use TCP keepalives for ovsdb connections  2023-06-19 15:27:51 UTC
OpenStack gerrit       841780     0        None      NEW     only register tables used by os-vif       2023-06-19 15:27:52 UTC
Red Hat Issue Tracker  OSP-15230  0        None      None    None                                      2022-05-13 16:43:25 UTC
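
One of the linked upstream changes is titled "Use TCP keepalives for ovsdb connections". As a rough illustration of that general technique, and not the actual os-vif patch, the sketch below enables keepalive probes on a plain Python TCP socket so that a dead peer is eventually detected instead of the connection hanging indefinitely. The function name `enable_tcp_keepalive` and the timing values are hypothetical, chosen only for this example; the `TCP_KEEP*` options use the Linux names and are skipped on platforms that do not expose them.

```python
import socket


def enable_tcp_keepalive(sock, idle=60, interval=10, count=5):
    """Enable TCP keepalive probes on an existing TCP socket.

    The idle/interval/count defaults are illustrative assumptions,
    not values taken from os-vif or Open vSwitch.
    """
    # Ask the kernel to send keepalive probes on an otherwise idle connection.
    sock.setsockopt(socket.SOL_SOCKET, socket.SO_KEEPALIVE, 1)
    # Per-socket tuning knobs are platform-specific (Linux names below),
    # so only set the ones this platform provides.
    for name, value in (
        ("TCP_KEEPIDLE", idle),       # seconds of idle time before the first probe
        ("TCP_KEEPINTVL", interval),  # seconds between unanswered probes
        ("TCP_KEEPCNT", count),       # unanswered probes before the connection is dropped
    ):
        option = getattr(socket, name, None)
        if option is not None:
            sock.setsockopt(socket.IPPROTO_TCP, option, value)
    return sock


if __name__ == "__main__":
    s = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    enable_tcp_keepalive(s)
    # A non-zero value means keepalive is enabled on this socket.
    print(s.getsockopt(socket.SOL_SOCKET, socket.SO_KEEPALIVE))
    s.close()
```

With these example values, a peer that silently disappears would be noticed after roughly idle + interval * count seconds on Linux, rather than leaving the caller blocked on a half-open connection.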

Description smooney 2022-05-13 16:34:09 UTC
+++ This bug was initially created as a clone of Bug #2085583 +++

This bug was initially created as a copy of Bug #2085517

I am copying this bug because: 



Description of problem: Unable to resize a stopped VM with the latest phase3 puddle, RHOS-17.0-RHEL-9-20220511.n.1. This is the first 17 puddle in which this has happened; the test was previously passing. The same test is also failing across a couple of jobs [1,2]. An example build failure and its logs are at [3,4]. Failure logs are attached as well.

Version-Release number of selected component (if applicable):
RHOS-17.0-RHEL-9-20220511.n.1

How reproducible:
First time seeing this for 17, with the latest puddle; it happened across several jobs.

Steps to Reproduce:
1. Deploy environment with RHOS-17.0-RHEL-9-20220511.n.1 and execute tempest test tempest.api.compute.servers.test_server_actions/ServerActionsTestJSON/test_resize_server_confirm_from_stopped_id

Actual results:
VM remains in SHUTOFF state, fails to migrate, and never resizes

Expected results:
VM resizes

Additional info:
[1] https://rhos-ci-jenkins.lab.eng.tlv2.redhat.com//job/DFG-compute-nova-17.0_director-rhel-virthost-1cont_2comp_3ceph-ipv4-geneve-live-migration-shared-storage-config-drive-vfat-phase3/9//testReport/tempest.api.compute.servers.test_server_actions/ServerActionsTestJSON/test_resize_server_confirm_from_stopped_id_138b131d_66df_48c9_a171_64f45eb92962_/
[2] https://rhos-ci-jenkins.lab.eng.tlv2.redhat.com//job/DFG-compute-nova-17.0_director-rhel-virthost-1cont_2comp_3ceph-ipv4-geneve-smt-whitebox-numa-tests-ceph-phase3/8//testReport/tempest.api.compute.servers.test_server_actions/ServerActionsTestJSON/test_resize_server_confirm_from_stopped_id_138b131d_66df_48c9_a171_64f45eb92962_/
[3] https://rhos-ci-jenkins.lab.eng.tlv2.redhat.com/job/DFG-compute-nova-17.0_director-rhel-virthost-1cont_2comp_3ceph-ipv4-geneve-smt-whitebox-numa-tests-ceph-phase3/8/
[4] http://rhos-ci-logs.lab.eng.tlv2.redhat.com/logs/rcj/DFG-compute-nova-17.0_director-rhel-virthost-1cont_2comp_3ceph-ipv4-geneve-smt-whitebox-numa-tests-ceph-phase3/8/

Comment 1 smooney 2023-06-07 14:10:03 UTC
Note: this has already merged upstream in stable wallaby, so we do not need to track this for 17.1; it is already included via the upstream work.

