Bug 2085588

Summary: [RHOS 16.2] ovs reconnects can block the nova-compute agent
Product: Red Hat OpenStack Reporter: smooney
Component: python-os-vifAssignee: smooney
Status: ON_QA --- QA Contact: Nobody <nobody>
Severity: high Docs Contact:
Priority: medium    
Version: 16.2 (Train)CC: alifshit, jjoyce, jschluet, nlevinki, rhos-maint, slinaber, tvignaud
Target Milestone: z6Keywords: Patch, Triaged
Target Release: 16.2 (Train on RHEL 8.4)   
Hardware: x86_64   
OS: Linux   
Whiteboard:
Fixed In Version: python-os-vif-1.17.0-2.20230717165051.3a08cc4.el8ost Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: 2085583 Environment:
Last Closed: Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On: 2085583    
Bug Blocks:    

Description smooney 2022-05-13 16:34:09 UTC
+++ This bug was initially created as a clone of Bug #2085583 +++

This bug was initially created as a copy of Bug #2085517

I am copying this bug because: 



Description of problem: Unable to resize a stopped VM with latest phase3 puddle RHOS-17.0-RHEL-9-20220511.n.1, this is the first puddle this has happened for 17 and was previously passing. Also this test is failing across a couple of jobs [1,2].  Example build failure and logs here [3,4].  Failure logs attached as well.

Version-Release number of selected component (if applicable):
RHOS-17.0-RHEL-9-20220511.n.1

How reproducible:
First time seeing this for 17 with latest puddle, happened across several jobs.

Steps to Reproduce:
1. Deploy environment with RHOS-17.0-RHEL-9-20220511.n.1 and execute tempest test tempest.api.compute.servers.test_server_actions/ServerActionsTestJSON/test_resize_server_confirm_from_stopped_id
2.
3.

Actual results:
VM remains in SHUTOFF state, fails to migrate, and never resizes

Expected results:
VM resizes

Additional info:
[1] https://rhos-ci-jenkins.lab.eng.tlv2.redhat.com//job/DFG-compute-nova-17.0_director-rhel-virthost-1cont_2comp_3ceph-ipv4-geneve-live-migration-shared-storage-config-drive-vfat-phase3/9//testReport/tempest.api.compute.servers.test_server_actions/ServerActionsTestJSON/test_resize_server_confirm_from_stopped_id_138b131d_66df_48c9_a171_64f45eb92962_/
[2] https://rhos-ci-jenkins.lab.eng.tlv2.redhat.com//job/DFG-compute-nova-17.0_director-rhel-virthost-1cont_2comp_3ceph-ipv4-geneve-smt-whitebox-numa-tests-ceph-phase3/8//testReport/tempest.api.compute.servers.test_server_actions/ServerActionsTestJSON/test_resize_server_confirm_from_stopped_id_138b131d_66df_48c9_a171_64f45eb92962_/
[3] https://rhos-ci-jenkins.lab.eng.tlv2.redhat.com/job/DFG-compute-nova-17.0_director-rhel-virthost-1cont_2comp_3ceph-ipv4-geneve-smt-whitebox-numa-tests-ceph-phase3/8/
[4] http://rhos-ci-logs.lab.eng.tlv2.redhat.com/logs/rcj/DFG-compute-nova-17.0_director-rhel-virthost-1cont_2comp_3ceph-ipv4-geneve-smt-whitebox-numa-tests-ceph-phase3/8/

Comment 1 smooney 2023-06-07 14:10:03 UTC
note this has already merged upstream in stable wallaby so we do not need to tack this for 17.1 its already included via the upstream work.