Bug 2124910

Summary: [RHOS 17.1] ovs reconnects can block the nova-compute agent
Product: Red Hat OpenStack Reporter: smooney
Component: openstack-novaAssignee: smooney
Status: VERIFIED --- QA Contact: Jason Grosso <jgrosso>
Severity: high Docs Contact:
Priority: high    
Version: 17.1 (Wallaby)CC: dasmith, eglynn, jhakimra, kchamart, lsvaty, prgutier, sbauza, sgordon, vromanso
Target Milestone: gaKeywords: Patch, Regression, Triaged
Target Release: 17.1Flags: prgutier: needinfo? (smooney)
ifrangs: needinfo? (smooney)
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description smooney 2022-09-07 12:47:19 UTC
This bug was initially created as a copy of Bug #2085583

I am copying this bug because: we have branced 17.1 and we need a bz tracker for both branches.



Description of problem: Unable to resize a stopped VM with latest phase3 puddle RHOS-17.0-RHEL-9-20220511.n.1, this is the first puddle this has happened for 17 and was previously passing. Also this test is failing across a couple of jobs [1,2].  Example build failure and logs here [3,4].  Failure logs attached as well.

Version-Release number of selected component (if applicable):
RHOS-17.0-RHEL-9-20220511.n.1

How reproducible:
First time seeing this for 17 with latest puddle, happened across several jobs.

Steps to Reproduce:
1. Deploy environment with RHOS-17.0-RHEL-9-20220511.n.1 and execute tempest test tempest.api.compute.servers.test_server_actions/ServerActionsTestJSON/test_resize_server_confirm_from_stopped_id
2.
3.

Actual results:
VM remains in SHUTOFF state, fails to migrate, and never resizes

Expected results:
VM resizes

Additional info:
[1] https://rhos-ci-jenkins.lab.eng.tlv2.redhat.com//job/DFG-compute-nova-17.0_director-rhel-virthost-1cont_2comp_3ceph-ipv4-geneve-live-migration-shared-storage-config-drive-vfat-phase3/9//testReport/tempest.api.compute.servers.test_server_actions/ServerActionsTestJSON/test_resize_server_confirm_from_stopped_id_138b131d_66df_48c9_a171_64f45eb92962_/
[2] https://rhos-ci-jenkins.lab.eng.tlv2.redhat.com//job/DFG-compute-nova-17.0_director-rhel-virthost-1cont_2comp_3ceph-ipv4-geneve-smt-whitebox-numa-tests-ceph-phase3/8//testReport/tempest.api.compute.servers.test_server_actions/ServerActionsTestJSON/test_resize_server_confirm_from_stopped_id_138b131d_66df_48c9_a171_64f45eb92962_/
[3] https://rhos-ci-jenkins.lab.eng.tlv2.redhat.com/job/DFG-compute-nova-17.0_director-rhel-virthost-1cont_2comp_3ceph-ipv4-geneve-smt-whitebox-numa-tests-ceph-phase3/8/
[4] http://rhos-ci-logs.lab.eng.tlv2.redhat.com/logs/rcj/DFG-compute-nova-17.0_director-rhel-virthost-1cont_2comp_3ceph-ipv4-geneve-smt-whitebox-numa-tests-ceph-phase3/8/