Bug 2119584 - openstack stack update fails on host with HW-Offload VM during os-net-config
Summary: openstack stack update fails on host with HW-Offload VM during os-net-config
Keywords:
Status: POST
Alias: None
Product: Red Hat OpenStack
Classification: Red Hat
Component: os-net-config
Version: 16.2 (Train)
Hardware: Unspecified
OS: Linux
medium
medium
Target Milestone: zstream
: 16.2 (Train on RHEL 8.4)
Assignee: Karthik Sundaravel
QA Contact: Nobody
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2022-08-18 20:29 UTC by Florin Boboc
Modified: 2023-08-16 06:29 UTC (History)
10 users (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed:
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
OpenStack gerrit 866312 0 None NEW Avoid reconfiguring SR-IOV devices during stack update 2022-12-01 10:11:11 UTC
OpenStack gerrit 867292 0 None MERGED Avoid reconfiguring SR-IOV devices during stack update 2023-08-16 06:28:28 UTC
Red Hat Issue Tracker NFV-2609 0 None None None 2022-08-18 21:25:14 UTC
Red Hat Issue Tracker OSP-18271 0 None None None 2022-08-18 20:32:32 UTC

Description Florin Boboc 2022-08-18 20:29:40 UTC
Description of problem:
 The client is running a stack update on a OSP 16.2.3 VIM having in templates: "NetworkDeploymentActions: ['CREATE','UPDATE']"

 In task: TASK [tripleo-network-config : Disable cloud-init network config]

 ERROR: os-net-config configuration it's falling as it's trying to use a VF that it's used by a instance / vm

Version-Release number of selected component (if applicable): 
  Distro:   [redhat-release] Red Hat Enterprise Linux release 8.4 (Ootpa)
            [rhosp-release] Red Hat OpenStack Platform release 16.2.3 (Train)
            [os-release] Red Hat Enterprise Linux 8.4 (Ootpa) 8.4 (Ootpa)

Actual results: 
        ...
        "[2022/08/18 12:42:08 PM] [INFO] Unbinding 0000:d8:04.1",
        "OSError: [Errno 19] No such device",
        "",
        "During handling of the above exception, another exception occurred:",
        "",
        "Traceback (most recent call last):",
        "  File \"/bin/os-net-config\", line 10, in <module>",
        "    sys.exit(main())",
        "  File \"/usr/lib/python3.6/site-packages/os_net_config/cli.py\", line 331, in main",
        "    restart_openvswitch=restart_ovs)",
        "  File \"/usr/lib/python3.6/site-packages/os_net_config/utils.py\", line 618, in configure_sriov_pfs",
        "    restart_openvswitch=restart_openvswitch)",
        "  File \"/usr/lib/python3.6/site-packages/os_net_config/sriov_config.py\", line 302, in configure_sriov_pf",
        "    f.write(vf_pci)",
        "OSError: [Errno 19] No such device",
        "+ RETVAL=1",
        "+ set -e",
        "+ [[ 1 == 2 ]]",
        "+ [[ 1 != 0 ]]",
        "+ echo 'ERROR: os-net-config configuration failed.'",
        "ERROR: os-net-config configuration failed.",
        ...

Expected results: Client expects not to use the VF of instance and stack update to be successful

Additional info:
[Logs]
  - sosreport from undercloud: 0020-sosreport-tpa-vim-puc-03293078-2022-08-18-iirfjux.tar.xz
  - ansible log attached to the case: 0030-overcloud_deploy-22-08-18-08.log
  - Templates are attached to the case: 0040-hqp2.5templates.tgz
[Workaround]
  - Client changed in his template (home/stack/local-environments/overcloud-environment.yaml)  and stack update worked as network creation was performed.
     FROM: NetworkDeploymentActions: ['CREATE','UPDATE']
       TO: NetworkDeploymentActions: ['CREATE']

"NetworkDeploymentActions: ['CREATE']"


Note You need to log in before you can comment on or make changes to this bug.