Bug 2107306 - [OSP17.1] _numa_cells_support_network_metadata does not log any output even at debug
Summary: [OSP17.1] _numa_cells_support_network_metadata does not log any output even a...
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat OpenStack
Classification: Red Hat
Component: openstack-nova
Version: 17.1 (Wallaby)
Hardware: Unspecified
OS: Unspecified
low
low
Target Milestone: z4
: 17.1
Assignee: melanie witt
QA Contact: OSP DFG:Compute
URL:
Whiteboard:
Depends On: 2249578
Blocks:
TreeView+ depends on / blocked
 
Reported: 2022-07-14 17:36 UTC by Jean-Francois Beaudoin
Modified: 2024-11-21 09:38 UTC (History)
13 users (show)

Fixed In Version: openstack-nova-23.2.3-17.1.20240614020750.2ace99d.el9ost
Doc Type: No Doc Update
Doc Text:
Clone Of:
: 2249578 (view as bug list)
Environment:
Last Closed: 2024-11-21 09:38:14 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
OpenStack gerrit 860930 0 None MERGED Improve logging at '_numa_cells_support_network_metadata' 2023-11-30 15:25:14 UTC
OpenStack gerrit 900845 0 None NEW Improve logging at '_numa_cells_support_network_metadata' 2023-11-30 15:25:22 UTC
Red Hat Issue Tracker OSP-17626 0 None None None 2022-07-14 17:37:41 UTC
Red Hat Product Errata RHBA-2024:9974 0 None None None 2024-11-21 09:38:17 UTC

Description Jean-Francois Beaudoin 2022-07-14 17:36:20 UTC
Description of problem:
Instance creation fails onto NUMATopologyFilter when it seems there's at least 1 numa node with enough ressources.


Version-Release number of selected component (if applicable):
Red Hat OpenStack Platform release 16.2.2 (Train)

How reproducible:
Every time we try to spawn a instance using this flavor.

Steps to Reproduce:
1. Try to create a VM using the flavor.
2.
3.

Actual results:
Being able to create a VM within a numa node with available ressource.

Expected results:
Creation gets block at NUMATopologyFilter.

Additional info:
[stack@director ]$ openstack flavor show ovn-dpdk
+----------------------------+------------------------------------------------------------------------------------------------------------------------+
| Field                      | Value                                                                                                                  |
+----------------------------+------------------------------------------------------------------------------------------------------------------------+
| OS-FLV-DISABLED:disabled   | False                                                                                                                  |
| OS-FLV-EXT-DATA:ephemeral  | 0                                                                                                                      |
| access_project_ids         | None                                                                                                                   |
| description                | None                                                                                                                   |
| disk                       | 20                                                                                                                     |
| extra_specs                | {'hw:cpu_policy': 'dedicated', 'hw:emulator_threads_policy': 'isolate', 'hw:mem_page_size': '1GB', 'ovn-dpdk': 'true'} |

| name                       | ovn-dpdk                                                                                                               |
| os-flavor-access:is_public | True                                                                                                                   |
| properties                 | hw:cpu_policy='dedicated', hw:emulator_threads_policy='isolate', hw:mem_page_size='1GB', ovn-dpdk='true'               |
| ram                        | 4096                                                                                                                   |
| rxtx_factor                | 1.0                                                                                                                    |
| swap                       | 0                                                                                                                      |
| vcpus                      | 4                                                                                                                      |
+----------------------------+------------------------------------------------------------------------------------------------------------------------+

Comment 7 smooney 2022-07-18 12:04:19 UTC
updating the title to reject that this is being used to track improving logging.

tl;dr
the original bug report was invalid because the customer did not actually have enough space to boot all the vms they wanted on the host in question.
however while debugging this we noticed that  _numa_cells_support_network_metadata does not have any logging so when it eliminates a host cell
because the numa aware switch feature is in use there is not log to indicate that. As such it makes debugging scheduling issues related to numa
aware vswitchs very difficult without intimate knowledge of the code. we can improve this trivially by adding logging at debug and or info level
when a cell is eliminated.

Comment 9 Artom Lifshitz 2022-10-04 19:24:03 UTC
I'm going to convert this to a bug to improve logging in that area of the code, target 16.x because we'll need it for customer cases.

Comment 10 Jorge San Emeterio 2022-10-10 13:26:01 UTC
Upstream bug at:
https://bugs.launchpad.net/nova/+bug/1751784

Comment 12 Artom Lifshitz 2023-06-05 18:42:11 UTC
I think aiming for 16.2.6 with this is realistic, given how small the patch is.

Comment 29 errata-xmlrpc 2024-11-21 09:38:14 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (RHOSP 17.1.4 bug fix and enhancement advisory), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2024:9974


Note You need to log in before you can comment on or make changes to this bug.