Description of problem:
[ppc64le][numa][regression] The maximum NUMA node count conflicts between the guest and qemu

Version-Release number of selected component (if applicable):
qemu-kvm-5.2.0-3.module+el8.4.0+9499+42e58f08

How reproducible:
3/3

Steps to Reproduce:
1. Boot a guest with more than 128 NUMA nodes (129.sh, attached below)

Actual results:
[root@ibm-p9wr-02 home]# sh 129.sh
QEMU 5.2.0 monitor - type 'help' for more information
(qemu) qemu-kvm: -numa node,memdev=mem-mem128: Max number of NUMA nodes reached: 128

Meanwhile, with the current command line, the guest reports:
# cat /sys/devices/system/node/possible
0-255

The two numbers do not match: going by the guest output we should support 256 nodes, not the 128 that qemu enforces. With older builds the issue was not reproducible (both the guest and qemu showed 128 nodes):

qemu-kvm-5.1.0-14.module+el8.3.0+8438+644aff69.ppc64le
[root@localhost ~]# cat /sys/devices/system/node/possible
cat /sys/devices/system/node/possible
0-127

This looks like a regression; please help to fix it, thanks.

Expected results:
The maximum NUMA node number should be the same at the guest and qemu levels.

Additional info:
Guest/host kernels: kernel-4.18.0-276.el8.ppc64le / 4.18.0-274.el8.ppc64le
Created attachment 1749292 [details] 129
Created attachment 1749293 [details] 128
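The attached scripts are not inlined in this report. For illustration, here is a rough sketch of what a reproducer along those lines might look like; the machine type, memory sizes, and loop structure are assumptions rather than the actual attachment contents (only the mem-memN backend naming comes from the error message above):

#!/bin/bash
# Hypothetical reproducer sketch: request N NUMA nodes, each backed by its own
# memory backend. N=128 boots; N=129 trips the qemu "Max number of NUMA nodes
# reached: 128" error shown in the description.
N=129
CMD="qemu-kvm -machine pseries -nodefaults -nographic -smp ${N} -m $((N * 256))M"
for i in $(seq 0 $((N - 1))); do
    CMD+=" -object memory-backend-ram,id=mem-mem${i},size=256M"
    CMD+=" -numa node,memdev=mem-mem${i}"
done
# Add the usual boot options (kernel/initrd or disk image) before running.
$CMD

# Inside the guest:
# cat /sys/devices/system/node/possible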
I didn't reproduce it on the x86 platform with qemu-kvm-5.2.0-3.module+el8.4.0+9499+42e58f08.x86_64.
AFAICT, the only issue here is that qemu has a limit of 128 NUMA nodes, whereas the guest kernel has a limit of 256 nodes, which is arguably not a bug. Since we use a common kernel for all platforms, we might have legitimate reasons to have a large limit in the kernel, to support more nodes on bare metal or PowerVM LPAR deployments. On x86, in what way does it not reproduce? Does qemu allow for more nodes? Or does the kernel report a lower number of possible nodes?
(In reply to David Gibson from comment #5)
> AFAICT, the only issue here is that qemu has a limit of 128 NUMA nodes,
> whereas the guest kernel has a limit of 256 nodes, which is arguably not a
> bug. Since we use a common kernel for all platforms, we might have
> legitimate reasons to have a large limit in the kernel, to support more
> nodes on bare metal or PowerVM LPAR deployments.
>
> On x86, in what way does it not reproduce? Does qemu allow for more nodes?
> Or does the kernel report a lower number of possible nodes?

Hi David,

On x86 the guest reported a lower number of possible nodes:

[root@localhost ~]# cat /sys/devices/system/node/possible
cat /sys/devices/system/node/possible
0-127

Thanks
Min
(In reply to David Gibson from comment #5)
> AFAICT, the only issue here is that qemu has a limit of 128 NUMA nodes,
> whereas the guest kernel has a limit of 256 nodes, which is arguably not a
> bug. Since we use a common kernel for all platforms, we might have
> legitimate reasons to have a large limit in the kernel, to support more
> nodes on bare metal or PowerVM LPAR deployments.

I understand your points, but this did not happen on older builds.
Ok, what happens on older builds? Both with 128 nodes specified in qemu, and with a smaller number of nodes specified in qemu? This still seems more or less a cosmetic problem to me.
Min,

What is happening here is that the pseries machine is reporting double the number of NUMA nodes specified on the command line. This is easily reproducible: just create a guest with 2 nodes and /sys/devices/system/node/possible will report 4, 4 user nodes will become 8, and so on.

David, I think we have a lucky break on this one. This is another instance of the bug I already fixed upstream, the one that was reported by Cedric:

https://lists.gnu.org/archive/html/qemu-devel/2021-01/msg07408.html

As such, this bug also happens with upstream QEMU, and with the above fix this is the output when running the 128.sh script:

# cat /sys/devices/system/node/possible
0-127

The patches were reviewed and accepted, but they haven't landed upstream yet. I would do a backport, but I'm not sure whether I can backport patches from David's tree. Let me know if that's allowed and I'll backport them. Otherwise we can wait for them to land upstream.
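For reference, a minimal way to observe the doubling described above; the command line here is an illustrative sketch (machine options, backend names, and sizes are assumptions), not taken from the attached scripts:

qemu-kvm -machine pseries -nographic -smp 2 -m 2G \
    -object memory-backend-ram,id=mem0,size=1G -numa node,memdev=mem0 \
    -object memory-backend-ram,id=mem1,size=1G -numa node,memdev=mem1 \
    <plus the usual kernel/disk boot options>

# In the guest, before the fix (pseries reports double the requested nodes):
# cat /sys/devices/system/node/possible
# 0-3
#
# With the fix applied, the count matches what was requested:
# cat /sys/devices/system/node/possible
# 0-1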
Thanks for the analysis, Daniel. You'll need to wait until the upstream merge before backporting this. I'm not yet sure when my next pull request will be; probably 1-2 weeks away. Since this is now low priority and aimed at RHEL8.5, that shouldn't be a big problem.
> qemu-kvm-5.1.0-14.module+el8.3.0+8438+644aff69.ppc64le
> [root@localhost ~]# cat /sys/devices/system/node/possible
> cat /sys/devices/system/node/possible
> 0-127
>
> Expected results:
> The maximum NUMA node number should be the same at the guest and qemu levels.
>
> Additional info:

According to the bug's description, the number of NUMA nodes reported by qemu and by the guest matched on the old build.
The fix is now upstream, so we should get it via the rebase.
Hi mrezanin,

Can you take a look at this bz since it's in Devmissed status now? Per lvivier, the fix is included in the rebase code. Should we move it to MODIFIED directly, or do something else?

Thanks.

Best regards,
Min
Verified the bug on build qemu-kvm-6.0.0-16.module+el8.5.0+10848+2dccc46d.ppc64le.

For the steps, please refer to the description. The original issue has been fixed, thanks.
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (virt:av bug fix and enhancement update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2021:4684