As we know, we have always tried to improve small-file performance in glusterfs. After long-duration testing we found that memory allocation is a big area that affects small-file performance significantly in glusterfs. Gluster uses its own thread-based mempool to allocate/deallocate memory blocks. In testing we observed that it performs well compared to the glibc thread-based pool, but it does not perform well compared to the tcmalloc pool, so we decided to move to tcmalloc as the default option instead of using the glibc pool.

I executed a smallfile perf test case for 4.8M files of 64 KB (more than 20 times the usual daily operation) on the latest devel branch. I set up a 12x3 (NVMe) volume and configured 4 event threads on 12 physical machines; no other configurable option was enabled.

Hardware details: 12 physical machines (6 clients, 6 servers); every machine has 64 CPUs, 32 GB RAM, and a 10 GbE NIC.

The smallfile tool was used to run the operations; the total number of files is 4.8M.

date
for i in {1..5}
do
  /root/cleanup.sh; ./smallfile_cli.py --operation create --threads 16 --file-size 64 --files 50000 --top /mnt/test --host-set client01.perf.cloud,client02.perf.cloud,client03.perf.cloud,client04.perf.cloud,client05.perf.cloud,client06.perf.cloud
  /root/cleanup.sh; ./smallfile_cli.py --operation ls-l --threads 16 --file-size 64 --files 50000 --top /mnt/test --host-set client01.perf.cloud,client02.perf.cloud,client03.perf.cloud,client04.perf.cloud,client05.perf.cloud,client06.perf.cloud
  /root/cleanup.sh; ./smallfile_cli.py --operation chmod --threads 16 --file-size 64 --files 50000 --top /mnt/test --host-set client01.perf.cloud,client02.perf.cloud,client03.perf.cloud,client04.perf.cloud,client05.perf.cloud,client06.perf.cloud
  /root/cleanup.sh; ./smallfile_cli.py --operation stat --threads 16 --file-size 64 --files 50000 --top /mnt/test --host-set client01.perf.cloud,client02.perf.cloud,client03.perf.cloud,client04.perf.cloud,client05.perf.cloud,client06.perf.cloud
  /root/cleanup.sh; ./smallfile_cli.py --operation read --threads 16 --file-size 64 --files 50000 --top /mnt/test --host-set client01.perf.cloud,client02.perf.cloud,client03.perf.cloud,client04.perf.cloud,client05.perf.cloud,client06.perf.cloud
  /root/cleanup.sh; ./smallfile_cli.py --operation append --threads 16 --file-size 64 --files 50000 --top /mnt/test --host-set client01.perf.cloud,client02.perf.cloud,client03.perf.cloud,client04.perf.cloud,client05.perf.cloud,client06.perf.cloud
  /root/cleanup.sh; ./smallfile_cli.py --operation mkdir --threads 16 --file-size 64 --files 50000 --top /mnt/test --host-set client01.perf.cloud,client02.perf.cloud,client03.perf.cloud,client04.perf.cloud,client05.perf.cloud,client06.perf.cloud
  /root/cleanup.sh; ./smallfile_cli.py --operation rmdir --threads 16 --file-size 64 --files 50000 --top /mnt/test --host-set client01.perf.cloud,client02.perf.cloud,client03.perf.cloud,client04.perf.cloud,client05.perf.cloud,client06.perf.cloud
  /root/cleanup.sh; ./smallfile_cli.py --operation cleanup --threads 16 --file-size 64 --files 50000 --top /mnt/test --host-set client01.perf.cloud,client02.perf.cloud,client03.perf.cloud,client04.perf.cloud,client05.perf.cloud,client06.perf.cloud
done
date

We got a significant performance improvement; the data is available at the upstream link https://github.com/gluster/glusterfs/issues/2771.
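For anyone reproducing the comparison, here is a minimal sketch of how to confirm that a glusterfs process is actually running on top of tcmalloc. This is an illustration, not part of the runs above; the library path, soname, and binary location are assumptions for a typical RHEL-like install:

# Check whether a running brick process has libtcmalloc mapped.
pid=$(pidof glusterfsd | awk '{print $1}')
grep -m1 -o 'libtcmalloc[^ ]*' /proc/${pid}/maps || echo "tcmalloc not loaded"

# The same can be checked against the installed binary:
ldd /usr/sbin/glusterfsd | grep tcmalloc

# For a quick A/B trial without rebuilding, tcmalloc can also be preloaded
# (library path is an assumption):
# LD_PRELOAD=/usr/lib64/libtcmalloc.so.4 glusterfsd ...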
Thanks Mohit for working on this patch, and thanks Xavi for the suggestions for improvement.

This patch allows glusterfs to utilize 'tcmalloc', moving away from the traditional 'mempool' that glusterfs has used for a long time. This brings in a new underlying feature that is transparent to customers, but it is a significant change in how memory allocation is managed. So this change falls under RFE, as the core memory management/allocation methodology is swapped for a better one optimized for performance, even though no change is exposed at the user level. As per our 'RHGS and Layered Product batch update model - 2.2'[1], we agreed not to include any RFE as part of a maintenance release.

Also, this patch is useful for small-file workloads on glusterfs. Small-file workloads do not suit glusterfs well, and this fact was well advertised to our customers. Providing a solution to this problem is really good, but it will come as an added advantage for existing customers who were running small-file workloads against the general RHGS recommendation. This has a direct customer impact. Irrespective of the workload classification, it will definitely improve performance, but there is still a risk involved with the unknown.

On the other hand, the patch was merged on Sep 15, 2021 and, as far as I checked, hasn't made it into any glusterfs upstream release so far. I see that the next release, glusterfs-9.5, is scoped for Dec 30, 2021, and I believe that is the release which will carry this change to upstream users. The logic here is that we would need a good amount of soak time in the community before we could make a decision about including the patch in RHGS 3.5.z.

With this in mind, I would like to retarget this bug for RHGS 3.5.8 and revisit the verdict of including it in RHGS 3.5.z with that information.

@Mohit, @Sunil - What do you suggest?

[1] - https://docs.google.com/document/d/1KvdyoI8-BNJJuADBkN0OVTW4sMZ4dX6LqxUXPJtK8ME/edit
Same response as the iobuf patch, although this one is even less intrusive and could be useful by itself.
After a conversation with Mohit, I understood that 'tcmalloc' requires the 'gperftools' package. This particular package is not available in the RHGS-specific repos on either RHEL 7 or RHEL 8.

On RHEL 7, the RHEL 7 Server repo contains the 'gperftools' package, but as far as I checked, on RHEL 8 the package is not available in BaseOS or AppStream.

If tcmalloc is used, glusterfs will have a hard dependency on this new package, gperftools. So the glusterfs package should have a hard dependency on the 'gperftools' package on both RHEL 7 and RHEL 8.

On the other hand, as per the RHGS batch update model[1], we agreed not to include new packages.

@Sunil, what are your thoughts?

[1] - https://docs.google.com/document/d/1KvdyoI8-BNJJuADBkN0OVTW4sMZ4dX6LqxUXPJtK8ME/edit
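For reference, a rough sketch of how the dependency could be checked and expressed follows. The package names gperftools-libs / gperftools-devel are assumptions based on the usual Fedora/EPEL packaging split; the actual spec change may differ:

# Check whether the runtime library is available in the enabled RHEL 8 repos:
dnf repoquery --available 'gperftools*'
dnf info gperftools-libs

# In the glusterfs spec file the hard dependency would roughly look like
# (illustrative only, assumed subpackage names):
#   BuildRequires: gperftools-devel
#   Requires:      gperftools-libs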
(In reply to SATHEESARAN from comment #4)
> After conversation with Mohit, I understood that 'tcmalloc' requires
> 'gperftools' package.
> This particular package is not available in RHGS specific repos both in RHEL
> 7 and RHEL 8.
>
> In the case of RHEL 7, RHEL 7 server repo contains this package 'gperftools'
> but as I
> checked in RHEL 8 this package is not available with baseos or appstream.
>
> In the case of using tcmalloc, glusterfs should have a hard dependency of
> this new package - gperftools.
>
> So glusterfs package should have a hard dependency on 'gperftools' package
> both in RHEL 7 and RHEL 8.
>
> On the other hand, as per RHGS Batch update model[1] understanding we agreed
> not to include new packages.

It's not a new package; it has been used for years by the Ceph team.

>
> @Sunil, What are your thoughts ?
>
>
> [1] -
> https://docs.google.com/document/d/1KvdyoI8-
> BNJJuADBkN0OVTW4sMZ4dX6LqxUXPJtK8ME/edit
(In reply to SATHEESARAN from comment #4)
> After conversation with Mohit, I understood that 'tcmalloc' requires
> 'gperftools' package.
> This particular package is not available in RHGS specific repos both in RHEL
> 7 and RHEL 8.
>
> In the case of RHEL 7, RHEL 7 server repo contains this package 'gperftools'
> but as I
> checked in RHEL 8 this package is not available with baseos or appstream.
>
> In the case of using tcmalloc, glusterfs should have a hard dependency of
> this new package - gperftools.
>
> So glusterfs package should have a hard dependency on 'gperftools' package
> both in RHEL 7 and RHEL 8.

Yes, we need to include this new package. This was brought up during the program call, and ticket CLOUDBLD-8110 has been raised to track it.

>
> On the other hand, as per RHGS Batch update model[1] understanding we agreed
> not to include new packages.
>
> @Sunil, What are your thoughts ?
>
>
> [1] -
> https://docs.google.com/document/d/1KvdyoI8-
> BNJJuADBkN0OVTW4sMZ4dX6LqxUXPJtK8ME/edit
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (glusterfs bug fix update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2022:4840