1475721 – 'gluster volume profile' shouldn't cause negative impact on volume performance

Bug 1475721 - 'gluster volume profile' shouldn't cause negative impact on volume performance

Summary: 'gluster volume profile' shouldn't cause negative impact on volume performance

Keywords:
Status:	CLOSED ERRATA
Alias:	None
Product:	Red Hat Gluster Storage
Classification:	Red Hat Storage
Component:	core
Sub Component:
Version:	unspecified
Hardware:	Unspecified
OS:	Linux
Priority:	unspecified
Severity:	high
Target Milestone:	---
Target Release:	RHGS 3.3.1
Assignee:	Krutika Dhananjay
QA Contact:	Karan Sandha
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:	1475687
TreeView+	depends on / blocked

Reported:	2017-07-27 08:36 UTC by Amar Tumballi
Modified:	2017-11-29 03:29 UTC (History)
CC List:	7 users (show)
Fixed In Version:	glusterfs-3.8.4-45
Doc Type:	If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed:	2017-11-29 03:29:14 UTC
Embargoed:
Dependent Products:

Attachments	(Terms of Use)

Links
System	ID	Private	Priority	Status	Summary	Last Updated
Red Hat Product Errata	RHBA-2017:3276	0	normal	SHIPPED_LIVE	glusterfs bug fix update	2017-11-29 08:28:52 UTC

Description Amar Tumballi 2017-07-27 08:36:52 UTC

Description of problem:
Till now, `gluster volume profile` command is recommended to be used in a production system only when there are issues, and there is a need to understand the I/O pattern. The major reason was the impact on volume performance because of enabling the option.


Version-Release number of selected component (if applicable):
3.3.0

How reproducible:
- this is an hypothesis - 


Actual results:
* Recommended only in few cases and not in actual production setup.

Expected results:
* Always have the option enabled, and use the metrics from filesystem to show the performance pattern over time.

Additional info:

Gluster volume profile can provide a lot of internal information on gluster's performance, and this can be used to plot metrics pretty well.

Comment 4 Krutika Dhananjay 2017-08-03 11:12:21 UTC

I don't have context on this issue except for knowing that we plan to enable profile by default and analyse the perf data. Do we know or have evidence to support the claim that enabling volume profiling does have a negative impact on performance?

-Krutika

Comment 5 Amar Tumballi 2017-08-03 11:37:09 UTC

>  we plan to enable profile by default and analyse the perf data.

Yes, above is the context for this bug to exist.

>  Do we know or have evidence to support the claim that enabling volume profiling does have a negative impact on performance?

Nothing where it was 'analyzed' before. Mostly hypothesis:

1. We don't turn it on by default, but ask customers to give the output when they get into perf issues. 
2. Inside code, if 'measure_latency' flag is set, it is a gettimeofday() syscall for every layer of translator for every fop().
3. Earlier in 'debug/io-stats' translator, we had xlator->lock which was used to update all the counters, which surely caused locking contentions at io-stats.

Hence, we want a performance run with profiling on, so we can see how much we regress, and see if there are anything more to enhance it further.

Comment 7 Atin Mukherjee 2017-09-04 09:54:49 UTC

downstream patch : https://code.engineering.redhat.com/gerrit/#/c/117185

Comment 11 errata-xmlrpc 2017-11-29 03:29:14 UTC

Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2017:3276

Note You need to log in before you can comment on or make changes to this bug.