1081495 – [Nagios] Status Information of Disk Utilization should show the current disk usage status as well

Bug 1081495 - [Nagios] Status Information of Disk Utilization should show the current disk usage status as well

Summary: [Nagios] Status Information of Disk Utilization should show the current disk ...

Keywords:
Status:	CLOSED CANTFIX
Alias:	None
Product:	Red Hat Gluster Storage
Classification:	Red Hat Storage
Component:	nagios-server-addons
Sub Component:
Version:	rhgs-3.0
Hardware:	Unspecified
OS:	Unspecified
Priority:	medium
Severity:	medium
Target Milestone:	---
Target Release:	---
Assignee:	Timothy Asir
QA Contact:	RHS-C QE
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+	depends on / blocked

Reported:	2014-03-27 13:00 UTC by Prasanth
Modified:	2018-08-14 11:15 UTC (History)
CC List:	6 users (show)
Fixed In Version:	nagios-server-addons-0.1.3-2.el6rhs, gluster-nagios-addons-0.1.3-1.el6rhs
Doc Type:	Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed:	2018-01-30 07:55:00 UTC
Embargoed:
Dependent Products:

Attachments	(Terms of Use)
screenshot (179.62 KB, image/png) 2014-07-14 07:55 UTC, Shruti Sampat	no flags	Details
View All

Description Prasanth 2014-03-27 13:00:28 UTC

Description of problem:

Status Information of Disk Utilization should show the actual disk usage status as well

Version-Release number of selected component (if applicable):
gluster-nagios-1.1-1.noarch.rpm 
gluster-nrpe-1.1-1.x86_64.rpm

How reproducible: 100%


Steps to Reproduce:
1.Add few RHS nodes and see the Disk Utilization in the Status Information
2.
3.

Actual results: It lists only the disk names as given below

----
OK : disks:mounts:(/dev/sda1=/boot,/dev/mapper/vg_rhsclient16-lv_root=/,/dev/mapper/vg_rhsclient16-lv_home=/home) 
----


Expected results: It should also show the current disk utilization of the disks listed similar to the ones seen under CPU, Memory and Swap Utilization status


Additional info:

Comment 2 Dusmant 2014-05-30 04:31:13 UTC

As discussed on 29-May-2014 : Though most of the team felt it's not a blocker, PM had input that considering it's related to storage, he would like to make it a blocker. Had it been for some other entity like CPU or memory, it would not have been a blocker. Marked blocker.

Comment 3 Timothy Asir 2014-05-30 06:34:14 UTC

Disk utilization percentage is already available in the performance data and we have recently sent a patch to added an associated unit to the value. Adding anything more into status would cause increasing the status message length when the disk count is increasing in a node. Whereas same is not a case with cpu and memory.

Comment 4 Timothy Asir 2014-05-30 09:32:31 UTC

Whenever a disk or inode reaches the threshold we are showing the fs, path and the usage% also.

@Prasanth can you copy here the latest status output of disk usage

Comment 5 Timothy Asir 2014-05-30 09:55:34 UTC

Currently it does not show the usage% only for the disks status "OK"
Whenever a disk/inode reaches the threshold, it will immediately changes the status into "WARNING or CRITICAL" and will show the usage % also in the status message itself.

Comment 6 Timothy Asir 2014-05-30 13:33:57 UTC

patch sent to upstream for review: http://review.gluster.org/#/c/7936/

Comment 8 Timothy Asir 2014-06-19 11:09:02 UTC

use of -i option in the plugin

-i MOUNTPATH, --include=MOUNTPATH
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Using this option one can add any disk (which is not listed for any reason)
along with an existing list. Suppose if the disk is added twice/thrice/anytime
unknowingly (due to huge list) the plugin will simply omits duplications.

Comment 9 Timothy Asir 2014-06-20 07:47:16 UTC

(In reply to Timothy Asir from comment #8)
> use of -i option in the plugin
> 
> -i MOUNTPATH, --include=MOUNTPATH
> ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
> Using this option one can add any disk (which is not listed for any reason)
> along with an existing list. Suppose if the disk is added
> twice/thrice/anytime
> unknowingly (due to huge list) the plugin will simply omits duplications.

We can update the config file /etc/nagios/nrpe.cfg to use this option to include any extra disk to be monitored under section: "### START - configuration section for gluster nrpe plugins ###"

Comment 11 Prasanth 2014-06-20 11:25:28 UTC

However, following is what I'm seeing in the "Status Information" of "Disk Utilization" on trying to verify this bug:

----
WARNING : 19.2% used (19.0GB out of 99.0GB) 

CRITICAL : 20.2% used (20.0GB out of 99.0GB) 
----

As you can see, I'm NOT seeing the "disks:mounts" along with the WARNING and CRITICAL state of disk-utilization as mentioned by Dusmant on the closire of this bug. Is that a miss or am I missing something here? Please clarify.

I'll attach the screenshots for the same.

Moving this bug back to Assigned as it failed to verify.

Comment 12 Dusmant 2014-06-20 14:18:42 UTC

Removing the blocker flag from this, after talking to PMs and Eric. We need to document this. Target it for 3.0.1 release.

Comment 15 Timothy Asir 2014-06-24 12:37:42 UTC

(In reply to Prasanth from comment #11)
> However, following is what I'm seeing in the "Status Information" of "Disk
> Utilization" on trying to verify this bug:
> 
> ----
> WARNING : 19.2% used (19.0GB out of 99.0GB) 
> 
> CRITICAL : 20.2% used (20.0GB out of 99.0GB) 
> ----
> 
> As you can see, I'm NOT seeing the "disks:mounts" along with the WARNING and
> CRITICAL state of disk-utilization as mentioned by Dusmant on the closire of
> this bug. Is that a miss or am I missing something here? Please clarify.
> 
> I'll attach the screenshots for the same.
> 
> Moving this bug back to Assigned as it failed to verify.

It is there in  Status Information only but it will be displayed only in the (detail view) Service state information page. We have added this feature to provide single line details as a short summery.

Comment 16 Shruti Sampat 2014-07-14 07:55:08 UTC

Hi,

I noticed that the disk utilization performance data showed two sets of figures for each disk -

/boot=8.00%;80;90;0;1.0 /boot=1.00%;80;90;0;128016.0 /=15.00%;80;90;0;18.0 /=4.00%;80;90;0;1148304.0 /rhs/brick1=40.00%;80;90;0;50.0 /rhs/brick1=1.00%;80;90;0;26212352.0 /rhs/brick2=40.00%;80;90;0;50.0 /rhs/brick2=1.00%;80;90;0;26212352.0 /rhs/brick3=1.00%;80;90;0;50.0 /rhs/brick3=1.00%;80;90;0;26212352.0 /rhs/brick4=91.00%;80;90;0;50.0 /rhs/brick4=1.00%;80;90;0;10179536.0 /rhs/brick5=1.00%;80;90;0;50.0 /rhs/brick5=1.00%;80;90;0;26212352.0 /rhs/brick6=1.00%;80;90;0;50.0 /rhs/brick6=1.00%;80;90;0;26212352.0 /rhs/brick7=1.00%;80;90;0;50.0 /rhs/brick7=1.00%;80;90;0;26212352.0 /rhs/brick8=1.00%;80;90;0;50.0 /rhs/brick8=1.00%;80;90;0;26212352.0

Also see screenshot attached.

Is this behavior expected?

Comment 17 Shruti Sampat 2014-07-14 07:55:30 UTC

Created attachment 917698 [details]
screenshot

Comment 18 Timothy Asir 2014-07-18 10:11:26 UTC

yes one is for disk and another one is for inode.

Comment 19 Shruti Sampat 2014-07-18 10:50:43 UTC

Hi,

I think it should be documented what each set of information denotes, as it is not clear by looking at the UI.

Comment 21 Timothy Asir 2014-09-24 05:58:47 UTC

Its already documented in rhsc admin guide and rhs admin guide in
"Using Nagios GUI" section.

Comment 23 Sahina Bose 2018-01-30 07:55:00 UTC

Thank you for the bug report. However, closing this as the bug is filed against gluster nagios monitoring for which no further new development is being undertaken.

Note You need to log in before you can comment on or make changes to this bug.