Bug 1081495 - [Nagios] Status Information of Disk Utilization should show the current disk usage status as well
Summary: [Nagios] Status Information of Disk Utilization should show the current disk ...
Keywords:
Status: CLOSED CANTFIX
Alias: None
Product: Red Hat Gluster Storage
Classification: Red Hat Storage
Component: nagios-server-addons
Version: rhgs-3.0
Hardware: Unspecified
OS: Unspecified
medium
medium
Target Milestone: ---
: ---
Assignee: Timothy Asir
QA Contact: RHS-C QE
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2014-03-27 13:00 UTC by Prasanth
Modified: 2018-08-14 11:15 UTC (History)
6 users (show)

Fixed In Version: nagios-server-addons-0.1.3-2.el6rhs, gluster-nagios-addons-0.1.3-1.el6rhs
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed: 2018-01-30 07:55:00 UTC
Embargoed:


Attachments (Terms of Use)
screenshot (179.62 KB, image/png)
2014-07-14 07:55 UTC, Shruti Sampat
no flags Details

Description Prasanth 2014-03-27 13:00:28 UTC
Description of problem:

Status Information of Disk Utilization should show the actual disk usage status as well

Version-Release number of selected component (if applicable):
gluster-nagios-1.1-1.noarch.rpm 
gluster-nrpe-1.1-1.x86_64.rpm

How reproducible: 100%


Steps to Reproduce:
1.Add few RHS nodes and see the Disk Utilization in the Status Information
2.
3.

Actual results: It lists only the disk names as given below

----
OK : disks:mounts:(/dev/sda1=/boot,/dev/mapper/vg_rhsclient16-lv_root=/,/dev/mapper/vg_rhsclient16-lv_home=/home) 
----


Expected results: It should also show the current disk utilization of the disks listed similar to the ones seen under CPU, Memory and Swap Utilization status


Additional info:

Comment 2 Dusmant 2014-05-30 04:31:13 UTC
As discussed on 29-May-2014 : Though most of the team felt it's not a blocker, PM had input that considering it's related to storage, he would like to make it a blocker. Had it been for some other entity like CPU or memory, it would not have been a blocker. Marked blocker.

Comment 3 Timothy Asir 2014-05-30 06:34:14 UTC
Disk utilization percentage is already available in the performance data and we have recently sent a patch to added an associated unit to the value. Adding anything more into status would cause increasing the status message length when the disk count is increasing in a node. Whereas same is not a case with cpu and memory.

Comment 4 Timothy Asir 2014-05-30 09:32:31 UTC
Whenever a disk or inode reaches the threshold we are showing the fs, path and the usage% also.

@Prasanth can you copy here the latest status output of disk usage

Comment 5 Timothy Asir 2014-05-30 09:55:34 UTC
Currently it does not show the usage% only for the disks status "OK"
Whenever a disk/inode reaches the threshold, it will immediately changes the status into "WARNING or CRITICAL" and will show the usage % also in the status message itself.

Comment 6 Timothy Asir 2014-05-30 13:33:57 UTC
patch sent to upstream for review: http://review.gluster.org/#/c/7936/

Comment 8 Timothy Asir 2014-06-19 11:09:02 UTC
use of -i option in the plugin

-i MOUNTPATH, --include=MOUNTPATH
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Using this option one can add any disk (which is not listed for any reason)
along with an existing list. Suppose if the disk is added twice/thrice/anytime
unknowingly (due to huge list) the plugin will simply omits duplications.

Comment 9 Timothy Asir 2014-06-20 07:47:16 UTC
(In reply to Timothy Asir from comment #8)
> use of -i option in the plugin
> 
> -i MOUNTPATH, --include=MOUNTPATH
> ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
> Using this option one can add any disk (which is not listed for any reason)
> along with an existing list. Suppose if the disk is added
> twice/thrice/anytime
> unknowingly (due to huge list) the plugin will simply omits duplications.

We can update the config file /etc/nagios/nrpe.cfg to use this option to include any extra disk to be monitored under section: "### START - configuration section for gluster nrpe plugins ###"

Comment 11 Prasanth 2014-06-20 11:25:28 UTC
However, following is what I'm seeing in the "Status Information" of "Disk Utilization" on trying to verify this bug:

----
WARNING : 19.2% used (19.0GB out of 99.0GB) 

CRITICAL : 20.2% used (20.0GB out of 99.0GB) 
----

As you can see, I'm NOT seeing the "disks:mounts" along with the WARNING and CRITICAL state of disk-utilization as mentioned by Dusmant on the closire of this bug. Is that a miss or am I missing something here? Please clarify.

I'll attach the screenshots for the same.

Moving this bug back to Assigned as it failed to verify.

Comment 12 Dusmant 2014-06-20 14:18:42 UTC
Removing the blocker flag from this, after talking to PMs and Eric. We need to document this. Target it for 3.0.1 release.

Comment 15 Timothy Asir 2014-06-24 12:37:42 UTC
(In reply to Prasanth from comment #11)
> However, following is what I'm seeing in the "Status Information" of "Disk
> Utilization" on trying to verify this bug:
> 
> ----
> WARNING : 19.2% used (19.0GB out of 99.0GB) 
> 
> CRITICAL : 20.2% used (20.0GB out of 99.0GB) 
> ----
> 
> As you can see, I'm NOT seeing the "disks:mounts" along with the WARNING and
> CRITICAL state of disk-utilization as mentioned by Dusmant on the closire of
> this bug. Is that a miss or am I missing something here? Please clarify.
> 
> I'll attach the screenshots for the same.
> 
> Moving this bug back to Assigned as it failed to verify.

It is there in  Status Information only but it will be displayed only in the (detail view) Service state information page. We have added this feature to provide single line details as a short summery.

Comment 16 Shruti Sampat 2014-07-14 07:55:08 UTC
Hi,

I noticed that the disk utilization performance data showed two sets of figures for each disk -

/boot=8.00%;80;90;0;1.0 /boot=1.00%;80;90;0;128016.0 /=15.00%;80;90;0;18.0 /=4.00%;80;90;0;1148304.0 /rhs/brick1=40.00%;80;90;0;50.0 /rhs/brick1=1.00%;80;90;0;26212352.0 /rhs/brick2=40.00%;80;90;0;50.0 /rhs/brick2=1.00%;80;90;0;26212352.0 /rhs/brick3=1.00%;80;90;0;50.0 /rhs/brick3=1.00%;80;90;0;26212352.0 /rhs/brick4=91.00%;80;90;0;50.0 /rhs/brick4=1.00%;80;90;0;10179536.0 /rhs/brick5=1.00%;80;90;0;50.0 /rhs/brick5=1.00%;80;90;0;26212352.0 /rhs/brick6=1.00%;80;90;0;50.0 /rhs/brick6=1.00%;80;90;0;26212352.0 /rhs/brick7=1.00%;80;90;0;50.0 /rhs/brick7=1.00%;80;90;0;26212352.0 /rhs/brick8=1.00%;80;90;0;50.0 /rhs/brick8=1.00%;80;90;0;26212352.0

Also see screenshot attached.

Is this behavior expected?

Comment 17 Shruti Sampat 2014-07-14 07:55:30 UTC
Created attachment 917698 [details]
screenshot

Comment 18 Timothy Asir 2014-07-18 10:11:26 UTC
yes one is for disk and another one is for inode.

Comment 19 Shruti Sampat 2014-07-18 10:50:43 UTC
Hi,

I think it should be documented what each set of information denotes, as it is not clear by looking at the UI.

Comment 21 Timothy Asir 2014-09-24 05:58:47 UTC
Its already documented in rhsc admin guide and rhs admin guide in
"Using Nagios GUI" section.

Comment 23 Sahina Bose 2018-01-30 07:55:00 UTC
Thank you for the bug report. However, closing this as the bug is filed against gluster nagios monitoring for which no further new development is being undertaken.


Note You need to log in before you can comment on or make changes to this bug.