Bug 886471 - [RHSC] Brick status shows 'UP' even after brick process is killed.
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Gluster Storage
Classification: Red Hat Storage
Component: rhsc
Version: 2.0
Hardware: Unspecified
OS: Unspecified
Priority: high
Severity: high
Target Milestone: ---
Target Release: ---
Assignee: Vinayaga Raman
QA Contact: Shruti Sampat
URL:
Whiteboard:
Depends On:
Blocks:
Reported: 2012-12-12 11:16 UTC by Shruti Sampat
Modified: 2014-03-31 01:29 UTC (History)

Fixed In Version: qa4
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed: 2013-09-23 22:25:30 UTC
Embargoed:


Attachments
engine logs (2.87 MB, text/x-log)
2012-12-12 11:16 UTC, Shruti Sampat

Description Shruti Sampat 2012-12-12 11:16:26 UTC
Created attachment 662289
engine logs

Description of problem:
---------------------------------------
When a brick that is part of a volume is brought down by killing the brick process, the brick status on the Console still shows 'UP'.

The following is seen in the engine logs - 
---------------------------------------
2012-12-12 09:46:01,420 INFO  [org.ovirt.engine.core.vdsbroker.gluster.GetGlusterVolumeAdvancedDetailsVDSCommand] (QuartzScheduler_Worker-29) START, GetGlusterVolumeAdvancedDetailsVDSCommand(HostName = 10.70.35.13, HostId = 62b0de10-703f-4456-9dcc-174284ae996e), log id: 2c20be69
2012-12-12 09:46:01,563 ERROR [org.ovirt.engine.core.vdsbroker.vdsbroker.VdsBrokerCommand] (QuartzScheduler_Worker-29) Failed in GetGlusterVolumeAdvancedDetailsVDS method, for vds: 10.70.35.13; host: 10.70.35.13
2012-12-12 09:46:01,563 ERROR [org.ovirt.engine.core.vdsbroker.VDSCommandBase] (QuartzScheduler_Worker-29) Command GetGlusterVolumeAdvancedDetailsVDS execution failed. Exception: NumberFormatException: For input string: "N/A"
2012-12-12 09:46:01,563 INFO  [org.ovirt.engine.core.vdsbroker.gluster.GetGlusterVolumeAdvancedDetailsVDSCommand] (QuartzScheduler_Worker-29) FINISH, GetGlusterVolumeAdvancedDetailsVDSCommand, log id: 2c20be69
2012-12-12 09:46:01,563 ERROR [org.ovirt.engine.core.bll.gluster.GlusterManager] (QuartzScheduler_Worker-29) Error while refreshing brick statuses for volume test of cluster cluster1: org.ovirt.engine.core.common.errors.VdcBLLException: VdcBLLException: java.lang.NumberFormatException: For input string: "N/A"
        at org.ovirt.engine.core.bll.VdsHandler.handleVdsResult(VdsHandler.java:214) [engine-bll.jar:]
        at org.ovirt.engine.core.bll.VDSBrokerFrontendImpl.RunVdsCommand(VDSBrokerFrontendImpl.java:33) [engine-bll.jar:]
        at org.ovirt.engine.core.bll.gluster.GlusterManager.runVdsCommand(GlusterManager.java:216) [engine-bll.jar:]
        at org.ovirt.engine.core.bll.gluster.GlusterManager.getVolumeAdvancedDetails(GlusterManager.java:823) [engine-bll.jar:]
        at org.ovirt.engine.core.bll.gluster.GlusterManager.refreshBrickStatuses(GlusterManager.java:796) [engine-bll.jar:]
        at org.ovirt.engine.core.bll.gluster.GlusterManager.refreshClusterHeavyWeightData(GlusterManager.java:783) [engine-bll.jar:]
        at org.ovirt.engine.core.bll.gluster.GlusterManager.refreshHeavyWeightData(GlusterManager.java:761) [engine-bll.jar:]
        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) [rt.jar:1.7.0_09-icedtea]
        at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57) [rt.jar:1.7.0_09-icedtea]
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) [rt.jar:1.7.0_09-icedtea]
        at java.lang.reflect.Method.invoke(Method.java:601) [rt.jar:1.7.0_09-icedtea]
        at org.ovirt.engine.core.utils.timer.JobWrapper.execute(JobWrapper.java:60) [engine-scheduler.jar:]
        at org.quartz.core.JobRunShell.run(JobRunShell.java:213) [quartz-2.1.2.jar:]
        at org.quartz.simpl.SimpleThreadPool$WorkerThread.run(SimpleThreadPool.java:557) [quartz-2.1.2.jar:]


Version-Release number of selected component (if applicable):
Red Hat Storage Console Version: 2.1-qa18.el6ev 
vdsm version: vdsm-4.9.6-32.0.qa3.el6rhs.x86_64

How reproducible:
Always

Steps to Reproduce:
1. Create a volume
2. Start the volume.
3. Kill one of the brick processes on the storage node.
  
Actual results:
The brick status on the Console continues to show 'UP'

Expected results:
The brick status should show 'DOWN'

Additional info:
Output of 'gluster volume status' command on the storage node is - 
---------------------------------------

[root@localhost ~]# gluster volume status
Status of volume: test
Gluster process						Port	Online	Pid
------------------------------------------------------------------------------
Brick 10.70.35.13:/opt/gluster/bricks/testBrick		N/A	N	N/A
NFS Server on localhost					38467	Y	2778
NFS Server on 10.70.35.28				38467	Y	9682
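
For illustration only: the output above reports the Port and Pid of the killed brick as "N/A", and the engine log shows a NumberFormatException for the input string "N/A". One plausible reading is that a numeric field was parsed without allowing for this placeholder. The Java sketch below shows a tolerant way to read such fields. It is not the upstream patch, and the class and method names (BrickFieldParser, parseNumericField, isOnline) are hypothetical and do not come from the oVirt engine code.

```java
import java.util.OptionalInt;

// Hypothetical helper, not part of the oVirt engine code base.
final class BrickFieldParser {
    private BrickFieldParser() {}

    /**
     * Parses a numeric status field such as Port or Pid.
     * Returns an empty OptionalInt for null, blank, "N/A", or any
     * other non-numeric value instead of throwing.
     */
    static OptionalInt parseNumericField(String raw) {
        if (raw == null) {
            return OptionalInt.empty();
        }
        String trimmed = raw.trim();
        if (trimmed.isEmpty() || trimmed.equalsIgnoreCase("N/A")) {
            return OptionalInt.empty();
        }
        try {
            return OptionalInt.of(Integer.parseInt(trimmed));
        } catch (NumberFormatException e) {
            // Unexpected text: treat as "no value" rather than failing the refresh.
            return OptionalInt.empty();
        }
    }

    /** Interprets the Online column: "Y" means the process is running. */
    static boolean isOnline(String onlineFlag) {
        return onlineFlag != null && onlineFlag.trim().equalsIgnoreCase("Y");
    }

    public static void main(String[] args) {
        // Values taken from the 'gluster volume status' output above.
        System.out.println(parseNumericField("N/A").isPresent());   // prints false
        System.out.println(parseNumericField("38467").getAsInt());  // prints 38467
        System.out.println(isOnline("N"));                          // prints false
    }
}
```

With parsing of this kind, a brick whose Port and Pid are "N/A" would still yield a usable status record, so the Online flag could be used to mark it DOWN instead of aborting the whole status refresh.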

Comment 2 Dhandapani 2012-12-13 10:58:14 UTC
Patch sent upstream: http://gerrit.ovirt.org/#/c/10022/

Comment 3 Shruti Sampat 2013-01-11 09:34:20 UTC
Verified in Red Hat Storage Console Version: 2.1-0.qa4d.el6rhs. The brick status on the Console now correctly reflects the state of the brick whenever it is brought down or up.

Comment 5 Scott Haines 2013-09-23 22:25:30 UTC
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. 

For information on the advisory, and where to find the updated files, follow the link below.

If the solution does not work for you, open a new bug report.

http://rhn.redhat.com/errata/RHBA-2013-1262.html

