Bug 1207150 - [New] - Host status does not move to non-operational when glusterd is down.
Summary: [New] - Host status does not move to non-operational when glusterd is down.
Keywords:
Status: CLOSED CURRENTRELEASE
Alias: None
Product: oVirt
Classification: Retired
Component: ovirt-engine-webadmin
Version: 3.6
Hardware: Unspecified
OS: Unspecified
high
high
Target Milestone: ---
: 3.5.3
Assignee: Darshan
QA Contact: RamaKasturi
URL:
Whiteboard: gluster
Depends On:
Blocks: 1184376 1184380 rhsc_qe_tracker_everglades 1203154
TreeView+ depends on / blocked
 
Reported: 2015-03-30 10:23 UTC by RamaKasturi
Modified: 2016-02-10 19:29 UTC (History)
13 users (show)

Fixed In Version: vdsm-4.16.16-1.el6ev
Clone Of:
Environment:
Last Closed: 2015-06-15 08:39:10 UTC
oVirt Team: Gluster
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
oVirt gerrit 39573 0 master MERGED gluster: Handle gluster exceptions in json rpc mode. Never
oVirt gerrit 39788 0 ovirt-3.5 MERGED gluster: Handle gluster exceptions in json rpc mode. Never
oVirt gerrit 39937 0 ovirt-3.5-gluster ABANDONED gluster: Handle gluster exceptions in json rpc mode. Never

Description RamaKasturi 2015-03-30 10:23:40 UTC
Description of problem:
When glusterd is down on one of the node in the cluster host should go to non operational which is not happening as of now. Host status is always shown as UP even when glusterd is down on that node.

Version-Release number of selected component (if applicable):
ovirt-engine-webadmin-portal-3.6.0-0.0.master.20150329172249.git660d494.el6.noarch

How reproducible:
Always

Steps to Reproduce:
1. Add gluster nodes to ovirt
2. Now stop glusterd in one the node in the cluster.
3. 

Actual results:
Ovirt still shows the node with up status and node does not go to non-operational.

Expected results:
When glusterd is down on the node, host status in UI should show it as "non-operational".

Additional info:

Comment 1 Ala Hino 2015-03-31 08:51:45 UTC
Please note that glusterd is the management daemon and it's job is to look for other peers and distribute information across nodes. 
Once a client is connected to a gluster volume, it ends up talking directly to the brick processes - i.e glusterfsd processes. There is a glusterfsd process per volume.
If you want to simulate one of the gluster nodes being unreachable from gluster perspective, shut down glusterd and kill the glusterfsd processes.

Comment 2 Ramesh N 2015-04-02 10:18:54 UTC
This happens only in the case when JSON RPC is used for communication. Exception handling is different in JSON RPC and currently its not handled properly for gluster related exception. As result, always 'JsonRpcInternalError"(Code: -32603) is returned from VDSM for any exception.

Comment 3 Max Kovgan 2015-04-14 08:00:46 UTC
accidentally caught this with our script and moved it to "ON_QA"
reverting back to how it was.

Comment 4 RamaKasturi 2015-05-19 07:18:25 UTC
Verified and works fine with build ovirt-engine-3.6.0-0.0.master.20150517172245.git089e126.el6.noarch.

When host is added to ovirt in json/xml rpc mode and if glusterd is down on that node ovirt moves the node to non operational.

Comment 5 Sandro Bonazzola 2015-06-15 08:39:10 UTC
This is an automated message.
oVirt 3.5.3 has been released on June 15th 2015 and should include the fix for this BZ. Moving to closed current release.


Note You need to log in before you can comment on or make changes to this bug.