Bug 1152877 - gluster bricks marked down in ovirt after vdsm restarted
Summary: gluster bricks marked down in ovirt after vdsm restarted
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Gluster Storage
Classification: Red Hat Storage
Component: rhsc
Version: rhgs-3.0
Hardware: x86_64
OS: Linux
Priority: high
Severity: high
Target Milestone: ---
Target Release: RHGS 3.0.3
Assignee: Sahina Bose
QA Contact: Shruti Sampat
URL:
Whiteboard: gluster
Duplicates: 1152882 (view as bug list)
Depends On: 1103973 1152882
Blocks:
 
Reported: 2014-10-15 06:38 UTC by Sahina Bose
Modified: 2015-05-13 17:42 UTC (History)
CC List: 13 users

Fixed In Version: rhsc-3.0.2-1.16.el6rhs.noarch.rpm
Doc Type: Bug Fix
Doc Text:
Previously, when a host had multiple network addresses, the system failed to identify the brick correctly from the output of the 'gluster volume status' command. As a result, the brick status appeared to be offline after a node restart even though the bricks were up. With this fix, bricks are identified by the host UUID, and brick statuses are displayed correctly.
Clone Of: 1103973
Environment:
Last Closed: 2015-01-15 13:49:51 UTC
Embargoed:




Links
System ID Private Priority Status Summary Last Updated
Red Hat Product Errata RHBA-2015:0039 0 normal SHIPPED_LIVE Red Hat Storage Console 3.0 enhancement and bug fix update #3 2015-01-15 18:46:40 UTC
oVirt gerrit 33614 0 None None None Never
oVirt gerrit 33631 0 None None None Never

Description Sahina Bose 2014-10-15 06:38:55 UTC
+++ This bug was initially created as a clone of Bug #1103973 +++

Description of problem:
Restarting vdsm on a gluster node causes the bricks in replicated volumes on the affected node to be marked as down even after vdsm comes back up. Gluster reports that the bricks and volumes are fine. Only stopping and restarting the volume fixes the issue in the oVirt console.

Version-Release number of selected component (if applicable):
engine node:
ovirt-engine-cli-3.4.0.5-1.fc19.noarch
ovirt-engine-userportal-3.4.1-1.fc19.noarch
ovirt-engine-3.4.1-1.fc19.noarch
ovirt-engine-setup-plugin-ovirt-engine-3.4.1-1.fc19.noarch
ovirt-engine-setup-base-3.4.1-1.fc19.noarch
ovirt-release34-1.0.1-1.noarch
ovirt-engine-sdk-python-3.4.1.1-1.fc19.noarch
ovirt-engine-setup-plugin-ovirt-engine-common-3.4.1-1.fc19.noarch
ovirt-engine-webadmin-portal-3.4.1-1.fc19.noarch
ovirt-log-collector-3.4.2-1.fc19.noarch
ovirt-host-deploy-java-1.2.1-1.fc19.noarch
ovirt-engine-websocket-proxy-3.4.1-1.fc19.noarch
ovirt-iso-uploader-3.4.1-1.fc19.noarch
ovirt-engine-restapi-3.4.1-1.fc19.noarch
ovirt-engine-tools-3.4.1-1.fc19.noarch
ovirt-engine-setup-plugin-websocket-proxy-3.4.1-1.fc19.noarch
ovirt-host-deploy-1.2.1-1.fc19.noarch
ovirt-engine-lib-3.4.1-1.fc19.noarch
ovirt-engine-setup-3.4.1-1.fc19.noarch
ovirt-engine-dbscripts-3.4.1-1.fc19.noarch
ovirt-image-uploader-3.4.1-1.fc19.noarch
libgovirt-0.1.0-1.fc19.x86_64
ovirt-engine-backend-3.4.1-1.fc19.noarch
glusterfs-3.5.0-3.fc19.x86_64
glusterfs-api-3.5.0-3.fc19.x86_64
glusterfs-fuse-3.5.0-3.fc19.x86_64
glusterfs-libs-3.5.0-3.fc19.x86_64

gluster node:
glusterfs-server-3.5.0-2.el6.x86_64
glusterfs-api-3.5.0-2.el6.x86_64
glusterfs-3.5.0-2.el6.x86_64
glusterfs-fuse-3.5.0-2.el6.x86_64
glusterfs-rdma-3.5.0-2.el6.x86_64
glusterfs-libs-3.5.0-2.el6.x86_64
glusterfs-cli-3.5.0-2.el6.x86_64
vdsm-4.14.8.1-0.el6.x86_64
vdsm-python-zombiereaper-4.14.8.1-0.el6.noarch
vdsm-gluster-4.14.8.1-0.el6.noarch
vdsm-python-4.14.8.1-0.el6.x86_64
vdsm-cli-4.14.8.1-0.el6.noarch
vdsm-xmlrpc-4.14.8.1-0.el6.noarch


How reproducible:
Always

Steps to Reproduce:
1. On gluster0, stop vdsm.
2. In the oVirt console, the bricks become unavailable.
3. On gluster0, start vdsm.
4. In the oVirt console, the bricks remain unavailable.
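
(For reference, a minimal shell transcript of these steps, assuming an EL6 gluster node where vdsm runs as the vdsmd init service - the service name is an assumption based on the package versions above:)

# on gluster0, as root:
service vdsmd stop
gluster volume status        # gluster itself still reports the bricks as ONLINE
# the oVirt console now shows the bricks as unavailable
service vdsmd start
# expected: bricks return to up in the console; actual: they remain unavailable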

Actual results:
The oVirt admin console continues to show the bricks as down even though gluster is healthy and unaffected.

Expected results:
The bricks should show as down while vdsm is offline and then be shown as up when vdsm comes back up.

Additional info:

I have a VM store volume to which I cannot add new VM disks because all bricks are marked down. I cannot restart it because I have active VMs - and I should not have to restart it; this is not a gluster issue.

--- Additional comment from Sahina Bose on 2014-07-08 04:15:29 EDT ---

Darshan, can you check what the glusterVolumesList vdsm command returns? If it returns the brick status correctly, it may be an engine issue.
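
(For reference, the vdsm gluster verbs can be queried directly on the node with vdsClient, assuming vdsm-cli and vdsm-gluster are installed as in the package list above - the second form matches the invocations shown later in this bug:)

$ vdsClient -s localhost glusterVolumesList
$ vdsClient -s localhost glusterVolumeStatus volumeName=<volume>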

--- Additional comment from Darshan on 2014-07-08 06:39:28 EDT ---

The glusterVolumeStatus command is returning the brick status correctly.

--- Additional comment from Ludek Finstrle on 2014-09-29 12:28:19 EDT ---

Is there any progress/workaround? I see exactly the same problem with oVirt 3.4.4 + gluster 3.5.2:

engine: ovirt-engine-3.4.4-1.el6.noarch
hosts: vdsm-4.14.17-0.el6.x86_64, vdsm-gluster-4.14.17-0.el6.noarch, glusterfs-server-3.5.2-1.el6.x86_64

--- Additional comment from Ludek Finstrle on 2014-09-29 12:45:49 EDT ---

How is the hostname in glusterVolumeStatus gathered?
I have multiple NICs in the host machines (gluster listening on 0.0.0.0) with:
vm1.lab:
eth1: 192.168.254.1/30
ovirtmgmt (bridge on eth2): 192.168.254.129/27, 192.168.254.161/27
$ grep vm1 /etc/hosts
192.168.254.1    vm1.lab.host
192.168.254.129  vm1.lab.gluster

vm2.lab:
eth1: 192.168.254.5/30
ovirtmgmt (bridge on eth2): 192.168.254.130/27, 192.168.254.162/27
$ grep vm2 /etc/hosts
192.168.254.5    vm2.lab.host
192.168.254.130  vm2.lab.gluster

I'm not aware of any host IP/name change related to this. However, I also modified /etc/hosts during the oVirt upgrade.

And glusterVolumeStatus output:

$ vdsClient -s localhost glusterVolumeStatus volumeName=storage
{'status': {'code': 0, 'message': 'Done'},
 'volumeStatus': {'bricks': [{'brick': 'vm1.lab.gluster:/gluster/vms/storage',
                              'hostuuid': '0c779c52-a097-4101-9c85-c9636499ce82',
                              'pid': '1894',
                              'port': '50153',
                              'status': 'ONLINE'},
                             {'brick': 'vm2.lab.gluster:/gluster/vms/storage',
                              'hostuuid': '21a5bd1e-78e3-4824-b299-f7a7c72b7d7a',
                              'pid': '1758',
                              'port': '50159',
                              'status': 'ONLINE'}],
                  'name': 'storage',
                  'nfs': [{'hostname': '192.168.254.5',
                           'hostuuid': '21a5bd1e-78e3-4824-b299-f7a7c72b7d7a',
                           'pid': '24817',
                           'port': '2049',
                           'status': 'ONLINE'},
                          {'hostname': '192.168.254.129',
                           'hostuuid': '0c779c52-a097-4101-9c85-c9636499ce82',
                           'pid': '16277',
                           'port': '2049',
                           'status': 'ONLINE'}],
                  'shd': [{'hostname': '192.168.254.5',
                           'hostuuid': '21a5bd1e-78e3-4824-b299-f7a7c72b7d7a',
                           'pid': '24824',
                           'status': 'ONLINE'},
                          {'hostname': '192.168.254.129',
                           'hostuuid': '0c779c52-a097-4101-9c85-c9636499ce82',
                           'pid': '16286',
                           'status': 'ONLINE'}]}}

--- Additional comment from Ludek Finstrle on 2014-09-29 12:51:33 EDT ---

The previous output is from vm2.lab.gluster host.

Now here is the output from vm1.lab.gluster (it doesn't match: the local IP is wrong, the remote IP is ok).

$ vdsClient -s localhost glusterVolumeStatus volumeName=storage
{'status': {'code': 0, 'message': 'Done'},
 'volumeStatus': {'bricks': [{'brick': 'vm1.lab.gluster:/gluster/vms/storage',
                              'hostuuid': '0c779c52-a097-4101-9c85-c9636499ce82',
                              'pid': '1894',
                              'port': '50153',
                              'status': 'ONLINE'},
                             {'brick': 'vm2.lab.gluster:/gluster/vms/storage',
                              'hostuuid': '21a5bd1e-78e3-4824-b299-f7a7c72b7d7a',
                              'pid': '1758',
                              'port': '50159',
                              'status': 'ONLINE'}],
                  'name': 'storage',
                  'nfs': [{'hostname': '192.168.254.1',
                           'hostuuid': '0c779c52-a097-4101-9c85-c9636499ce82',
                           'pid': '16277',
                           'port': '2049',
                           'status': 'ONLINE'},
                          {'hostname': '192.168.254.130',
                           'hostuuid': '21a5bd1e-78e3-4824-b299-f7a7c72b7d7a',
                           'pid': '24817',
                           'port': '2049',
                           'status': 'ONLINE'}],
                  'shd': [{'hostname': '192.168.254.1',
                           'hostuuid': '0c779c52-a097-4101-9c85-c9636499ce82',
                           'pid': '16286',
                           'status': 'ONLINE'},
                          {'hostname': '192.168.254.130',
                           'hostuuid': '21a5bd1e-78e3-4824-b299-f7a7c72b7d7a',
                           'pid': '24824',
                           'status': 'ONLINE'}]}}

--- Additional comment from Sahina Bose on 2014-09-30 01:48:38 EDT ---

The host names returned from gluster volume status are mapped to the engine's hosts using the hostuuid field.

For the hosts you have in your engine, vm1.lab.gluster and vm2.lab.gluster, could you tell me their host UUIDs?

You can do this by running the query:
psql engine postgres -c "select vds_name, gluster_server_uuid  from vds_static, gluster_server where vds_id= server_id;"

Also, could you attach the engine.log to the bug?

--- Additional comment from Ludek Finstrle on 2014-09-30 02:44:33 EDT ---

I attached the requested log (and also vdsm logs from the nodes). I'm running a two-node oVirt with hosted engine. The hosts are also gluster nodes. All gluster volumes are replicated with two bricks.

They're whole-day logs. I don't remember exactly when I started the upgrade, but I definitely upgraded in this order:
1) ovirt-engine
2) vm1.lab
3) vm2.lab

The gluster worked perfectly all the time. What I describe below is just the status in oVirt admin console.

The status of the gluster volumes in oVirt was ok in the morning. After that (I don't remember whether after the 1st or 2nd step, but I think after the 2nd) the gluster bricks on vm1 went down (red triangle). Then I also upgraded vm2.lab. All gluster bricks on vm1 were red and on vm2 there were question marks (no red or green triangle, just a black question mark). As the last thing, I tried to stop & start the isos volume from the admin console and it went into green status.

In the meanwhile, I tried restarting the whole environment (engine, hosts) without impact on the running VMs. I also tried stop & start of isos and it went into green status, but after some other step it went back into the vm1-red and vm2-question-mark state.

I'm trying to run the pgsql query, but I've never logged in to the oVirt internal pgsql instance (so I don't know the credentials - or where to get them - and I don't have identd installed). So it'll take me some time. Maybe I'll change ident to trust auth :)

--- Additional comment from Ludek Finstrle on 2014-09-30 02:49:39 EDT ---

# sudo -u postgres psql engine postgres -c "select vds_name, gluster_server_uuid  from vds_static, gluster_server where vds_id= server_id;"
   vds_name      |         gluster_server_uuid          
-----------------+--------------------------------------
 vm1.lab.gluster | 0c779c52-a097-4101-9c85-c9636499ce82
 vm2.lab.gluster | 21a5bd1e-78e3-4824-b299-f7a7c72b7d7a
(2 rows)

--- Additional comment from Ludek Finstrle on 2014-09-30 03:35:04 EDT ---

I see how to reproduce the question mark state:
1) put the node under maintenance (not sure if needed)
2) stop vdsmd service on that node
3) Refresh capabilities from web admin console for that node while vdsmd is down

I see how to reproduce the down (red triangle) state:
1) put the node under maintenance (not sure if needed)
2) stop glusterd service on that node
3) Refresh capabilities from web admin console for that node while glusterd is down

--- Additional comment from Sahina Bose on 2014-09-30 05:51:48 EDT ---

Hi!

Thanks for the detailed analysis on the bug.

This is as designed, per bug https://bugzilla.redhat.com/show_bug.cgi?id=1021441#c4

1) In the first case - when vdsmd is down, there is no communication possible between the engine and node. This could be due to many reasons - host powered down or vdsmd service not running. So the brick status is temporarily moved to Unknown (?) - the brick will be moved back to UP state, during the next refresh cycle when gluster volume status returns the brick as online.

2) In the second case - when glusterd is down, the bricks are marked Down (red) since gluster volume status will no longer list these bricks.


If the UNKNOWN (?) state is misleading, we could change our refresh logic to always compare against the gluster volume status output: if a brick is not listed in the output, mark it as DOWN.

Please let me know if the UNKNOWN state was the issue. (As the second case seems to be expected behaviour)
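
(For reference, a minimal Python sketch of that compare-based refresh, keying bricks on the 'host:/path' string that gluster reports. This is illustrative only - the actual engine logic is Java, and the names refresh_brick_statuses/engine_bricks are assumptions; the data shape follows the glusterVolumeStatus output pasted earlier in this bug.)

def refresh_brick_statuses(engine_bricks, volume_status):
    """engine_bricks: dict mapping 'host:/path' -> status known to the engine.
    volume_status: parsed glusterVolumeStatus output (see the pastes above)."""
    online = {b['brick'] for b in volume_status['volumeStatus']['bricks']
              if b['status'] == 'ONLINE'}
    for name in engine_bricks:
        # a brick not reported ONLINE by gluster volume status is marked DOWN;
        # one that reappears as ONLINE is moved back to UP on the next cycle
        engine_bricks[name] = 'UP' if name in online else 'DOWN'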

--- Additional comment from Ludek Finstrle on 2014-09-30 08:17:10 EDT ---

States are ok and I understand it. I have no problem with it.

It's not a problem that it went into that state. The problem is that it never returns from that state.
It has been in that status for a day (with vdsmd and glusterd up). Refresh capabilities doesn't help either.

The gluster status refresh is weird. Right now, after a series of tests, I'm in a situation where vm1 is down (the host is up but all services including wdmd, sanlock, gluster and vdsm are down) - for longer than 2 hours - but I see in the oVirt console that the brick on vm1 is up and the brick on vm2 is down (the opposite of reality).

--- Additional comment from Ludek Finstrle on 2014-09-30 08:21:06 EDT ---

I see how to reproduce the question mark state:
1) put the node under maintenance (not sure if needed)
2) stop vdsmd service on that node
3) Refresh capabilities from web admin console for that node while vdsmd is down
4) start vdsmd service on that node
5) the brick remains in the unknown state; even Refresh capabilities doesn't help (the only possible transition is to the down state)

I see how to reproduce the down (red triangle) state:
1) put the node under maintenance (not sure if needed)
2) stop glusterd service on that node
3) Refresh capabilities from web admin console for that node while glusterd is down
4) start glusterd service on that node
5) the brick remains in the down state; even Refresh capabilities doesn't help

The problem appeared yesterday and I still see the strange state.
The only way out of this is to stop & start the gluster volume from the oVirt console.

--- Additional comment from Alastair Neil on 2014-09-30 10:13:23 EDT ---

(In reply to Sahina Bose from comment #13)
> Hi!
> 
> Thanks for the detailed analysis on the bug.
> 
> This is designed as per bug -
> https://bugzilla.redhat.com/show_bug.cgi?id=1021441#c4
> 
> 1) In the first case - when vdsmd is down, there is no communication
> possible between the engine and node. This could be due to many reasons -
> host powered down or vdsmd service not running. So the brick status is
> temporarily moved to Unknown (?) - the brick will be moved back to UP state,
> during the next refresh cycle when gluster volume status returns the brick
> as online.

Per my original bug report: agreed, this is the expected behaviour, but it does not happen in my cluster. Is this resolved in 3.5? Until this additional report came in, I had not seen any action on this bug, or even a confirmation that there was a problem, or a request for information.

> 
> 2) In the second case - when glusterd is down, the bricks are marked Down
> (red) since gluster volume status will no longer list these bricks.
> 
> 
> If the UNKNOWN (?) state is misleading, we could change our refresh logic -
> to always compare results from gluster volume status output. If the brick is
> not listed in the output, then mark this as DOWN.
> 
> Please let me know if the UNKNOWN state was the issue. (As the second case
> seems to be expected behaviour)

--- Additional comment from Sahina Bose on 2014-10-01 01:36:10 EDT ---

(In reply to Alastair Neil from comment #16)
> (In reply to Sahina Bose from comment #13)
> > Hi!
> > 
> > Thanks for the detailed analysis on the bug.
> > 
> > This is designed as per bug -
> > https://bugzilla.redhat.com/show_bug.cgi?id=1021441#c4
> > 
> > 1) In the first case - when vdsmd is down, there is no communication
> > possible between the engine and node. This could be due to many reasons -
> > host powered down or vdsmd service not running. So the brick status is
> > temporarily moved to Unknown (?) - the brick will be moved back to UP state,
> > during the next refresh cycle when gluster volume status returns the brick
> > as online.
> 
> per my original bug report, agreed this is the expected behaviour but this
> does not happen in my cluster, is this resolved in 3.5? Until this
> additional report came in I had not seen any action on this bug, or even a
> confirmation that there was a problem, or a request for information.


This fix was introduced in http://gerrit.ovirt.org/#/c/21444/ and should be available since ovirt-3.4.0

We had tried to reproduce the issue, but were unable to. We may be missing a scenario here.

If this happens again, could you provide the engine log and vdsm log?

> 
> > 
> > 2) In the second case - when glusterd is down, the bricks are marked Down
> > (red) since gluster volume status will no longer list these bricks.
> > 
> > 
> > If the UNKNOWN (?) state is misleading, we could change our refresh logic -
> > to always compare results from gluster volume status output. If the brick is
> > not listed in the output, then mark this as DOWN.
> > 
> > Please let me know if the UNKNOWN state was the issue. (As the second case
> > seems to be expected behaviour)

--- Additional comment from Sahina Bose on 2014-10-01 01:38:33 EDT ---

(In reply to Ludek Finstrle from comment #15)
> I see how to reproduce the question mark state:
> 1) put the node under maintenance (not sure if needed)
> 2) stop vdsmd service on that node
> 3) Refresh capabilities from web admin console for that node while vdsmd is
> down
> 4) start vdsmd service on that node
> 5) brick remain in unknown state even Refresh capabilities doesn't help (the
> only possible transition is to down state)
> 
> I see hot to reproduce the down (red triangle) state:
> 1) put the node under maintenance (not sure if needed)
> 2) stop glusterd service on that node
> 3) Refresh capabilities from web admin console for that node while glusterd
> is down
> 4) start glusterd service on that node
> 5) brick remain in down state even Refresh capabilities doesn't help
> 
> The problem arrived yesterday and I still see the strange state.
> The only way from this is to stop & start gluster volume from ovirt console.

Refresh capabilities only executes getVdsCaps on the node. It does not execute the gluster commands - hence it would not result in changing the brick state to green. I did not notice any errors in your logs regarding the gluster commands. Let me dig into this further now that there are reproducible steps.
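
(Roughly, in terms of the vdsClient invocations already shown in this bug:)

$ vdsClient -s localhost getVdsCaps                                # what Refresh Capabilities runs
$ vdsClient -s localhost glusterVolumeStatus volumeName=storage    # what the gluster status sync relies on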

--- Additional comment from Sahina Bose on 2014-10-01 04:57:44 EDT ---

As the engine and vdsm logs were not from the same time, there was not much I could infer from them. No exceptions related to volume status were found in either.

One possibility is that the brick status does not get updated because the host names differ - the one that gluster returns and the one the engine is aware of. The referenced patch addresses this issue.
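
(A small illustration of that failure mode, with hypothetical names - this is not the actual engine code:)

# gluster reports the brick under the hostname gluster knows:
reported_brick = 'host-a.storage.example:/data/brick1'
# the engine may know the same host under a different name or address:
engine_bricks = {'host-a.mgmt.example:/data/brick1': 'UNKNOWN'}
# a lookup keyed on the reported 'host:/path' string misses, so the status
# update for this brick is skipped and it stays UNKNOWN/DOWN
if reported_brick not in engine_bricks:
    pass  # update skipped even though gluster reports the brick as ONLINE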

--- Additional comment from Ludek Finstrle on 2014-10-01 08:35:54 EDT ---

Oops, I'm sorry, I didn't notice that log rotation happened while I was downloading the logs.
Are you interested in the vdsm logs from 9/29/2014?

Will this patch be included in 3.5.0 release or 3.5.1?

--- Additional comment from Sahina Bose on 2014-10-01 09:52:15 EDT ---

(In reply to Ludek Finstrle from comment #20)
> Ops, I'm sorry I didn't notice that log rotation proceeded while I
> downloaded logs.
> Are you interested in vdsm logs from 9/29/2014?

If you do have it, yes. And this is the time during which the brick status was in red state, correct?

> 
> Will this patch be included in 3.5.0 release or 3.5.1?

It should be in 3.5.0

--- Additional comment from Ludek Finstrle on 2014-10-02 09:40:33 EDT ---

I uploaded vdsm logs from vm1 and vm2, hopefully from the right time.
The previously attached vdsm logs (from 9/30) are from the time when the status of the gluster volumes was displayed incorrectly (red for all except isos - I stopped & started isos).

Comment 1 Sahina Bose 2014-10-15 06:56:39 UTC
*** Bug 1152882 has been marked as a duplicate of this bug. ***

Comment 2 Sahina Bose 2014-10-29 05:45:58 UTC
Changed the brick status sync to look at the host UUID of the brick to correctly identify the brick in the database.
Earlier, updating the brick status was skipped if a host had multiple hostnames and the one used by gluster was different from the one used by the RHSC engine.
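
(For reference, a minimal Python sketch of the changed matching, keying each brick on host UUID plus brick directory instead of on the reported hostname. Illustrative only - the actual fix is in the engine's Java sync code, see the gerrit links above; the names sync_brick_statuses, server_uuid and brick_dir are assumptions.)

def sync_brick_statuses(engine_bricks, volume_status):
    """engine_bricks: list of dicts with 'server_uuid' (the engine's
    gluster_server_uuid), 'brick_dir' (e.g. '/gluster/vms/storage') and 'status'.
    volume_status: parsed glusterVolumeStatus output as pasted earlier."""
    reported = {}
    for b in volume_status['volumeStatus']['bricks']:
        # key on (host uuid, brick directory); the hostname part of 'host:/path'
        # is ignored, so a host with multiple addresses is still matched
        brick_dir = b['brick'].split(':', 1)[1]
        reported[(b['hostuuid'], brick_dir)] = b['status']
    for brick in engine_bricks:
        state = reported.get((brick['server_uuid'], brick['brick_dir']))
        brick['status'] = 'UP' if state == 'ONLINE' else 'DOWN'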

Comment 4 Shruti Sampat 2014-11-20 07:56:03 UTC
Verified as fixed in rhsc-3.0.2-1.16.el6rhs.noarch.rpm

Brick status sync works correctly even if the hostname returned by gluster volume status is different from what the engine knows. Tested with both vdsmd and glusterd being restarted.

Comment 5 Shalaka 2014-11-27 06:25:29 UTC
Please add doc text for this bug.

Comment 6 Pavithra 2014-12-26 07:06:07 UTC
Hi Sahina,

Can you please review the edited doc text for technical accuracy and sign off?

Comment 7 Sahina Bose 2014-12-30 05:53:40 UTC
Looks good

Comment 9 errata-xmlrpc 2015-01-15 13:49:51 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://rhn.redhat.com/errata/RHBA-2015-0039.html

