1483160 – On Giveback after A Share Service is Brought Back after Being Shut Down Manila's Access to Shares is Lost

Bug 1483160 - On Giveback after A Share Service is Brought Back after Being Shut Down Manila's Access to Shares is Lost

Summary: On Giveback after A Share Service is Brought Back after Being Shut Down Manil...

Keywords:
Status:	CLOSED ERRATA
Alias:	None
Product:	Red Hat OpenStack
Classification:	Red Hat
Component:	puppet-manila
Sub Component:
Version:	10.0 (Newton)
Hardware:	Unspecified
OS:	Unspecified
Priority:	high
Severity:	medium
Target Milestone:	z6
Target Release:	10.0 (Newton)
Assignee:	Tom Barron
QA Contact:	Dustin Schoenbrun
Docs Contact:	Don Domingo
URL:
Whiteboard:
Depends On:	1485016
Blocks:
TreeView+	depends on / blocked

Reported:	2017-08-18 22:18 UTC by Dustin Schoenbrun
Modified:	2017-11-15 13:45 UTC (History)
CC List:	4 users (show)
Fixed In Version:	openstack-tripleo-heat-templates-5.3.3-1.el7ost puppet-manila-9.5.0-2.el7ost
Doc Type:	If docs needed, set a value
Doc Text:
Clone Of:
Clones:	1485015 (view as bug list)
Environment:
Last Closed:	2017-11-15 13:45:13 UTC
Target Upstream Version:
Embargoed:

Attachments	(Terms of Use)

Links
System	ID	Priority	Status	Summary	Last Updated
Launchpad	1712842	None	None	None	2017-08-24 21:05:38 UTC
OpenStack gerrit	499937	None	None	None	2017-09-06 02:20:47 UTC
OpenStack gerrit	508117	None	None	None	2017-09-28 11:33:12 UTC
Red Hat Product Errata	RHBA-2017:3231	normal	SHIPPED_LIVE	Red Hat OpenStack Platform 10 director Bug Fix Advisory	2017-11-15 18:36:45 UTC

Description Dustin Schoenbrun 2017-08-18 22:18:26 UTC

Description of problem:
When the Active share service stops and another, passive share service takes over as the active, when the share service that was stopped comes back, the shares that were on the other share service will become unavailable for Manila to control.

Version-Release number of selected component (if applicable):
openstack-manila-3.0.0-8.el7ost.noarch
openstack-manila-ui-2.5.1-9.el7ost.noarch
puppet-manila-9.5.0-1.el7ost.noarch
python-manilaclient-1.11.0-1.el7ost.noarch
python-manila-3.0.0-8.el7ost.noarch
openstack-manila-share-3.0.0-8.el7ost.noarch

How reproducible:
100%

Steps to Reproduce:
1. Use Infrared to deploy an OSP-10z4 deployment with Manila and with at least 2 controller nodes and any number of compute nodes. I used a NetApp backend for the storage.
2. Disable the active Manila Share service. Observe that the service will start on another controller node.
3. Create a share on the new share service.
4. Re-enable the disabled share service and observe that the share that was created is no longer controllable by Manila.

Actual results:
Manila shares created on the other share service become uncontrollable when the first share service is reactivated.

Expected results:
Disruption of the share service shall not impact Manila shares.

Additional info:
I looked into how Cinder does Volume service HA and they use a hostgroup for all of the volume services so that the "hostname" of the volume service does not change when another volume service takes over the active role. Chances are something similar will need to happen to Manila as well.

Comment 1 Tom Barron 2017-09-08 09:53:00 UTC

puppet manila patch 499937 has merged upstream in stable/newton but we still need to cherry pick THT patch 499111 after it merges to stable/ocata

Comment 2 Tom Barron 2017-09-28 11:33:13 UTC

stable/ocata tripleo-heat-templates patch 499111 has been cherry-picked to stable/newton as 508117

Comment 3 Tom Barron 2017-10-10 07:49:52 UTC

508117 has merged upstream to stable/newton

Comment 7 Dustin Schoenbrun 2017-11-06 18:20:49 UTC

Doing the procedure I listed above with the OSP-10z6 puddle, I was able to successfully have shares survive the loss of the controller node where the share service was running with all shares created before the controller was killed being listed and available while it was down. Looks like we're good here.

Comment 9 errata-xmlrpc 2017-11-15 13:45:13 UTC

Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2017:3231

Note You need to log in before you can comment on or make changes to this bug.