Bug 1721475 - Improvements on Gluster Deployment in RHHI
Summary: Improvements on Gluster Deployment in RHHI
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Gluster Storage
Classification: Red Hat Storage
Component: rhhi
Version: rhhiv-1.6
Hardware: x86_64
OS: Linux
unspecified
high
Target Milestone: ---
: RHHI-V 1.6.z Async Update
Assignee: Sahina Bose
QA Contact: SATHEESARAN
URL:
Whiteboard:
Depends On: 1712798
Blocks:
TreeView+ depends on / blocked
 
Reported: 2019-06-18 11:29 UTC by SATHEESARAN
Modified: 2019-10-03 12:24 UTC (History)
10 users (show)

Fixed In Version: gluster-ansible-roles-1.0.5-2, gluster-ansible-features-1.0.5-2
Doc Type: No Doc Update
Doc Text:
Clone Of: 1712798
Environment:
Last Closed: 2019-10-03 12:23:57 UTC
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Red Hat Product Errata RHBA-2019:2963 0 None None None 2019-10-03 12:24:05 UTC

Description SATHEESARAN 2019-06-18 11:29:21 UTC
Description of problem:
======================
Raising the bug to improve the Gluster Deployment time taken in RHHI. We can remove few checks which might not be actually required.

Version-Release number of selected component (if applicable):
============================================================

How reproducible:
================


Steps to Reproduce:
==================
1.
2.
3.

Actual results:
==============

Expected results:
================


Additional info:
================

--- Additional comment from RHEL Product and Program Management on 2019-05-22 09:25:42 UTC ---

This bug is automatically being proposed for the next minor release of Red Hat Gluster Storage by setting the release flag 'rhgs‑3.5.0' to '?'. 

If this bug should be proposed for a different release, please manually change the proposed release flag.

--- Additional comment from Sahina Bose on 2019-05-22 09:36:53 UTC ---

(In reply to bipin from comment #0)
> Description of problem:
> ======================
> Raising the bug to improve the Gluster Deployment time taken in RHHI. We can
> remove few checks which might not be actually required.
>

Can you be specific on the checks to be removed?

--- Additional comment from bipin on 2019-05-22 09:45:00 UTC ---

Things noted in the current ansible based Gluster deployment:
================================================================

1.Remove the task to start firewalld
  a.TASK [gluster.features/roles/gluster_hci : Enable and start glusterd and chronyd] ***

2.Port Specific
  a.Remove the task that opens port 5666 - used by Nagio in the list of ports. These are no longer required to be opened
  b.There are ports opened from 5900-6923. Why is that required ?

3.Skipped tasks can be removed altogether ?
  a.Tasks involves NFS ganesha, CTDB, Samba are skipped, but still it takes some time to skip those. Can we totally eliminate it ?

4.Multipath configuration changes
  a.QE to test and get back with proper values

5.Glusterfs slice
  a.Ansible roles is also configures glusterfs slice
  b.Adding the host to engine, also configures glusterfs slice

6.Remove the package check step altogether


Doc link:
========
https://docs.google.com/document/d/1p0IsFo-acc7UdSXtpnUoqlVzudLwcrAJH3ATtvw1fJs/edit?ts=5ce51097

--- Additional comment from Sachidananda Urs on 2019-05-27 06:49:52 UTC ---

(In reply to bipin from comment #3)
> Things noted in the current ansible based Gluster deployment:
> ================================================================
> 
> 1.Remove the task to start firewalld
>   a.TASK [gluster.features/roles/gluster_hci : Enable and start glusterd and
> chronyd] ***
> 

Ack. Done.


> 2.Port Specific
>   a.Remove the task that opens port 5666 - used by Nagio in the list of
> ports. These are no longer required to be opened
>   b.There are ports opened from 5900-6923. Why is that required ?

These were done as part of pre-requisites, following earlier documentation.
These can be skipped by cockpit, since these are just variables. We can
configure based on which ports we need.

> 
> 3.Skipped tasks can be removed altogether ?
>   a.Tasks involves NFS ganesha, CTDB, Samba are skipped, but still it takes
> some time to skip those. Can we totally eliminate it ?

This is by design. The gluster.features role defines all the features like (rhhi, nfs, ctdb) ... 
Skipping these tasks takes less than a second.

> 
> 4.Multipath configuration changes
>   a.QE to test and get back with proper values

Ack.

> 
> 5.Glusterfs slice
>   a.Ansible roles is also configures glusterfs slice
>   b.Adding the host to engine, also configures glusterfs slice



> 
> 6.Remove the package check step altogether

Ack. Done.

> 
> 
> Doc link:
> ========
> https://docs.google.com/document/d/1p0IsFo-
> acc7UdSXtpnUoqlVzudLwcrAJH3ATtvw1fJs/edit?ts=5ce51097

--- Additional comment from SATHEESARAN on 2019-05-27 09:41:04 UTC ---

Let's try to have precise changes listed here:

1. Port to be removed from the list of ports opened: 

5666
Reason: Nagios no longer used

5900-6923. 
Why is that required ? who's using these ports ?

2. Multipath config changes.
QE to provide input based on testing

3. Glusterfs slice configuration
Decision: to be removed
Reason: Redundant.This is already taken care while adding the host in to the cluster

4. Remove package check
Decision to be remove the package check like ovirt-host, screen, vdsm, etc
Reason: RHVH bundles all these packages, and for RHEL installing these packages are 
required to be done as pre-requisite.

--- Additional comment from RHEL Product and Program Management on 2019-05-27 09:59:17 UTC ---

This BZ is being approved for the RHGS 3.5.0 release, upon receipt of the 3 ACKs (PM,Devel,QA) for the release flag 'rhgs-3.5.0', and on being attached to an official RHGS 3.5.0 BZ Tracker

--- Additional comment from SATHEESARAN on 2019-05-30 09:38:52 UTC ---


(In reply to SATHEESARAN from comment #5)
> Let's try to have precise changes listed here:
> 
> 1. Port to be removed from the list of ports opened: 
> 
> 5666
> Reason: Nagios no longer used
> 
> 5900-6923. 
> Why is that required ? who's using these ports ?
> 
> 2. Multipath config changes.
> QE to provide input based on testing
> 
> 3. Glusterfs slice configuration
> Decision: to be removed
> Reason: Redundant.This is already taken care while adding the host in to the
> cluster
> 
> 4. Remove package check
> Decision to be remove the package check like ovirt-host, screen, vdsm, etc
> Reason: RHVH bundles all these packages, and for RHEL installing these
> packages are 
> required to be done as pre-requisite.


Sac,

As part of this bug following can be treated as final requirements:
1. Remove package check
2. Remove glusterfs slice configuration
3. Remove port 5666 from the list of ports opened

For multipath configs, I will raise a separate bug

Sac, Are you  clear with the above list of requirements ?

--- Additional comment from Sachidananda Urs on 2019-05-30 09:55:54 UTC ---

(In reply to SATHEESARAN from comment #7)
> (In reply to SATHEESARAN from comment #5)
> > Let's try to have precise changes listed here:
> > 
> > 1. Port to be removed from the list of ports opened: 
> > 
> > 5666
> > Reason: Nagios no longer used
> > 
> > 5900-6923. 
> > Why is that required ? who's using these ports ?
> > 
> > 2. Multipath config changes.
> > QE to provide input based on testing
> > 
> > 3. Glusterfs slice configuration
> > Decision: to be removed
> > Reason: Redundant.This is already taken care while adding the host in to the
> > cluster
> > 
> > 4. Remove package check
> > Decision to be remove the package check like ovirt-host, screen, vdsm, etc
> > Reason: RHVH bundles all these packages, and for RHEL installing these
> > packages are 
> > required to be done as pre-requisite.
> 
> 
> Sac,
> 
> As part of this bug following can be treated as final requirements:
> 1. Remove package check
> 2. Remove glusterfs slice configuration
> 3. Remove port 5666 from the list of ports opened
> 
> For multipath configs, I will raise a separate bug
> 
> Sac, Are you  clear with the above list of requirements ?

Ack.

--- Additional comment from Sachidananda Urs on 2019-06-04 09:28:00 UTC ---

https://github.com/gluster/gluster-ansible/pull/70

Awaiting updates on multipath.

--- Additional comment from Sachidananda Urs on 2019-06-11 05:10:30 UTC ---

All the above checks will be made before any changes are done.

Copying comments from bug raised by Sean:

Original bug: https://bugzilla.redhat.com/show_bug.cgi?id=1713929

[RFE] Deployment system for RHHI-V 1.6 lacks the ability to run the prerequisite playbook in the cockpit as preflight check,so potential points of failure are made visible before deploying (with maybe link to KB article to resolve )

--- Additional comment from Sachidananda Urs on 2019-06-11 05:11:33 UTC ---

sas, I'm waiting on the multipath updates. Will move the bug to POST once those details are provided.

--- Additional comment from Sachidananda Urs on 2019-06-11 05:13:39 UTC ---



--- Additional comment from SATHEESARAN on 2019-06-14 03:49:19 UTC ---

(In reply to Sachidananda Urs from comment #11)
> sas, I'm waiting on the multipath updates. Will move the bug to POST once
> those details are provided.

I have done couple of testing paths:
1. With multipath entries for local disks in place, rebooted the host,
 to make sure that the bricks are mounted with XFS UUID. This works good.

2. Tried to deploy with multipath entries instead of disk names. This is little
tough to test with, as the deployment procedure itself flushes away the mpath entries.

It requires full fledged testing and I would recommend to go ahead with, removing
the places where multipath is configured.
Following items are done and those should be removed:
        1. Generate /etc/multipath using the command 'vdsm-tool configure --force'
        2. Edit multipath.conf to have blacklisting configuration
I have created a separate bug 1720334 to take care of this issue

--- Additional comment from Atin Mukherjee on 2019-06-16 14:40:54 UTC ---

Need an estimate on how soon this fix can land into upstream. This is to assess to current state to meet to the development freeze milestone.

--- Additional comment from Sachidananda Urs on 2019-06-18 11:23:28 UTC ---

(In reply to Atin Mukherjee from comment #14)
> Need an estimate on how soon this fix can land into upstream. This is to
> assess to current state to meet to the development freeze milestone.


PR: https://github.com/gluster/gluster-ansible-features/pull/31 fixes the issue.

This was done quite sometime ago, I forgot to update the bug. Sorry about that.

Comment 5 SATHEESARAN 2019-06-25 11:21:15 UTC
The dependent bug is already ON_QA and moving this bug also to ON_QA

Comment 6 SATHEESARAN 2019-06-26 01:34:26 UTC
Tested with RHVH 4.3.5 + RHEL 7.7 + RHGS 3.4.4 ( interim build - glusterfs-6.0-6 ) with ansible 2.8.1-1
with:
gluster-ansible-features-1.0.5-2.el7rhgs.noarch
gluster-ansible-roles-1.0.5-2.el7rhgs.noarch
gluster-ansible-infra-1.0.4-3.el7rhgs.noarch

Gluster configuration is done with the following modifications:
1. Glusterfs systemd slice configuration removed in favor of configuring the same while adding the host to RHVM managed cluster
2. Configuration of multipath removed: no blacklisting of devices
3. Package installation step is removed, as RHVH has all the packages available

Comment 8 errata-xmlrpc 2019-10-03 12:23:57 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2019:2963


Note You need to log in before you can comment on or make changes to this bug.