Bug 1438386 - [RFE] - provide a way for the user to replace host which has both virt and gluster services enabled from UI
Summary: [RFE] - provide a way for the user to replace host which has both virt and gl...
Keywords:
Status: CLOSED DEFERRED
Alias: None
Product: ovirt-engine
Classification: oVirt
Component: BLL.Gluster
Version: 4.1.1.2
Hardware: Unspecified
OS: Unspecified
medium
medium
Target Milestone: ---
: ---
Assignee: Prajith
QA Contact: SATHEESARAN
URL:
Whiteboard:
Depends On:
Blocks: 1633126
TreeView+ depends on / blocked
 
Reported: 2017-04-03 10:09 UTC by RamaKasturi
Modified: 2021-01-11 05:27 UTC (History)
6 users (show)

Fixed In Version:
Clone Of:
Environment:
Last Closed: 2021-01-11 05:27:18 UTC
oVirt Team: Gluster
Embargoed:
pm-rhel: ovirt-4.4+


Attachments (Terms of Use)
host-deploy logfile (1.53 MB, text/plain)
2020-11-20 15:55 UTC, SATHEESARAN
no flags Details
engine.log (1.07 MB, application/gzip)
2020-11-20 15:56 UTC, SATHEESARAN
no flags Details
vdsm.log (559.72 KB, text/plain)
2020-11-20 16:00 UTC, SATHEESARAN
no flags Details


Links
System ID Private Priority Status Summary Last Updated
oVirt gerrit 105924 0 None MERGED Reconfigure Gluster in Reinstall Model 2021-01-27 07:17:54 UTC
oVirt gerrit 109995 0 master MERGED Implementing Replace Host Full Flow from engine in reinstall view 2021-01-27 07:18:37 UTC
oVirt gerrit 111122 0 master MERGED [RFE] Replace Host in Reinstall Flow 2021-01-27 07:17:54 UTC

Description RamaKasturi 2017-04-03 10:09:30 UTC
Description of problem:
currently user cannot perform replace host from UI and it has to be done manually. Replacing a host involves lot of steps and if the setup has SSL enabled with self-signed certs, it is much more complex.  There should be a single button  for replace host  and clicking on that should replace the host with out much hassle.

Version-Release number of selected component (if applicable):
ovirt-engine-4.1.1.2-0.1.el7.noarch

How reproducible:


Steps to Reproduce:
1. 
2.
3.

Actual results:
Currently replace host is completely manual.

Expected results:
Provide a way for the user to replace host from UI.

Additional info:

Comment 2 Sandro Bonazzola 2019-01-28 09:34:14 UTC
This bug has not been marked as blocker for oVirt 4.3.0.
Since we are releasing it tomorrow, January 29th, this bug has been re-targeted to 4.3.1.

Comment 3 Sahina Bose 2019-01-30 12:24:13 UTC
Sachi, does the replace host role handle the case of SSL certificates too?

Comment 4 Sachidananda Urs 2019-01-30 13:01:13 UTC
(In reply to Sahina Bose from comment #3)
> Sachi, does the replace host role handle the case of SSL certificates too?

No, it doesn't as of now.

Comment 5 Prajith 2020-03-11 12:04:55 UTC
The role has been provided and integrated with rhhi-engine side. The role is embedded in reinstall flow from rhhi-engine. 

The role requires 3 parameters and a condition of gluster being supported in the cluster
and a minimum number of 3 host in the same gluster supported cluster
needs to be satisfied to run the playbook

(i)   the oldNode: the node which is needed to reinstalled
(ii)  the clusterNode_1: the maintenance node
(iii) the clusterNode_2: the second maintenance node


As of now this reconfigure gluster role is triggered if the cluster is gluster supported. It is included with reinstall flow , i.e every time a node reinstalled from the rhhi-engine side , the node's gluster gets reconfigured and gets added back to gluster peer network. (A front end checkbox is under WIP , which enables customers to choose whether the selected node needs to gluster reconfigured while reinstalling, meaning the  reconfigureGluster is only called if the customer checks the checkbox as true , else the node will be reinstalled as usual)

What the roles does :-

The role basically deletes the existing gluster directory and remove the affected node from the gluster peer network. The role then gets the peer info and other details of the corrupted node from the other two nodes and reconfigure it, and then add it back to the gluster peer network, after which self heal happens and all the nodes will be in sync.

Comment 6 Sandro Bonazzola 2020-03-19 14:40:00 UTC
Moving to QE, assuming that the fixes are included in ovirt-engine-4.4.0-0.0.master.20200315172200.gitb5b5c99ca2f since bug moved to modified on 2020-03-11 but no patch is linked from this bug.

Comment 7 SATHEESARAN 2020-05-27 04:36:40 UTC
(In reply to Sandro Bonazzola from comment #6)
> Moving to QE, assuming that the fixes are included in
> ovirt-engine-4.4.0-0.0.master.20200315172200.gitb5b5c99ca2f since bug moved
> to modified on 2020-03-11 but no patch is linked from this bug.

This feature failed qualification as the feature doesn't consider the presence of 2
networks and also misses the step to perform 'gluster volume replace-brick'

These changes will be accomodated with the slight changes in the design to translate
the front-end FQDN/IP to back-end FQDN/IP.

So we are in agreement to fix this bug for ovirt-4.4.z

The existing RHV Manager UI to 'Reconfigure Gluster' during the 'reinstallation' of the
host to the cluster will be disabled and tracked as part of this bug - https://bugzilla.redhat.com/show_bug.cgi?id=1840083

Comment 8 Sandro Bonazzola 2020-08-28 15:45:37 UTC
This bug is not marked as blocker and should be re-targeted to 4.4.3

Comment 9 SATHEESARAN 2020-08-31 08:07:46 UTC
(In reply to Sandro Bonazzola from comment #8)
> This bug is not marked as blocker and should be re-targeted to 4.4.3

Hi Sandro,

Yes, this bug is not a blocker for ovirt-4.4.2.
Moving it to 4.4.3

Comment 10 Prajith 2020-09-07 08:02:38 UTC
rechanging the status to assigned since the code was reverted due to jenkins OST break,ref :- https://gerrit.ovirt.org/#/c/111027/

Comment 11 SATHEESARAN 2020-11-20 15:39:20 UTC
Tested with RHV 4.4.3 (4.4.3.12-0.1.el8ev), but the replacing the host procedure failed.

Tested with the following steps:

1. Create a 3 node RHHI-V deployment
2. Created a separate gluster network for gluster and attached that HC hosts
3. Simulated a failure of one of the node, by abruptly turning off that node
4. Reinstalled the OS on that node, without formatting the other disks
5. Copied the authorized_keys from other hosts to the newly installed host
6. Removed the LVM filter from /etc/lvm/lvm.conf
7. Performed replace host procedure from RHV admin portal and that failed.

The relevant logs will be attached soon.
As this bug is not a blocker for the release, retargeting this bug for RHV 4.4.4
The replace host playbook still works good to replacing the same host

Comment 12 RHEL Program Management 2020-11-20 15:39:28 UTC
Target release should be placed once a package build is known to fix a issue. Since this bug is not modified, the target version has been reset. Please use target milestone to plan a fix for a oVirt release.

Comment 13 SATHEESARAN 2020-11-20 15:55:39 UTC
Created attachment 1731314 [details]
host-deploy logfile

Comment 14 SATHEESARAN 2020-11-20 15:56:06 UTC
Created attachment 1731315 [details]
engine.log

Comment 15 SATHEESARAN 2020-11-20 16:00:03 UTC
Created attachment 1731316 [details]
vdsm.log

Comment 16 SATHEESARAN 2020-12-16 14:11:14 UTC
For now, moving this bug out of ovirt-4.4.4 as decided in the team meeting.

@Gobinda, could you close this bug, removing all the acks..

Comment 17 Gobinda Das 2021-01-11 05:27:18 UTC
Closing this bug as righ now we don't have plan for RFE.


Note You need to log in before you can comment on or make changes to this bug.