Bug 1674659

Summary: Before gluster deployment make sure Gluster network uses FQDN
Product: [Red Hat Storage] Red Hat Gluster Storage
Reporter: Gobinda Das <godas>
Component: gluster-ansible
Assignee: Sachidananda Urs <surs>
Status: CLOSED ERRATA
QA Contact: SATHEESARAN <sasundar>
Severity: medium
Docs Contact:
Priority: high
Version: rhhiv-1.6
CC: guillaume.pavese, rcyriac, rhs-bugs, sabose, sankarshan, sasundar, surs
Target Milestone: ---
Keywords: TestBlocker, ZStream
Target Release: RHGS 3.4.z Batch Update 4
Hardware: Unspecified
OS: Unspecified
Whiteboard:
Fixed In Version: gluster-ansible-features-1.0.4-3.el7rhgs
Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: 1674650
Environment:
Last Closed: 2019-03-27 03:44:39 UTC
Type: ---
Regression: ---
Mount Type: ---
Documentation: ---
CRM:
Verified Versions:
Category: ---
oVirt Team: ---
RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: ---
Target Upstream Version:
Embargoed:
Bug Depends On:
Bug Blocks: 1674650

Comment 2 Sachidananda Urs 2019-02-19 09:29:01 UTC
Patch posted at: https://github.com/gluster/gluster-ansible-features/pull/20

Comment 4 SATHEESARAN 2019-02-21 09:38:17 UTC
FQDN validation with dig failed because the check also runs dig against 'localhost' as a hostname, and this blocks the RHHI deployment.

The exact error is:
<snip>
TASK [gluster.features/roles/gluster_hci : Check if valid FQDN is provided] ****
failed: [dhcp37-124.lab.eng.blr.redhat.com -> localhost] (item=localhost) => {"changed": true, "cmd": ["dig", "localhost", "+short"], "delta": "0:00:00.117666", "end": "2019-02-21 14:29:31.546174", "failed_when_result": true, "item": "localhost", "rc": 0, "start": "2019-02-21 14:29:31.428508", "stderr": "", "stderr_lines": [], "stdout": "", "stdout_lines": []}
changed: [dhcp37-124.lab.eng.blr.redhat.com -> localhost] => (item=dhcp37-124.lab.eng.blr.redhat.com)
changed: [dhcp37-124.lab.eng.blr.redhat.com -> localhost] => (item=dhcp37-138.lab.eng.blr.redhat.com)
changed: [dhcp37-124.lab.eng.blr.redhat.com -> localhost] => (item=dhcp37-108.lab.eng.blr.redhat.com)
</snip>

[root@ ~]# dig localhost +short
[root@ ~]#
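
For readers following the log above, here is a minimal Python sketch of the kind of check the task appears to perform: run `dig <host> +short` for each host and treat empty output as a failure. This is an assumption drawn from the log (rc 0, empty stdout, failed_when_result true), not the role's actual code. The function names `dig_short` and `unresolved_hosts` are hypothetical.

```python
import subprocess
from typing import Callable, Iterable, List


def dig_short(host: str) -> str:
    """Run `dig <host> +short` and return whatever it printed on stdout."""
    result = subprocess.run(
        ["dig", host, "+short"], capture_output=True, text=True, check=False
    )
    return result.stdout


def unresolved_hosts(
    hosts: Iterable[str], lookup: Callable[[str], str] = dig_short
) -> List[str]:
    """Return the hosts whose lookup printed nothing.

    Assumption: empty output is the failure condition, as the log suggests.
    """
    return [host for host in hosts if not lookup(host).strip()]
```

Passing a different `lookup` callable lets the logic be exercised without a DNS server. With the default, a host such as localhost that makes dig print nothing would be reported as unresolved.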

Comment 6 Sachidananda Urs 2019-02-21 10:18:04 UTC
(In reply to SATHEESARAN from comment #4)
> FQDN validation with dig failed, as there is check 'localhost' as hostname
> with dig command and this blocks the RHHI deployment
> 
> Error exactly goes like this:
> <snip>
> TASK [gluster.features/roles/gluster_hci : Check if valid FQDN is provided]
> ****
> failed: [dhcp37-124.lab.eng.blr.redhat.com -> localhost] (item=localhost) =>
> {"changed": true, "cmd": ["dig", "localhost", "+short"], "delta":
> "0:00:00.117666", "end": "2019-02-21 14:29:31.546174", "failed_when_result":
> true, "item": "localhost", "rc": 0, "start": "2019-02-21 14:29:31.428508",
> "stderr": "", "stderr_lines": [], "stdout": "", "stdout_lines": []}
> changed: [dhcp37-124.lab.eng.blr.redhat.com -> localhost] =>
> (item=dhcp37-124.lab.eng.blr.redhat.com)
> changed: [dhcp37-124.lab.eng.blr.redhat.com -> localhost] =>
> (item=dhcp37-138.lab.eng.blr.redhat.com)
> changed: [dhcp37-124.lab.eng.blr.redhat.com -> localhost] =>
> (item=dhcp37-108.lab.eng.blr.redhat.com)
> </snip>
> 
> [root@ ~]# dig localhost +short
> [root@ ~]#

sas, localhost will fail because it is not a valid FQDN. Can you provide the playbook that you used?
We should not have localhost in the playbook. This is the expected result.

The playbook has to fail if there is an entry called localhost in gluster_inventory.yml.
Otherwise you have to set: gluster_features_fqdn_check: false

Comment 7 Sachidananda Urs 2019-02-21 10:53:13 UTC
Inventory file provided by sas had the lines:

local:
  hosts:
    localhost: null
  vars:
    ansible_ssh_common_args: '-o BatchMode=yes'

which caused the above error. Failing is the expected behaviour, because localhost is not a valid FQDN.
To sidestep the error, set the variable gluster_features_fqdn_check: false
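
To illustrate why localhost ended up being checked, here is a rough Python sketch. It assumes the check walks every host in every inventory group, which the failing log suggests, and that gluster_features_fqdn_check simply switches the check off. The role's real implementation may differ. The group name `storage_nodes`, the host `node1.example.com`, and the function `hosts_to_validate` are hypothetical.

```python
from typing import Any, Dict, List

# Inventory as a plain dict. The "local" group mirrors the fragment quoted
# above; "storage_nodes" and node1.example.com are hypothetical placeholders.
inventory: Dict[str, Dict[str, Any]] = {
    "storage_nodes": {"hosts": {"node1.example.com": None}},
    "local": {
        "hosts": {"localhost": None},
        "vars": {"ansible_ssh_common_args": "-o BatchMode=yes"},
    },
}


def hosts_to_validate(
    inv: Dict[str, Dict[str, Any]], fqdn_check: bool = True
) -> List[str]:
    """Collect every host name across all groups, in first-seen order.

    Returns an empty list when fqdn_check is False, modelling the
    gluster_features_fqdn_check: false override (an assumption about
    how the variable gates the check).
    """
    if not fqdn_check:
        return []
    names: List[str] = []
    for group in inv.values():
        for host in group.get("hosts") or {}:
            if host not in names:
                names.append(host)
    return names
```

Under these assumptions, the localhost entry from the `local` group is collected along with the storage hosts, so it is subjected to the same FQDN check unless the variable disables it.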

Comment 8 SATHEESARAN 2019-02-21 11:43:20 UTC
(In reply to Sachidananda Urs from comment #7)
> Inventory file provided by sas had the lines:
> 
> local:
>   hosts:
>     localhost: null
>   vars:
>     ansible_ssh_common_args: '-o BatchMode=yes'
> 
> which caused the above error, and it is the expected behaviour to fail
> because localhost is not a valid FQDN.
> To sidestep the error we should set the variable -
> gluster_features_fqdn_check: false

Yes, this is the inventory file generated by cockpit-ovirt-dashboard.
So it is a bug in how this inventory file is constructed.

I have reported a bug[1] against cockpit-ovirt-dashboard for this.
That bug[1] blocks verification of this bug.

[1] - https://bugzilla.redhat.com/show_bug.cgi?id=1679536

Comment 9 SATHEESARAN 2019-02-23 17:11:17 UTC
Verified with gluster-ansible-features-1.0.4-3 with the following tests:

1. Used IPs for hostnames, and the deployment failed.
2. Used non-resolvable hostnames, and the deployment also failed.

Comment 12 errata-xmlrpc 2019-03-27 03:44:39 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2019:0661