Bug 1849803

Summary: [Ceph-Installer]: Probe hosts in Cockpit UI fails
Product: [Red Hat Storage] Red Hat Ceph Storage Reporter: Ameena Suhani S H <amsyedha>
Component: Ceph-InstallerAssignee: Jeff Brown <jefbrown>
Status: CLOSED ERRATA QA Contact: Tejas <tchandra>
Severity: urgent Docs Contact:
Priority: unspecified    
Version: 4.1CC: aschoen, ceph-eng-bugs, gmeno, nthomas, pcuzner, tserlin, vashastr, ykaul
Target Milestone: z1Keywords: Regression, TestBlocker
Target Release: 4.1   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: cockpit-ceph-installer-1.2-0.el8cp, cockpit-ceph-installer-1.2-0.el7cp Doc Type: No Doc Update
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2020-07-20 14:21:41 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Attachments:
Description Flags
Probe hosts error screenshot
none
Screenshot of UI where I have added a metric node
none
screenshot showing sucessful probe on QE systems with the patch in place none

Description Ameena Suhani S H 2020-06-22 20:23:28 UTC
Created attachment 1698351 [details]
Probe hosts error screenshot

Description of problem:
Probe hosts in Cockpit UI fail with error "Unexpected playbook failure. Check ansible-runner-service directory '2dcbb7ee-b4c4-11ea-aff0-002590fbd652' for details" as shown in the attachment and hence could not proceed with the installation.

Also, logs are not available in ansible-runner-service directory as the error shows

Version-Release number of selected component (if applicable):

cockpit-ceph-installer-1.1-0.el7cp.noarch
ansible-2.8.12-1.el7ae.noarch
ceph-ansible-4.0.24-1.el7cp.noarch


How reproducible:
3/3

Steps to Reproduce:
1. Follow the steps as in doc

https://access.redhat.com/documentation/en-us/red_hat_ceph_storage/4/html-single/installation_guide/index#installing-red-hat-ceph-storage-using-the-cockpit-web-interface

2. In the validate step to try to probe hosts


Actual results:
fails with error "https://access.redhat.com/documentation/en-us/red_hat_ceph_storage/4/html-single/installation_guide/index#installing-red-hat-ceph-storage-using-the-cockpit-web-interface"

Expected results:
probe hosts should be successful in validating step


Additional info:

Comment 1 Ameena Suhani S H 2020-06-22 20:31:23 UTC
Created attachment 1698354 [details]
Screenshot of UI where I have added a metric node

Also, I had added a metrics node as you can see in the attachment but in next validate step it went missing any idea why this happened?( you can compare attachment 1698351 [details])

Comment 2 Paul Cuzner 2020-06-22 23:09:46 UTC
issue relates to EOL py2 support. The probe trips up detecting the disk signature gpt or mbr since it was using py3 only syntax - the Ameena's test environment is RHEL7 + py2.

fix on upstream branch ready for testing (fix-gpt-detection-py2)

Comment 3 Paul Cuzner 2020-06-24 00:20:31 UTC
Created attachment 1698517 [details]
screenshot showing sucessful probe on QE systems with the patch in place

Comment 4 Paul Cuzner 2020-06-24 00:23:03 UTC
Tested patch on the QE systems directly (minor py change)

- all hosts have py2 as the default python interpreter
- cockpit UI launched, and installer flow started using the existing server:role layout
- probe performaned successfully (see attachment)

Ameena, please confirm. Once confirmed, I'll merge the change and get the rpm rebuilt.

Comment 5 Ameena Suhani S H 2020-06-24 08:27:21 UTC
Hi Paul,
I have tested the patch and the probe host issue is not seen.

Comment 6 Jeff Brown 2020-06-25 14:01:46 UTC
Changes are in Ceph-Installer-1.2 and has merged downstream.

Comment 9 Ameena Suhani S H 2020-06-29 02:09:22 UTC
verified using
cockpit-ceph-installer-1.2-0.el7cp.noarch
ceph-ansible-4.0.25-1.el7cp.noarch
ansible-2.8.12-1.el7ae.noarch

Comment 11 errata-xmlrpc 2020-07-20 14:21:41 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2020:3003