| Summary: | [ceph-iscsi-ansible] lun creation is not running in parallel, causing task failures | ||
|---|---|---|---|
| Product: | Red Hat Storage Console | Reporter: | Paul Cuzner <pcuzner> |
| Component: | ceph-ansible | Assignee: | Paul Cuzner <pcuzner> |
| Status: | CLOSED NOTABUG | QA Contact: | Tejas <tchandra> |
| Severity: | urgent | Docs Contact: | |
| Priority: | urgent | ||
| Version: | 2 | CC: | adeza, aschoen, ceph-eng-bugs, ceph-qe-bugs, gmeno, hnallurv, jdillama, kurs, nthomas, pcuzner, sankarshan |
| Target Milestone: | --- | Flags: | kurs:
needinfo+
|
| Target Release: | 2 | ||
| Hardware: | Unspecified | ||
| OS: | Unspecified | ||
| Whiteboard: | |||
| Fixed In Version: | Doc Type: | If docs needed, set a value | |
| Doc Text: | Story Points: | --- | |
| Clone Of: | Environment: | ||
| Last Closed: | 2016-11-03 17:14:47 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | Category: | --- | |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Attachments: | |||
|
Description
Paul Cuzner
2016-10-30 22:58:08 UTC
Created attachment 1215663 [details]
playbook run with -f 10 and -vvv
I ran this playbook with -f 10 -vvv and it worked
Purged the config and reran again ... same issue
Attached a copy of the -vvv output
A couple more observations; 1. when I purge the configuration but leave the rbd's in place and rerun I see the same timeouts. This seems to suggest that the rbd create process has nothing to do with any delay that is being seen. 2. issue is seen whether the playbook is run under root or ansible accounts 3. using -vvv under root or ansible and the playbook completes properly - without -vvv ... timeouts! Talking the issue through with Alfredo Ran a test playbook to remove the ceph-iscsi logic from the picture
---
- name: test timing
hosts: pctest
tasks:
- name: sleep across systems for a changing amount of time
command: sleep {{ item }}
with_items:
- 5
- 10
- 15
- 20
The scheduling issue is still there. This is confirmation that the issue is *not* related to the ceph-iscsi-* rpms/playbooks but is instead build/environmental.
To further test this theory, I installed Ansible 1.9-4 on another box within the BAGL environment (gprfc088), and ran the above playbook. On these servers ansible is scheduling as expected
Created attachment 1216263 [details]
test playbook based on the bash sleep command
Created attachment 1216264 [details]
timings showing the delays in scheduling during the test playbook
Created attachment 1216265 [details]
test playbook running on different servers in BAGL environment - successful
This is caused by the fact that "UserKnownHostsFile=/dev/null" in "/etc/ssh/ssh_config". That is the delta between the two ansible hosts. This is not an issue with ceph-iscsi-ansible and is instead a known and expected behavior when ansible believes you will need to type "yes" to confirm that you know the host due to not persisting the the known hosts file. I believe this should be re-tested and closed by QE once the configuration issue is addressed. Paul,
I dont believe we are seeing this issue anymore. If you feel that any specific QE testing is needed for this, please let us know.
Else this is fixed from a QE side.
Thanks,
Tejas
Moving this to CLOSED/NOTABUG due to environment configuration issue. Agree - this is not an issue. Jason was spot on identifying it as a local ssh configuration problem |