Bug 1763672
| Summary: | Timeout for heat deployment 'create_admin' when integrating external ceph | ||
|---|---|---|---|
| Product: | Red Hat OpenStack | Reporter: | Chen <cchen> |
| Component: | openstack-tripleo-common | Assignee: | Rabi Mishra <ramishra> |
| Status: | CLOSED ERRATA | QA Contact: | David Rosenfeld <drosenfe> |
| Severity: | medium | Docs Contact: | |
| Priority: | medium | ||
| Version: | 13.0 (Queens) | CC: | apetrich, aschultz, cschwede, gfidente, johfulto, mburns, msufiyan, nalmond, pmannidi, ramishra, slinaber |
| Target Milestone: | --- | Keywords: | Reopened, Triaged, ZStream |
| Target Release: | --- | Flags: | tshefi: automate_bug- |
| Hardware: | Unspecified | ||
| OS: | Unspecified | ||
| Whiteboard: | |||
| Fixed In Version: | openstack-tripleo-common-8.7.1-21.el7ost | Doc Type: | If docs needed, set a value |
| Doc Text: | Story Points: | --- | |
| Clone Of: | Environment: | ||
| Last Closed: | 2020-10-28 18:23:41 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | Category: | --- | |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | |||
Description
Chen
2019-10-21 10:23:40 UTC
Because this bug is no longer occurring, I'm going to close it. If the issue recurs, feel free to re-open.

NOTABUG: nothing to automate/test per close loop.

We know the following:
- uc (undercloud) ran the access workbook [1]
- uc asked compute to create the access workbook and paused to wait for confirmation that it was done
- compute created the tripleo-admin user
- compute sent an HTTP PUT to swift on the undercloud and received a 201 from the undercloud
- uc swift logs show the 201 received
- ???
- uc access workbook timed out while waiting for confirmation that the user was created
- after the workbook timed out, the overcloud deployment stopped itself because it was unable to continue

We don't know what happened at '???', but we need to know that in order to find the bug.

[1] https://github.com/openstack/tripleo-common/blob/stable/queens/workbooks/access.yaml

Ah! You're missing https://code.engineering.redhat.com/gerrit/#/c/195663/ which is in openstack-tripleo-common-8.7.1-17.el7ost. I don't think it has made it to a zstream yet. Maybe you can use a hotfix.

As you can see, the deployment is started at 17:16:43:

19:16:43,698] (heat-config) [DEBUG] Running /usr/libexec/heat-config/hooks/ansible < /var/lib/heat-config/deployed/d1c4f248-fb74-416d-8d10-c3c620a892d2.json

However, ansible takes 11 minutes (possibly timeouts) when gathering facts, hence the delay.

Apr 8 19:16:44 overcloud-compute-0 ansible-setup: Invoked with filter=* gather_subset=['all'] fact_path=/etc/ansible/facts.d gather_timeout=10

Apr 8 19:27:48 overcloud-compute-0 ansible-user: Invoked with comment=None ssh_key_bits=0 update_password=always non_unique=False force=False ssh_key_type=rsa create_home=True password_lock=None ssh_key_passphrase=NOT_LOGGING_PARAMETER uid=None home=None append=False skeleton=None ssh_key_comment=ansible-generated on overcloud-compute-0 group=None system=False state=present hidden=None local=None shell=None expires=None ssh_key_file=None groups=None move_home=False password=NOT_LOGGING_PARAMETER name=tripleo-admin seuser=None remove=False login_class=None generate_ssh_key=None

Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA.

For information on the advisory (Red Hat OpenStack Platform 13.0 director bug fix advisory), and where to find the updated files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2020:4388

*** Bug 1904588 has been marked as a duplicate of this bug. ***
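
The 11-minute delay cited in the analysis above can be checked from the two syslog timestamps quoted in this report (ansible-setup at 19:16:44, ansible-user at 19:27:48). The following is a minimal sketch, not part of the original report. The helper name `syslog_gap` and the placeholder year are assumptions introduced for illustration, since syslog lines carry no year and only the difference between the two timestamps matters.

```python
from datetime import datetime, timedelta

# Timestamps copied from the two syslog entries quoted in this report.
FACTS_START = "Apr 8 19:16:44"  # ansible-setup (fact gathering) invoked
USER_TASK = "Apr 8 19:27:48"    # ansible-user (tripleo-admin creation) invoked

# Assumption: the log lines do not state a year, so a placeholder is used.
# Both entries fall on the same day, so the choice does not affect the gap.
PLACEHOLDER_YEAR = 2019

# Value of gather_timeout shown in the ansible-setup log line (seconds).
GATHER_TIMEOUT_SECONDS = 10


def syslog_gap(start: str, end: str, year: int = PLACEHOLDER_YEAR) -> timedelta:
    """Return the elapsed time between two 'Mon D HH:MM:SS' syslog stamps."""
    fmt = "%Y %b %d %H:%M:%S"
    t_start = datetime.strptime(f"{year} {start}", fmt)
    t_end = datetime.strptime(f"{year} {end}", fmt)
    return t_end - t_start


gap = syslog_gap(FACTS_START, USER_TASK)
print(gap)                  # elapsed time between the two log entries
print(gap.total_seconds())  # the same gap in seconds
```

The gap comes to 11 minutes 4 seconds, far longer than the `gather_timeout=10` value in the first log line, which is consistent with the suggestion that fact gathering hit repeated timeouts before the user task ran.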