Bug 1334008
Summary: | /setup/key/ endpoint occasionally fails with "/var/lib/ceph-installer/.ssh/id_rsa already exists" | ||||||
---|---|---|---|---|---|---|---|
Product: | [Red Hat Storage] Red Hat Storage Console | Reporter: | Ken Dreyer (Red Hat) <kdreyer> | ||||
Component: | ceph-installer | Assignee: | Alfredo Deza <adeza> | ||||
Status: | CLOSED ERRATA | QA Contact: | sds-qe-bugs | ||||
Severity: | unspecified | Docs Contact: | |||||
Priority: | unspecified | ||||||
Version: | 2 | CC: | adeza, aschoen, ceph-eng-bugs, mkudlej, nthomas, sankarshan, sds-qe-bugs | ||||
Target Milestone: | --- | ||||||
Target Release: | 2 | ||||||
Hardware: | Unspecified | ||||||
OS: | Unspecified | ||||||
Whiteboard: | |||||||
Fixed In Version: | ceph-installer-1.0.9-1.el7scon | Doc Type: | Bug Fix | ||||
Doc Text: | Story Points: | --- | |||||
Clone Of: | Environment: | ||||||
Last Closed: | 2016-08-23 19:50:05 UTC | Type: | Bug | ||||
Regression: | --- | Mount Type: | --- | ||||
Documentation: | --- | CRM: | |||||
Verified Versions: | Category: | --- | |||||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |||||
Cloudforms Team: | --- | Target Upstream Version: | |||||
Embargoed: | |||||||
Attachments: |
|
Description
Ken Dreyer (Red Hat)
2016-05-07 02:29:07 UTC
Created attachment 1154772 [details]
`sudo journalctl -u ceph-installer`
The installer's log (from systemd-journald) shows the error as well as the two separate ssh-keygen invocations.
Something else to note: I'm booting these nodes in sequential order, but they all start very close to each other. The order in which they boot is: node-1: installer node node-2: mon, has the key node-3: osd, has the error node-4: osd, has the key node-5: osd, has the key So maybe node-2 and node-3 are racing there, and node-2's ssh-keygen step hasn't finished before node-3 initiates the HTTP request, therefore triggering the second (doomed-to-fail) ssh-keygen operation. A solution here is to run ssh-keygen very early, before the HTTP server will accept any connections from clients. Looks like gunicorn's on_starting() might work? We can do this when the app is getting started as part of loading the Pecan application. Thanks for debugging this! Pull request opened https://github.com/ceph/ceph-installer/pull/146 Alfredo mind reviewing https://github.com/ceph/ceph-installer/pull/147 for this as well? Reviewed and merged This will be fixed in the upcoming v1.0.9. Checked with ceph-installer-1.0.11-1.el7scon.noarch and we don't see this issue in our test environment. -> Verified Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHEA-2016:1754 |