Bug 1253693 - Ironic deployment hangs after copying image
Summary: Ironic deployment hangs after copying image
Keywords:
Status: CLOSED CURRENTRELEASE
Alias: None
Product: Red Hat OpenStack
Classification: Red Hat
Component: openstack-ironic
Version: Director
Hardware: Unspecified
OS: Unspecified
unspecified
unspecified
Target Milestone: ---
: 7.0 (Kilo)
Assignee: Lucas Alvares Gomes
QA Contact: Toure Dunnon
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2015-08-14 13:14 UTC by Ben Nemec
Modified: 2016-11-10 11:38 UTC (History)
3 users (show)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed: 2016-11-10 11:38:50 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)
ironic conductor logs from failed deployment (12.70 MB, application/zip)
2015-08-14 13:14 UTC, Ben Nemec
no flags Details

Description Ben Nemec 2015-08-14 13:14:42 UTC
Created attachment 1063029 [details]
ironic conductor logs from failed deployment

Description of problem: On an HA deployment (3 controllers, 1 compute), twice in a row one of the controller nodes failed to deploy properly.  Looking through the ironic logs, it appears the node may have been locked when the image copy completed, so ironic failed to send the completion notification and the node just sat at the nc command until it was rebooted.


Version-Release number of selected component (if applicable): 2015.1.0-9


How reproducible: Intermittent, but not frequent


Steps to Reproduce:
1. Deploy servers via ironic
2. 
3.

Actual results: One or more may hang waiting for the completion message from ironic


Expected results: All servers deployed successfully


Additional info: Manually rebooting the server seems to often get around this problem because it re-runs the deployment and the completion signal gets sent successfully.

For grep purposes, the node id that hung was 2efd7fea-e163-48d8-8a9f-fc1c72bc3147

Comment 4 Dmitry Tantsur 2016-11-10 11:38:50 UTC
Hi!

The bug seems to related to the old bash ramdisk, which we unfortunately cannot fix too much. I think it should be gone with switch to IPA.


Note You need to log in before you can comment on or make changes to this bug.