Description of problem:
ironic is very slow to respond, and it is deleting nodes and then trying to build them again.

Steps to Reproduce:
1. Build 4 compute nodes
2. Try to add 4 more nodes

Actual results:
ironic is slow and is rebuilding nodes.
---------------------
Aug 21 17:51:54 tpacpuidc ironic-conductor: raise exception.InstanceDeployFailure(msg)
Aug 21 17:51:54 tpacpuidc ironic-conductor: InstanceDeployFailure: Failed to notify ramdisk to reboot after bootloader installation. Error: [Errno 111] ECONNREFUSED
---------------------

Expected results:
Nodes should be added without issue.
Created attachment 1066619
ironic-conductor and ironic-discoverd logs
I knew this seemed familiar:
https://bugs.launchpad.net/ironic/+bug/1383432

I think we should try lowering the IPMI retry timeout, which is currently at its default ([ipmi] #retry_timeout=60). We should find the highest value that relieves the issue, because setting it too low can cause some BMCs to crash. I would try 30, 15, 10, and 5, in that order.

@Jack, could you have them try that and report the results?
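For reference, a minimal sketch of that change. It assumes the conductor reads its configuration from /etc/ironic/ironic.conf (the usual location, but not confirmed in this report) and starts with the first candidate value, 30:

```ini
# /etc/ironic/ironic.conf  (path is an assumption, adjust for your install)
[ipmi]
# Shipped commented out as "#retry_timeout=60", i.e. the default of 60 applies.
# Uncomment and lower it one step at a time: 30, then 15, 10, 5.
# Keep the highest value that relieves the slowness; very low values
# can cause some BMCs to crash.
retry_timeout = 30
```

The ironic-conductor service has to be restarted for the new value to take effect. Stop lowering the value once the deployments behave normally again.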
Verified: instack-undercloud-2.1.2-26.el7ost.noarch

The bug was marked SanityOnly. I checked that no regressions were found when using the proposed fix.
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA.

For information on the advisory, and where to find the updated files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2015:1862