Red Hat Bugzilla – Bug 1468353
Intermittent SSL errors cause iDRAC operation failure
Last modified: 2017-09-20 20:47:37 EDT
Description of problem:
When communicating with an iDRAC, an SSLError or ConnectionError may intermittently occur. If this happens at an inopportune time, it can kill an entire overcloud deployment when using tripleo.
Retry logic should be added to wsman.py so that it will recover when intermittent communication issues with the iDRAC occur.
This issue has been fixed upstream:
This BZ is to backport the fix to OSP10.
Version-Release number of selected component (if applicable):
Try doing an overcloud deployment with several overcloud nodes using Dell hardware and the iDRAC driver. Note that a single SSL error in python-dracclient will kill the deployment.
Steps to Reproduce:
1. See above.
Communication with the iDRAC is not retried on SSL errors.
Communication with the iDRAC should be retried on SSL errors.
I just checked, and this fix has already been pulled in to the latest OSP10/Newton bits. I have validated that the fix works as expected.
As a result, this BZ can be closed. Not sure how you want to handle this - move to ERRATA or just close?
Chris - thanks for checking and validating this.