Bug 1484523

Summary: 51-hosts at scale fails to complete and does not report an error
Product: Red Hat OpenStack Reporter: David Vallee Delisle <dvd>
Component: os-refresh-configAssignee: James Slagle <jslagle>
Status: CLOSED DUPLICATE QA Contact: Shai Revivo <srevivo>
Severity: high Docs Contact:
Priority: high    
Version: 10.0 (Newton)CC: aschultz, dhill, mburns, ohochman, pablo.iranzo, rhel-osp-director-maint
Target Milestone: ---Keywords: ZStream
Target Release: ---   
Hardware: All   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2017-08-29 21:34:21 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description David Vallee Delisle 2017-08-23 19:14:09 UTC
Description of problem:
51-hosts fails to complete running, and wedges scale deployments from ever completing. 

Version-Release number of selected component (if applicable):


How reproducible:


Steps to Reproduce:
1. Have a lot of nodes
2. Add more nodes
3. Again, add more nodes

Actual results:
Deployment is hung up with ~457 deployments IN_PROGRESS.

Error message:
/usr/libexec/os-refresh-config/configure.d/51-hosts: line 17: /bin/awk: Argument list too long

Expected results:
Finish deployment

Additional info:
This bug has been upstream already:
https://bugs.launchpad.net/tripleo/+bug/1674732

And the patch seems to be working.

Comment 1 David Vallee Delisle 2017-08-23 19:25:49 UTC
More output:

~~~
192.168.8.177 servername servername.ctlplane' '/^# HEAT_HOSTS_START/ {
            print $0
            print v
            f=1
            }f &&!/^# HEAT_HOSTS_END$/{next}/^# HEAT_HOSTS_END$/{f=0}!f' /etc/cloud/templates/hosts.redhat.tmpl
/usr/libexec/os-refresh-config/configure.d/51-hosts: line 17: /bin/awk: Argument list too long
[2017-08-23 12:58:45,719] (os-refresh-config) [ERROR] during configure phase. [Command '['dib-run-parts', '/usr/libexec/os-refresh-config/configure.d']' returned non-zero exit status 1]

[2017-08-23 12:58:45,719] (os-refresh-config) [ERROR] Aborting...
~~~

Comment 3 Alex Schultz 2017-08-29 21:34:21 UTC

*** This bug has been marked as a duplicate of bug 1484533 ***