Bug 824904

Summary: Parameter mismatch in returned parameters caused deadlock
Product: [Retired] CloudForms Cloud Engine Reporter: Chris Pelland <cpelland>
Component: aeolus-audrey-agentAssignee: Greg Blomquist <gblomqui>
Status: CLOSED ERRATA QA Contact: dgao
Severity: high Docs Contact:
Priority: unspecified    
Version: 1.0.0CC: akarol, cpelland, dajohnso, deltacloud-maint, dradez, gblomqui, hbrock, jliberma, jrd, sachua, scollier, sreichar, whayutin
Target Milestone: 1.0.1Keywords: Rebase, Triaged, ZStream
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: Rebase: Bug Fixes and Enhancements
Doc Text:
A faulty application blueprint can lock instances in a boot state. For example, an application blueprint with a mistyped parameter name fails due to Audrey Configuration Server being unable to find the correct runtime configuration settings. The Audrey Configuration Server uses a ten minute timeout to bypass any errors with application blueprints. This allows the instances to boot without runtime configuration. Ensure to verify all application blueprints before launch to avoid this issue.
Story Points: ---
Clone Of: 801861 Environment:
Last Closed: 2012-07-10 07:23:16 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On: 801861    
Bug Blocks:    

Description Chris Pelland 2012-05-24 14:48:17 UTC
+++ This bug was initially created as a clone of Bug #801861 +++

Description of problem:

While I understand that I caused this issue to myself, it seems that some potential checking would have save a bit of aggravation.

I had a mutli-instance Blueprint which one assembly returned two parameters to another instance.  Access to the parameters use a reference to the assembly 
name.  I mistyped a "-" instead of a "_", thus the expect parameters were not actually defined.   Then the instance that was waiting in the boot sequence for the parameters was waiting for config server to pass them to it, it wait since config server never had the 'correct' parameters.

Seems like this case could have been caught prior to launch by checking that names and reference existing in the Blueprint.


Version-Release number of selected component (if applicable):


Beta3


Cfg Server

[root@dhcp-105 aeolus-configserver]# cat /etc/redhat-release 
Red Hat Enterprise Linux Server release 6.2 (Santiago)
[root@dhcp-105 aeolus-configserver]# uname -a
Linux dhcp-105.cloud.lab.eng.bos.redhat.com 2.6.32-220.4.2.el6.x86_64 #1 SMP Mon Feb 6 16:39:28 EST 2012 x86_64 x86_64 x86_64 GNU/Linux
[root@dhcp-105 aeolus-configserver]# rpm -qa aeolus-configserver
aeolus-configserver-0.4.6-0.el6.noarch
[root@dhcp-105 aeolus-configserver]# 

CE

yum list | grep aeolus
aeolus-audrey-agent.noarch
aeolus-configserver.noarch
[root@dhcp-105 aeolus-configserver]# 


How reproducible:

Was reproducible while debuggin it.

Steps to Reproduce:
1.
2.
3.
  
Actual results:


Expected results:


Additional info:

Comment 2 sachua 2012-06-06 04:30:35 UTC
    Technical note added. If any revisions are required, please edit the "Technical Notes" field
    accordingly. All revisions will be proofread by the Engineering Content Services team.
    
    New Contents:
A faulty application blueprint can lock instances in a boot state. For example, an application blueprint with a mistyped parameter name fails due to Audrey Configuration Server being unable to find the correct runtime configuration settings. The Audrey Configuration Server uses a ten minute timeout to bypass any errors with application blueprints. This allows the instances to boot without runtime configuration. Ensure to verify all application blueprints before launch to avoid this issue.

Comment 3 dgao 2012-06-19 18:17:38 UTC
[root@dhcp77-213 ~]# rpm -qa | grep "aeolus-audrey"
aeolus-audrey-agent-0.4.9-1.el6_2.noarch


2012-06-19 07:38:12,886 - INFO    : audrey:951 Invoked CSClient.get_cs_tooling()
2012-06-19 07:38:12,929 - INFO    : audrey:683 Invoked unpack_tooling()
2012-06-19 07:38:12,936 - INFO    : audrey:908 Invoked CSClient.get_cs_configs()
2012-06-19 07:38:12,976 - INFO    : audrey:1369 No configuration parameters provided. status: 202
.
.
.

2012-06-19 07:48:18,513 - INFO    : audrey:908 Invoked CSClient.get_cs_configs()
2012-06-19 07:48:18,560 - INFO    : audrey:1369 No configuration parameters provided. status: 202

Audrey agent exists after 10mins. Verified.

Comment 5 errata-xmlrpc 2012-07-10 07:23:16 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

http://rhn.redhat.com/errata/RHBA-2012-1063.html