Bug 1153746 - a deployment that fails due to a misconfigured settings file is unrecoverable
Alias: None
Product: Fedora
Classification: Fedora
Component: rolekit
Version: 21
Hardware: Unspecified
OS: Unspecified
Target Milestone: ---
Assignee: Thomas Woerner
QA Contact: Fedora Extras Quality Assurance
Depends On:
Reported: 2014-10-16 17:14 UTC by Dan Mossor [danofsatx]
Modified: 2014-11-18 12:10 UTC (History)

Fixed In Version: rolekit-0.1.2-1.fc21
Doc Type: Bug Fix
Doc Text:
Clone Of:
Last Closed: 2014-11-18 12:10:01 UTC
Type: Bug

Attachments
journal output (2.34 KB, text/plain)
2014-10-16 17:14 UTC, Dan Mossor [danofsatx]
Settings file (198 bytes, text/plain)
2014-10-16 17:15 UTC, Dan Mossor [danofsatx]

Description Dan Mossor [danofsatx] 2014-10-16 17:14:58 UTC
Created attachment 947696: journal output

Description of problem:
When a role deployment fails because of a typo or other misconfiguration in the settings file, rolekit can neither redeploy with a corrected file nor decommission the failed deployment.

Version-Release number of selected component (if applicable):

How reproducible:

Steps to Reproduce:
1. Create a settings.json file with incorrect information (for example, put quotes around an integer value, or give an incorrect IP address for primary_ip).
2. Run 'rolectl deploy --settings-file ./settings.json domaincontroller'.
3. The deployment fails with "Error: COMMAND_FAILED: 512".
4. Correct the settings.json file.
5. Attempt 'rolectl redeploy --settings-file ./settings.json domaincontroller/1'.
6. Attempt 'rolectl decommission domaincontroller/1'.
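
For illustration, the two mistakes named in step 1 can be caught before deploying. The following is a minimal Python sketch, not part of rolekit: the function name find_settings_problems is hypothetical, the "digits-only string" test is only a heuristic for a quoted integer, and primary_ip is the only key name taken from this report.

```python
import ipaddress
import json


def find_settings_problems(text):
    """Return a list of human-readable problems found in a settings JSON string.

    Hypothetical helper for illustration; rolekit does not ship this function.
    """
    try:
        settings = json.loads(text)
    except ValueError as exc:  # json.JSONDecodeError is a ValueError subclass
        return ["not valid JSON: %s" % exc]
    if not isinstance(settings, dict):
        return ["top-level value must be a JSON object"]

    problems = []
    for key, value in settings.items():
        # Heuristic: a string made up only of digits is probably a quoted integer.
        if isinstance(value, str) and value.isdigit():
            problems.append("%s: integer given as a string (%r)" % (key, value))

    if "primary_ip" in settings:
        ip = settings["primary_ip"]
        if not isinstance(ip, str):
            problems.append("primary_ip: expected a string, got %r" % (ip,))
        else:
            try:
                ipaddress.ip_address(ip)
            except ValueError:
                problems.append("primary_ip: not a valid IP address (%r)" % (ip,))
    return problems


if __name__ == "__main__":
    with open("settings.json") as handle:
        for problem in find_settings_problems(handle.read()):
            print(problem)
```

This only detects malformed values; an IP address that is well-formed but wrong for the host would still pass.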

Actual results:
redeploy will fail with "Error: UNKNOWN_SETTING: admin_password"
decommission will fail with "Error: COMMAND_FAILED: 256"

Expected results:
For redeploy, the deployment process should run again and complete.
For decommission, the role and instance should be deleted.

Additional info:
Attaching the journal output and the settings.json file I used.

Comment 1 Dan Mossor [danofsatx] 2014-10-16 17:15:32 UTC
Created attachment 947697: Settings file

Comment 2 Thomas Woerner 2014-11-13 16:05:38 UTC
Fixes for this are already in the rolekit git repository:

Remove instance again if verification of settings failed

rolebase, rolectl: New force option for decommission

With the first fix, the instance is removed again when the settings failure is one that rolekit itself can detect. Some checks are performed by the roles during deploy, and rolekit cannot easily detect those failures. For that case, and for other decommission errors, decommission has a new force option.
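
The two behaviours described here can be sketched roughly as follows. This is a toy Python model, not the rolekit code: InstanceRegistry, SettingsError, and the verify and cleanup callbacks are hypothetical names, and the assumption that the force option means "drop the instance even if cleanup fails" is an interpretation of the commit title, not something confirmed in this report.

```python
class SettingsError(Exception):
    """Raised when settings fail verification (hypothetical name)."""


class InstanceRegistry:
    """Toy in-memory stand-in for the set of deployed role instances."""

    def __init__(self):
        self.instances = {}

    def deploy(self, name, settings, verify):
        # Register the instance first, then verify its settings.
        self.instances[name] = settings
        try:
            verify(settings)
        except SettingsError:
            # Verification failed: remove the instance again so that a
            # corrected deployment can start from a clean state.
            del self.instances[name]
            raise

    def decommission(self, name, cleanup, force=False):
        try:
            cleanup(name)
        except Exception:
            if not force:
                # Without force, a failed cleanup leaves the instance in place.
                raise
            # With force (assumed semantics), cleanup errors are ignored.
        self.instances.pop(name, None)
```

In this model, a deployment whose settings fail verification leaves nothing behind, and an instance whose cleanup keeps failing can still be removed by passing force=True.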

Comment 3 Fedora Update System 2014-11-17 19:53:51 UTC
rolekit-0.1.2-1.fc21 has been submitted as an update for Fedora 21.

Comment 4 Fedora Update System 2014-11-18 12:10:01 UTC
rolekit-0.1.2-1.fc21 has been pushed to the Fedora 21 stable repository.  If problems still persist, please make note of it in this bug report.
