Bug 1571864
Summary: | FFU: RHELRegistration resource hangs on DELETE | |||
---|---|---|---|---|
Product: | Red Hat OpenStack | Reporter: | Andrew Austin <aaustin> | |
Component: | openstack-tripleo-heat-templates | Assignee: | Jiri Stransky <jstransk> | |
Status: | CLOSED CURRENTRELEASE | QA Contact: | Marius Cornea <mcornea> | |
Severity: | urgent | Docs Contact: | ||
Priority: | urgent | |||
Version: | 13.0 (Queens) | CC: | agurenko, augol, jamsmith, jschluet, jstransk, lbezdick, mbracho, mburns, mcornea, morazi, pveiga, rrubins | |
Target Milestone: | rc | Keywords: | TestOnly, Triaged | |
Target Release: | 13.0 (Queens) | |||
Hardware: | Unspecified | |||
OS: | Unspecified | |||
Whiteboard: | ||||
Fixed In Version: | Doc Type: | Known Issue | ||
Doc Text: |
Temporary removal of Heat stack resources during fast-forward upgrade preparation triggers RHEL unregistration.
As a result, RHEL unregistration is stalled because Heat software deployment signalling does not work properly.
To avoid the problem, while the overcloud is still on OSP 10 and ready to perform the last overcloud minor version update:
1. Edit the template file /usr/share/openstack-tripleo-heat-templates/extraconfig/pre_deploy/rhel-registration/rhel-registration.yaml
2. Delete RHELUnregistration and RHELUnregistrationDeployment resources from the template.
3. Proceed with the minor update and fast-forward upgrade procedure.
|
Story Points: | --- | |
Clone Of: | ||||
: | 1574601 1574610 (view as bug list) | Environment: | ||
Last Closed: | 2018-06-28 15:17:09 UTC | Type: | Bug | |
Regression: | --- | Mount Type: | --- | |
Documentation: | --- | CRM: | ||
Verified Versions: | Category: | --- | ||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | ||
Cloudforms Team: | --- | Target Upstream Version: | ||
Embargoed: | ||||
Bug Depends On: | 1547091, 1574601, 1574610 | |||
Bug Blocks: |
Description
Andrew Austin
2018-04-25 14:46:29 UTC
When attempting to manually trigger the signal with curl, the following error is returned by Heat: <ErrorResponse><Error><Message>A bad or out-of-range value was supplied:signal is not supported for resource. Traceback (most recent call last): File "/usr/lib/python2.7/site-packages/heat/common/context.py", line 409, in wrapped return func(self, ctx, *args, **kwargs) File "/usr/lib/python2.7/site-packages/heat/engine/service.py", line 1824, in resource_signal _resource_signal(stack, rsrc, details, False) File "/usr/lib/python2.7/site-packages/heat/engine/service.py", line 1789, in _resource_signal needs_metadata_updates = rsrc.signal(details, need_check) File "/usr/lib/python2.7/site-packages/heat/engine/resource.py", line 2508, in signal self._handle_signal(details) File "/usr/lib/python2.7/site-packages/heat/engine/resource.py", line 2453, in _handle_signal raise exception.ResourceActionNotSupported(action='signal') ResourceActionNotSupported: signal is not supported for resource. We hit this during testing FFWD, it's a blocker as the overcloud Heat stack effectively got into a state which was (AFAIU) unrecoverable without manual editing of the DB. I think we managed to work around the issue by manually applying patches on the OSP 10 templates before going through FFWD procedure. I think the patches were probably: https://review.openstack.org/#/c/522033/ https://review.openstack.org/#/c/546144/ but will confirm with Andrew / Randy. ^ If the above is correct, we may have already patched OSP 10.z ready, through bug 1547091. At another look, it seems that we'll also need this one: https://review.openstack.org/#/c/492970 You'll also need https://review.openstack.org/#/c/558541/ For me modifying the software_deployment table in heat from IN-PROGRESS to COMPLETE, got the ‘ffwd-upgrade prepare’ unstuck (verified 3 times now). Also, had to reregister all overcloud nodes before ‘ffwd-upgrade run’. It would be nice if we could have 'DeleteOnRHELUnregistration: True' or something like that in OSP10, based on https://review.openstack.org/#/c/492970 With #1574610 we should just document the issue in FFU now. The 10 backport is MODIFIED so i'll move this at least to POST. Jirka, I am thinking you are saying this bug is really meant as a queue to retest once all the bits have landed in OSP 10. Are there any merges that we need to track in this bug against 13 or should this be TestOnly or something similar? Yes exactly, we're just waiting for the dependent BZ to land in OSP 10 puddle so that we can retest, i was unsure about how to properly reflect that in the state of this BZ. I'm adding TestOnly keyword. Should this stay in POST or move to some other state? Thanks TestOnly is right flag to add, and once we have build in a puddle you can test with update this BZ to ON_QA Bug 1574610 is now ON_QA, moving this bug to ON_QA too. |