Bug 1434509
| Summary: | openstack overcloud update stack -i overcloud is stuck | ||
|---|---|---|---|
| Product: | Red Hat OpenStack | Reporter: | Michele Baldessari <michele> |
| Component: | openstack-heat | Assignee: | Zane Bitter <zbitter> |
| Status: | CLOSED CURRENTRELEASE | QA Contact: | Amit Ugol <augol> |
| Severity: | high | Docs Contact: | |
| Priority: | high | ||
| Version: | 10.0 (Newton) | CC: | cpaquin, hjensas, mandreou, mburns, mschuppe, rhel-osp-director-maint, sathlang, sbaker, shardy, srevivo, therve, zbitter |
| Target Milestone: | --- | Keywords: | ZStream |
| Target Release: | --- | ||
| Hardware: | All | ||
| OS: | Linux | ||
| Whiteboard: | |||
| Fixed In Version: | python-heatclient-1.5.0-3.el7ost | Doc Type: | If docs needed, set a value |
| Doc Text: | Story Points: | --- | |
| Clone Of: | Environment: | ||
| Last Closed: | 2017-06-29 14:34:16 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | Category: | --- | |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | |||
| Bug Depends On: | 1436712, 1437016, 1457208, 1520109 | ||
| Bug Blocks: | |||
|
Description
Michele Baldessari
2017-03-21 16:02:47 UTC
EC2 signatures should not expire, so I'd have to assume that it's due to the user associated with the deployment being deleted. That could happen when the resource is being replaced, but in that case we shouldn't be continuing to use the old user's credentials. We haven't found any indication that the 403 was an issue. We found some issue with hooks though. I opened https://bugzilla.redhat.com/show_bug.cgi?id=1436712 for making heat hook-poll work again. We need another one for tripleoclient/tripleo-common. I opened https://bugzilla.redhat.com/show_bug.cgi?id=1437016 for tripleoclient Here is the situation as far as I understand it wrt 403s on the os-collect-config side 403s are expected when 55-heat-config or 99-refresh-completed attempt to signal a resource which no longer exists (which could happen for a few reasons, database restore, multiple stack updates spanning a network outage). For deployment resources which call hooks (group: script|puppet), signalling happens via 55-heat-config -> heat-config-notify. If the response is a 403, 55-heat-config will continue to process other deployments. For os-apply-config resources (group: os-apply-config) signalling happens via 99-refresh-completed and due to bug 1285495 any 403 will result in later os-apply-config resources not being signalled. This is only an issue in OSP-10 since in OSP-11 onwards os-apply-config is handled by a hook, so signalling happens via 55-heat-config. bug 1285495 should still be fixed for OSP-10 if 403s are causing *other* os-apply-config resources to never be signalled - it is not clear whether that is the case here. Unless there is another bug in heat when users are being deleted when they shouldn't be, fixing bug 1285495 in OSP-10 will make all 403s harmless. If it is common enough that a stuck deployment resource prevents minor updates like this, it might be useful to build a client tool which interactively shows the user what deployment is still waiting for a signal and gives them the option of sending a fake signal which will indicate a COMPLETE or FAILED for that resource. Resolved by the fix for bug 1436712. |