Bug 1348998
Summary: | Removing network fails with "internal server error" in HA environment | |||
---|---|---|---|---|
Product: | Red Hat OpenStack | Reporter: | Arie Bregman <abregman> | |
Component: | openstack-neutron | Assignee: | Hynek Mlnarik <hmlnarik> | |
Status: | CLOSED NOTABUG | QA Contact: | Toni Freger <tfreger> | |
Severity: | urgent | Docs Contact: | ||
Priority: | urgent | |||
Version: | 7.0 (Kilo) | CC: | abregman, amuller, chrisw, nyechiel, srevivo | |
Target Milestone: | async | Keywords: | AutomationBlocker, ZStream | |
Target Release: | 7.0 (Kilo) | |||
Hardware: | Unspecified | |||
OS: | Unspecified | |||
Whiteboard: | ||||
Fixed In Version: | Doc Type: | Bug Fix | ||
Doc Text: |
Cause:
Simultaneous deletion of subnet while running automatic allocation of DHCP ports cause exception thrown due to database deadlock or concurrently updated database rows.
Consequence:
Deleting of the subnet fails, either of the following exceptions is logged to neutron server log:
* StaleDataError: UPDATE statement on table 'ports' expected to update 1 row(s); 0 were matched.
* DBDeadlock: Deadlock found when trying to get lock; try restarting transaction
Fix:
Restart the transaction when this condition is encountered
Result:
Subnet is correctly deleted
|
Story Points: | --- | |
Clone Of: | ||||
: | 1351101 (view as bug list) | Environment: | ||
Last Closed: | 2016-09-12 14:59:52 UTC | Type: | Bug | |
Regression: | --- | Mount Type: | --- | |
Documentation: | --- | CRM: | ||
Verified Versions: | Category: | --- | ||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | ||
Cloudforms Team: | --- | Target Upstream Version: | ||
Embargoed: | ||||
Bug Depends On: | ||||
Bug Blocks: | 1351101 |
Description
Arie Bregman
2016-06-22 13:11:54 UTC
The primary cause of the issue is that DHCP ports are created while processing subnet_delete [1]. The bug was fixed in Liberty by retrying the operation [2]. [1] https://bugs.launchpad.net/neutron/+bug/1357055/comments/36 [2] https://review.openstack.org/#/c/171848/ The bug is triggered also by database deadlock that is resolved in [1]. Including the relevant commit. [1] https://review.openstack.org/#/c/191540/ 3 controllers The issue hasn't been resolved. It works for 2 controllers, but not for 3. We should identify why there is an internal server error when using 3 controllers. (In reply to Arie Bregman from comment #15) > The issue hasn't been resolved. > It works for 2 controllers, but not for 3. We should identify why there is > an internal server error when using 3 controllers. See comments 2 and 3. Reason is race between dhcp port creation and subnet deletion caused by tempest tests together with IP address pool starvation. The proper way to fix that is to fix the tempest tests or adjust number of controller nodes to reflect number of available IP addresses. |