Bugzilla will be upgraded to version 5.0 on a still to be determined date in the near future. The original upgrade date has been delayed.
Bug 1519783 - [Netvirt] Networking-odl trying to delete a router which was never pushed to ODL
[Netvirt] Networking-odl trying to delete a router which was never pushed to ODL
Status: CLOSED ERRATA
Product: Red Hat OpenStack
Classification: Red Hat
Component: python-networking-odl (Show other bugs)
12.0 (Pike)
Unspecified Unspecified
high Severity high
: z1
: 13.0 (Queens)
Assigned To: Mike Kolesnik
Tomas Jamrisko
Netvirt
: Triaged, ZStream
: 1556686 1569696 (view as bug list)
Depends On:
Blocks:
  Show dependency treegraph
 
Reported: 2017-12-01 08:02 EST by Sridhar Gaddam
Modified: 2018-10-18 03:20 EDT (History)
10 users (show)

See Also:
Fixed In Version: python-networking-odl-12.0.1-0.20180228140715.8733cf6.el7ost
Doc Type: Known Issue
Doc Text:
Neutron may issue an error claiming that the Quota has been exceed for Neutron Router creation. This is a known issue where multiple router resources are created with a single create request in Neutron DB due to a bug with networking-odl. The workaround for this issue is to delete the duplicated routers using the OpenStack Neutron CLI and create a router again, resulting with a single instance.
Story Points: ---
Clone Of:
Environment:
N/A
Last Closed: 2018-07-19 09:53:05 EDT
Type: Bug
Regression: ---
Mount Type: ---
Documentation: ---
CRM:
Verified Versions:
Category: ---
oVirt Team: ---
RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: ---


Attachments (Terms of Use)
Neutron Server Logs (1.10 MB, application/x-xz)
2017-12-01 08:13 EST, Sridhar Gaddam
no flags Details


External Trackers
Tracker ID Priority Status Summary Last Updated
Launchpad 1738246 None None None 2017-12-14 12:39 EST
OpenStack gerrit 570775 None master: MERGED networking-odl: Retry journal recording in L3 (Ibef0db9a58d86a85ffd948a35db983863999d319) 2018-07-02 10:17 EDT
Red Hat Product Errata RHBA-2018:2215 None None None 2018-07-19 09:53 EDT

  None (edit)
Description Sridhar Gaddam 2017-12-01 08:02:54 EST
Description of problem:
When running Browbeat+Rally scenario with netcreate-boot-ping scenario (where we spawn a VM, associate a FIP and then try to ping the FIP with concurrency set to 5 and times set to 500), we observed the following in neutron server logs.

Setup:
OSP12 image
ODL running in a non-cluster, non-container setup with 1 controller and 32 compute nodes.
Port-status is disabled in this setup and inactivity_probe/max_backoff are appropriately configured.

RPMs used:
python-networking-odl-11.0.0-2.el7ost.noarch
opendaylight-6.2.0-5.el7ost.noarch

2017-12-01 12:05:16.021 948413 ERROR networking_odl.common.client Traceback (most recent call last):
2017-12-01 12:05:16.021 948413 ERROR networking_odl.common.client   File "/usr/lib/python2.7/site-packages/networking_odl/common/client.py", line 136, in _check_response
2017-12-01 12:05:16.021 948413 ERROR networking_odl.common.client     response.raise_for_status()
2017-12-01 12:05:16.021 948413 ERROR networking_odl.common.client   File "/usr/lib/python2.7/site-packages/requests/models.py", line 862, in raise_for_status
2017-12-01 12:05:16.021 948413 ERROR networking_odl.common.client     raise HTTPError(http_error_msg, response=self)
2017-12-01 12:05:16.021 948413 ERROR networking_odl.common.client HTTPError: 404 Client Error: Not Found for url: http://172.16.0.13:8081/controller/nb/v2/neutron/routers/659a6590-1523-49c7-a845-b5b195e3505c
2017-12-01 12:05:16.021 948413 ERROR networking_odl.common.client
2017-12-01 12:05:16.022 948413 ERROR networking_odl.common.client [req-ec69f007-0638-46bd-b884-8aa140469c4d - - - - -] REST request ( delete ) to url ( routers/659a6590-1523-49c7-a845-b5b195e3505c ) is failed. Request body : [None] service: HTTPError: 404 Client Error: Not Found for url: http://172.16.0.13:8081/controller/nb/v2/neutron/routers/659a6590-1523-49c7-a845-b5b195e3505c
2017-12-01 12:05:16.023 948413 ERROR networking_odl.journal.journal [req-ec69f007-0638-46bd-b884-8aa140469c4d - - - - -] Error while processing (Entry ID: 3013) - delete router 659a6590-1523-49c7-a845-b5b195e3505c: HTTPError: 404 Client Error: Not Found for url: http://172.16.0.13:8081/controller/nb/v2/neutron/routers/659a6590-1523-49c7-a845-b5b195e3505c
2017-12-01 12:05:16.023 948413 ERROR networking_odl.journal.journal Traceback (most recent call last):
2017-12-01 12:05:16.023 948413 ERROR networking_odl.journal.journal   File "/usr/lib/python2.7/site-packages/networking_odl/journal/journal.py", line 298, in _sync_entry
2017-12-01 12:05:16.023 948413 ERROR networking_odl.journal.journal     self.client.sendjson(method, urlpath, to_send)
2017-12-01 12:05:16.023 948413 ERROR networking_odl.journal.journal   File "/usr/lib/python2.7/site-packages/networking_odl/common/client.py", line 106, in sendjson
2017-12-01 12:05:16.023 948413 ERROR networking_odl.journal.journal     'body': obj})
2017-12-01 12:05:16.023 948413 ERROR networking_odl.journal.journal   File "/usr/lib/python2.7/site-packages/oslo_utils/excutils.py", line 220, in __exit__
2017-12-01 12:05:16.023 948413 ERROR networking_odl.journal.journal     self.force_reraise()
2017-12-01 12:05:16.023 948413 ERROR networking_odl.journal.journal   File "/usr/lib/python2.7/site-packages/oslo_utils/excutils.py", line 196, in force_reraise
2017-12-01 12:05:16.023 948413 ERROR networking_odl.journal.journal     six.reraise(self.type_, self.value, self.tb)
2017-12-01 12:05:16.023 948413 ERROR networking_odl.journal.journal   File "/usr/lib/python2.7/site-packages/networking_odl/common/client.py", line 98, in sendjson
2017-12-01 12:05:16.023 948413 ERROR networking_odl.journal.journal     self.request(method, urlpath, data))
2017-12-01 12:05:16.023 948413 ERROR networking_odl.journal.journal   File "/usr/lib/python2.7/site-packages/networking_odl/common/client.py", line 140, in _check_response
2017-12-01 12:05:16.023 948413 ERROR networking_odl.journal.journal     {'e': error, 'text': response.text}, exc_info=1)
2017-12-01 12:05:16.023 948413 ERROR networking_odl.journal.journal   File "/usr/lib/python2.7/site-packages/oslo_utils/excutils.py", line 220, in __exit__
2017-12-01 12:05:16.023 948413 ERROR networking_odl.journal.journal     self.force_reraise()
2017-12-01 12:05:16.023 948413 ERROR networking_odl.journal.journal   File "/usr/lib/python2.7/site-packages/oslo_utils/excutils.py", line 196, in force_reraise
2017-12-01 12:05:16.023 948413 ERROR networking_odl.journal.journal     six.reraise(self.type_, self.value, self.tb)
2017-12-01 12:05:16.023 948413 ERROR networking_odl.journal.journal   File "/usr/lib/python2.7/site-packages/networking_odl/common/client.py", line 136, in _check_response
2017-12-01 12:05:16.023 948413 ERROR networking_odl.journal.journal     response.raise_for_status()
2017-12-01 12:05:16.023 948413 ERROR networking_odl.journal.journal   File "/usr/lib/python2.7/site-packages/requests/models.py", line 862, in raise_for_status
2017-12-01 12:05:16.023 948413 ERROR networking_odl.journal.journal     raise HTTPError(http_error_msg, response=self)
2017-12-01 12:05:16.023 948413 ERROR networking_odl.journal.journal HTTPError: 404 Client Error: Not Found for url: http://172.16.0.13:8081/controller/nb/v2/neutron/routers/659a6590-1523-49c7-a845-b5b195e3505c
2017-12-01 12:05:16.023 948413 ERROR networking_odl.journal.journal

I took a random neutron router UUID (i.e., 21e9ccb0-cb43-40d6-811c-f63fcbce8607) for which we were seeing the HTTPError 404.
In the logs, there is no "create router *" for this UUID. The first reference to this UUID was as part of router_gateway port creation.
Then we see the following log

2017-12-01 12:07:08.088 948417 DEBUG neutron.api.rpc.agentnotifiers.l3_rpc_agent_api [req-b5532c6b-c2c0-46b5-a461-1ad7a04190a8 4f3065ec9a8b40ab8291b285cdd33fdd f505a5e332c5463c92ac74c48badc286 - default default] Fanout notify agent at l3_agent the message router_deleted on router 21e9ccb0-cb43-40d6-811c-f63fcbce8607 _notification_fanout /usr/lib/python2.7/site-packages/neutron/api/rpc/agentnotifiers/l3_rpc_agent_api.py:118

So something seems to be going wrong on the neutron/networking-odl side as its trying to delete a resource (i.e., router) which was never pushed to ODL.
Comment 1 Sridhar Gaddam 2017-12-01 08:13 EST
Created attachment 1361601 [details]
Neutron Server Logs
Comment 2 Sai Sindhur Malleni 2017-12-01 08:22:46 EST
Similar Bug from last iteration: https://bugzilla.redhat.com/show_bug.cgi?id=1487794
Comment 3 Mike Kolesnik 2017-12-12 05:17:55 EST
Is there an upstream bug for this?
Comment 4 Sridhar Gaddam 2017-12-14 12:39:10 EST
Upstream bug: https://bugs.launchpad.net/networking-odl/+bug/1738246
Comment 5 Mike Kolesnik 2018-02-19 06:40:12 EST
This should be fixed as part of L3 Flavors, which should be available in OSP 14
Comment 7 Mike Kolesnik 2018-04-22 03:03:11 EDT
*** Bug 1569696 has been marked as a duplicate of this bug. ***
Comment 8 Mike Kolesnik 2018-05-21 08:11:52 EDT
*** Bug 1556686 has been marked as a duplicate of this bug. ***
Comment 9 Mike Kolesnik 2018-05-23 04:40:20 EDT
I will work on a stopgap solution for this that sjhould cover 99% of the affected cases.
The complete solution would be L3 Falvors which will be available in OSP 14.
Comment 11 jamo luhrsen 2018-05-23 12:43:32 EDT
(In reply to Mike Kolesnik from comment #9)
> I will work on a stopgap solution for this that sjhould cover 99% of the
> affected cases.
> The complete solution would be L3 Falvors which will be available in OSP 14.

Thanks Mike. Let me know if I can help. I can usually get a setup to reproduce
this pretty easily and can try/verify/test things as needed.
Comment 12 Mike Kolesnik 2018-05-29 07:58:27 EDT
Posted upstream for review
Comment 23 errata-xmlrpc 2018-07-19 09:53:05 EDT
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHBA-2018:2215

Note You need to log in before you can comment on or make changes to this bug.