Bug 1526755
Summary: | DVR: a test which adds a router to L3 agent fails with a conflict | ||
---|---|---|---|
Product: | Red Hat OpenStack | Reporter: | Arie Bregman <abregman> |
Component: | openstack-tempest | Assignee: | Slawek Kaplonski <skaplons> |
Status: | CLOSED ERRATA | QA Contact: | Toni Freger <tfreger> |
Severity: | high | Docs Contact: | |
Priority: | high | ||
Version: | 10.0 (Newton) | CC: | amuller, apevec, bcafarel, bhaley, chrisw, lhh, ragiman, skaplons, slinaber, srevivo, udesale |
Target Milestone: | --- | Keywords: | AutomationBlocker, Triaged, ZStream |
Target Release: | 10.0 (Newton) | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | |||
Fixed In Version: | openstack-tempest-13.0.0-22.bafe630git.el7ost | Doc Type: | If docs needed, set a value |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2019-04-30 16:59:39 UTC | Type: | Bug |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: |
Description
Arie Bregman
2017-12-17 09:33:07 UTC
Jenkins is unstable, need to wait for further triaging. Downstream jenkins was unstable from what I remember right before the shutdown, with failures unrelated to neutron. I will look at this again now. It's still there in OSP 10, here's a newer link: https://rhos-qe-jenkins.rhev-ci-vms.eng.rdu2.redhat.com/view/DFG/view/network/view/neutron/job/DFG-network-neutron-10_director-rhel-virthost-3cont_2comp-ipv4-vxlan-dvr/lastCompletedBuild/testReport/tempest.api.network.admin.test_l3_agent_scheduler/L3AgentSchedulerTestJSON/test_add_list_remove_router_on_l3_agent_id_9464e5e7_8625_49c3_8fd1_89c52be59d66_/ Probably just a missing backport. So there is bug in tempest test in https://github.com/openstack/tempest/blob/master/tempest/api/network/admin/test_l3_agent_scheduler.py#L68 In case when router is not HA router it should be scheduled only to one L3 agent at a time. Sometimes in multimode environment it may happen that router is scheduled automatically to eg. agent 1 but test tries to add it to agent 2. That will fail with conflict, which is raised in neutron: https://github.com/openstack/neutron/blob/f6c6be78eeab6a4f621d3ecf95875c539cf4f0b2/neutron/db/l3_agentschedulers_db.py#L134 In case when router is HA, then there is no this problem as router can be then scheduled to more than one L3 agent. It don't happens in our OSP-13 and OSP-14 CI because we have there configured l3_ha=True in neutron config. So every router is HA by default. In OSP-10 we have this option configured to false so that's why it is failing sometimes. I think that this should be fixed on tempest side and we should always create ha router there - that will avoid similar issues regardless of neutron config. Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2019:0922 |