Bug 1286302
| Summary: | rhel-osp-director: No "active" entry in "L3 agents hosting a router" listing after replacing a controller in HA deployment. | ||
|---|---|---|---|
| Product: | Red Hat OpenStack | Reporter: | Alexander Chuzhoy <sasha> |
| Component: | openstack-neutron | Assignee: | Miguel Angel Ajo <majopela> |
| Status: | CLOSED ERRATA | QA Contact: | Alexander Chuzhoy <sasha> |
| Severity: | high | Docs Contact: | |
| Priority: | high | ||
| Version: | 8.0 (Liberty) | CC: | amuller, chrisw, jcoufal, jschluet, majopela, mandreou, mburns, mcornea, mlopes, nlevinki, nyechiel, rhel-osp-director-maint, sasha, srevivo |
| Target Milestone: | async | Keywords: | TestOnly, ZStream |
| Target Release: | 8.0 (Liberty) | ||
| Hardware: | Unspecified | ||
| OS: | Unspecified | ||
| Whiteboard: | |||
| Fixed In Version: | openstack-neutron-7.0.0-2.el7 | Doc Type: | Bug Fix |
| Doc Text: |
Previously, using 'neutron-netns-cleanup' when manually taking down a node from an HA cluster would not properly clean up processes in the neutron L3-HA routers. Consequently, when the node was connected again to the cluster, and services were re-created, the processes would not properly respawn with the right connectivity. As a result, even if the processes were alive, they were disconnected; this sometimes led to a situation where no L3-HA router was able to take the 'ACTIVE' role.
With this update, the 'neutron-netns-cleanup' scripts and related OCF resources have been fixed to kill the relevant keepalived processes and child processes.
As a result, nodes can be taken off the cluster and back, and the resources will be properly cleaned up when taken off the cluster, and restored when taken back.
|
Story Points: | --- |
| Clone Of: | Environment: | ||
| Last Closed: | 2016-05-12 16:24:11 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | Category: | --- | |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | |||
| Bug Depends On: | 1313529, 1326507, 1338623 | ||
| Bug Blocks: | |||
|
Description
Alexander Chuzhoy
2015-11-27 22:33:47 UTC
This is fixed in a later version of netns-cleanup @ https://review.gerrithub.io/#/c/248931/1/neutron-netns-cleanup.init. If that is backportable for 8.0 then we should do that. If not we should document the workaround as Marios describes above. Either way this is not a director bug AFAICT. I have reassigned it to Neutron. Miguel, I see that it's in Mitaka and Liberty in Delorean, can you look in to availability of the fix in OSP 8? I was sure we had this closed. I see that the patch is available in OSP 8 rhos-8.0-rhel-7 branch of Neutron. Ooops, sorry, I missed this bz assignment. Checking Yes, as @assaf said, this was introduced in the rhos-8.0-rhel-7 branch, specifically in openstack-neutron-7.0.0-2.el7 Automation passed https://rhos-jenkins.rhev-ci-vms.eng.rdu2.redhat.com/view/RHOS/view/RHOS8/job/ospd_qe-8_director-puddle-rhel-7.2-virthost-1cont_1comp_1ceph-three-nics-vlans-ceph_internal-ipv4-vxlan-ssl/ openstack-neutron-7.0.4-2.el7ost.noarch.rpm installed Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://rhn.redhat.com/errata/RHBA-2016-1063.html |