Bug 1353915
Summary: | [RFE] Add to scripts generated by undercloud install - script to replace failed TripleO QuickStart HA Controller | ||
---|---|---|---|
Product: | [Community] RDO | Reporter: | Boris Derzhavets <bderzhavets> |
Component: | openstack-tripleo | Assignee: | James Slagle <jslagle> |
Status: | CLOSED INSUFFICIENT_DATA | QA Contact: | Shai Revivo <srevivo> |
Severity: | urgent | Docs Contact: | |
Priority: | unspecified | ||
Version: | trunk | CC: | amedeo.salvati, chris.brown, jtrowbri, lars |
Target Milestone: | --- | ||
Target Release: | trunk | ||
Hardware: | x86_64 | ||
OS: | Linux | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | If docs needed, set a value | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2017-06-18 12:06:31 UTC | Type: | Bug |
Description
Boris Derzhavets
2016-07-08 12:03:27 UTC
Workaround: if just one overcloud-controller-(X) was stopped and started, creating a new dummy router will help.

[root@overcloud-controller-0 ~]# neutron l3-agent-list-hosting-router Router01
+---------------------------------+---------------------------------+----------------+-------+----------+
| id                              | host                            | admin_state_up | alive | ha_state |
+---------------------------------+---------------------------------+----------------+-------+----------+
| 1fd8b44b-265f-                  | overcloud-                      | True           | xxx   | standby  |
| 4e05-a4e3-cf8eb26027bd          | controller-1.localdomain        |                |       |          |
| 2b027242-c6e1-4122-9e01-01fcd3b | overcloud-controller-0          | True           | :-)   | active   |
| 30e3f                           |                                 |                |       |          |
| 377c6968-05ee-457c-             | overcloud-controller-2          | True           | :-)   | standby  |
| acb3-f910a2ce3df5               |                                 |                |       |          |
+---------------------------------+---------------------------------+----------------+-------+----------+

Creating Router02 results in:

[root@overcloud-controller-0 ~]# neutron l3-agent-list-hosting-router Router02
+---------------------------------+---------------------------------+----------------+-------+----------+
| id                              | host                            | admin_state_up | alive | ha_state |
+---------------------------------+---------------------------------+----------------+-------+----------+
| 377c6968-05ee-457c-             | overcloud-controller-2          | True           | :-)   | standby  |
| acb3-f910a2ce3df5               |                                 |                |       |          |
| 2b027242-c6e1-4122-9e01-01fcd3b | overcloud-controller-0          | True           | :-)   | active   |
| 30e3f                           |                                 |                |       |          |
| 1fd8b44b-265f-                  | overcloud-                      | True           | :-)   | standby  |
| 4e05-a4e3-cf8eb26027bd          | controller-1.localdomain        |                |       |          |
+---------------------------------+---------------------------------+----------------+-------+----------+

[root@overcloud-controller-0 ~]# neutron l3-agent-list-hosting-router Router01
+---------------------------------+---------------------------------+----------------+-------+----------+
| id                              | host                            | admin_state_up | alive | ha_state |
+---------------------------------+---------------------------------+----------------+-------+----------+
| 1fd8b44b-265f-                  | overcloud-                      | True           | :-)   | standby  |
| 4e05-a4e3-cf8eb26027bd          | controller-1.localdomain        |                |       |          |
| 2b027242-c6e1-4122-9e01-01fcd3b | overcloud-controller-0          | True           | :-)   | active   |
| 30e3f                           |                                 |                |       |          |
| 377c6968-05ee-457c-             | overcloud-controller-2          | True           | :-)   | standby  |
| acb3-f910a2ce3df5               |                                 |                |       |          |
+---------------------------------+---------------------------------+----------------+-------+----------+

This works only for routers created under the admin account. For an ordinary tenant, creating a new neutron router only allows switching to the newly created router, which obviously breaks any VM attached to a private network that served as an interface of the active neutron router before `nova stop/start overcloud-controller-(X)`.

If a crashed overcloud-controller-(X) can be recovered by running a special heat template, please advise.

I attempted to follow http://docs.openstack.org/developer/tripleo-docs/post_deployment/replace_controller.html
It is not quite clear how to update overcloud-deploy.sh to replace a failed controller node.

Upstream requested clarification on the LP bug, but none was given, so closing as stale.
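The liveness check in the workaround from the description (looking for `xxx` in the `alive` column) could be scripted rather than read by eye. The following is only a minimal sketch: the function names `parse_l3_agent_table` and `dead_agents` are hypothetical and not part of any OpenStack tooling, and the parser assumes the table layout shown in the description, where long `id` and `host` values wrap onto a second row.

```python
# Sample output of `neutron l3-agent-list-hosting-router Router01`,
# copied from the description (controller-1 reported as not alive).
SAMPLE = """
+---------------------------------+---------------------------------+----------------+-------+----------+
| id                              | host                            | admin_state_up | alive | ha_state |
+---------------------------------+---------------------------------+----------------+-------+----------+
| 1fd8b44b-265f-                  | overcloud-                      | True           | xxx   | standby  |
| 4e05-a4e3-cf8eb26027bd          | controller-1.localdomain        |                |       |          |
| 2b027242-c6e1-4122-9e01-01fcd3b | overcloud-controller-0          | True           | :-)   | active   |
| 30e3f                           |                                 |                |       |          |
| 377c6968-05ee-457c-             | overcloud-controller-2          | True           | :-)   | standby  |
| acb3-f910a2ce3df5               |                                 |                |       |          |
+---------------------------------+---------------------------------+----------------+-------+----------+
"""


def parse_l3_agent_table(text):
    """Parse the ASCII table into a list of dicts, merging wrapped rows."""
    header = None
    rows = []
    for line in text.splitlines():
        line = line.strip()
        if not line.startswith("|"):
            continue  # skip border lines and blank lines
        cells = [cell.strip() for cell in line.strip("|").split("|")]
        if header is None:
            header = cells  # first data line holds the column names
            continue
        if rows and cells[header.index("alive")] == "":
            # Continuation row: append wrapped fragments to the previous row.
            for key, fragment in zip(header, cells):
                rows[-1][key] += fragment
        else:
            rows.append(dict(zip(header, cells)))
    return rows


def dead_agents(rows):
    """Return hosts whose L3 agent is not reported alive (':-)')."""
    return [row["host"] for row in rows if row["alive"] != ":-)"]


if __name__ == "__main__":
    for host in dead_agents(parse_l3_agent_table(SAMPLE)):
        print("L3 agent not alive on:", host)
```

In practice the text would come from running the command on a controller and capturing its output; the sketch only reports which hosts look dead and does not attempt any recovery.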