Bug 1397114 - OPS Tools | Availability Monitoring | OS Checks | Current OS checks initial deployment on each overcloud node is not effective and gives a misleading information to the openstack administrator.
Summary: OPS Tools | Availability Monitoring | OS Checks | Current OS checks initial d...
Keywords:
Status: CLOSED CURRENTRELEASE
Alias: None
Product: Red Hat OpenStack
Classification: Red Hat
Component: sensu
Version: 10.0 (Newton)
Hardware: Unspecified
OS: Unspecified
unspecified
urgent
Target Milestone: ---
: ---
Assignee: Lars Kellogg-Stedman
QA Contact: Leonid Natapov
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2016-11-21 16:17 UTC by Leonid Natapov
Modified: 2016-12-16 16:51 UTC (History)
6 users (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Environment:
Last Closed: 2016-12-16 16:51:01 UTC
Target Upstream Version:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
RDO 3898 0 None None None 2016-12-08 08:40:02 UTC

Description Leonid Natapov 2016-11-21 16:17:58 UTC
OPS Tools | Availability Monitoring | OS Checks | Current OS checks initial deployment on each overcloud node is not effective  and gives a misleading information to the ops tools administrator.


Current oschecks are checking openstack as a unit and not checking each overcloud node. Checks are running against Virtual IP and checking whether certain API responding or not. Each check runs against only 1 controller or compute. The one that holds Virtual IP. So ,obviously,if some of controllers or computes will be down or certain services on them will be down (HA scenario) it won't effect openstack as a unit and oschecks will still report "ok" status.


The problem is that by deploying and running those checks on each overcloud node we provide to a user a misleading information and it looks like all the checks are being executed on EACH overcloud node checking API status on EACH overcloud node when in fact it's being checked only against 1 node.

Moreover if ,for some reason, one of controllers/computes will have a problem with openstack services it still will be reported as "ok" in Availability Monitoring UI (Uchiwa).

Comment 1 Martin Magr 2016-11-21 16:37:52 UTC
Removing OSP flag as check configuration is server side.

Comment 2 Martin Magr 2016-11-22 12:00:23 UTC
Check configuration is performed on server side and as such it cannot be marked for OSP and cannot be blocker.

Comment 6 Martin Magr 2016-12-08 08:40:03 UTC
Patch was merged to opstools-ansible.

Comment 8 Leonid Natapov 2016-12-12 10:17:20 UTC
Current opstools-ansible build includes systemd checks for openstack services on each overcloud node.


Note You need to log in before you can comment on or make changes to this bug.