Bug 583788 - status check program for vm.sh & user-controlled error tolerance
Summary: status check program for vm.sh & user-controlled error tolerance
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Enterprise Linux 5
Classification: Red Hat
Component: rgmanager
Version: 5.4
Hardware: All
OS: Linux
Priority: urgent
Severity: medium
Target Milestone: rc
Target Release: ---
Assignee: Lon Hohberger
QA Contact: Brandon Perkins
URL:
Whiteboard:
Duplicates: 591230
Depends On:
Blocks: 583778 583789 594476 595610 599643 606480 606754
 
Reported: 2010-04-19 17:56 UTC by Perry Myers
Modified: 2018-11-14 19:37 UTC (History)

Fixed In Version: rgmanager-2.0.52-6.10.el5
Doc Type: Enhancement
Doc Text:
Clone Of:
Clones: 583789
Environment:
Last Closed: 2011-01-13 23:25:22 UTC
Target Upstream Version:
Embargoed:


Attachments
Patch 1/4 (1.28 KB, patch)
2010-05-20 14:31 UTC, Lon Hohberger
Patch 2/4 (4.86 KB, patch)
2010-05-20 14:32 UTC, Lon Hohberger
Patch 3/4 (3.11 KB, patch)
2010-05-20 14:32 UTC, Lon Hohberger
Patch 4/4 (2.34 KB, patch)
2010-05-20 14:32 UTC, Lon Hohberger
Patch 4/4 (2.39 KB, patch)
2010-06-03 18:32 UTC, Lon Hohberger
Status program specifically for RHEV (uses libcurl) (1.86 KB, text/plain)
2010-06-04 20:08 UTC, Lon Hohberger
Shell script replacing option 1, making it easy to use but not introducing a universal dependency on libcurl (178 bytes, text/plain)
2010-06-04 21:04 UTC, Lon Hohberger
Updated status script with logging (376 bytes, text/plain)
2010-06-04 22:23 UTC, Lon Hohberger


Links
System: Red Hat Product Errata
ID: RHBA-2011:0134
Private: 0
Priority: normal
Status: SHIPPED_LIVE
Summary: rgmanager bug fix and enhancement update
Last Updated: 2011-01-12 19:20:47 UTC

Description Perry Myers 2010-04-19 17:56:59 UTC
Description of problem:
vm.sh presently checks only the health of the VM itself, not the health of any services running inside it.  On RHEL systems, general support for in-guest service health can be provided by running a cluster inside the guest, but a more lightweight solution would be a way to query the guest for its state.

If the in-guest service implements a 'status check web page', a simple curl command can check the state of the services in the guest, and that information can be used to make decisions about guest relocation or restart.

This bug is to implement generic support in vm.sh for checking an in-guest URL.  The change requires a configuration option for the URL to check.  An HTTP response of 200 should indicate that the guest services are healthy, while any non-200 response would cause rgmanager to fail over the VM.
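
For illustration only, the check described above could be sketched in shell roughly as follows. This is not the code from the attached patches: the function names is_healthy_code and check_guest_url, the 10-second timeout, and the example URL are all assumptions made for this sketch.

```shell
#!/bin/sh
# Sketch of an in-guest URL check: succeed only when the URL answers HTTP 200.
# Function names and the timeout value are illustrative assumptions.

# Return 0 if the given HTTP status code means "healthy" (exactly 200).
is_healthy_code() {
    [ "$1" = "200" ]
}

# Fetch the URL given as $1 and test its status code.
#   -s            silent, no progress output
#   -o /dev/null  discard the response body
#   -w            print only the numeric HTTP status code
#   --max-time    give up after 10 seconds (arbitrary value for this sketch)
# If the request fails outright, curl prints 000, which is treated as unhealthy.
check_guest_url() {
    code=$(curl -s -o /dev/null --max-time 10 -w '%{http_code}' "$1")
    is_healthy_code "$code"
}
```

A caller would run something like `check_guest_url http://GUEST_IP_ADDRESS/status` (a placeholder URL) and treat a non-zero exit status as a failed status check.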

Comment 10 Lon Hohberger 2010-05-20 14:31:38 UTC
Created attachment 415427 [details]
Patch 1/4

Backport from STABLE3 branch

Comment 11 Lon Hohberger 2010-05-20 14:32:01 UTC
Created attachment 415428 [details]
Patch 2/4

Comment 12 Lon Hohberger 2010-05-20 14:32:29 UTC
Created attachment 415430 [details]
Patch 3/4

Comment 13 Lon Hohberger 2010-05-20 14:32:48 UTC
Created attachment 415433 [details]
Patch 4/4

Comment 15 Lon Hohberger 2010-05-20 14:41:44 UTC
Patches pushed to cluster.git, but I can't link right now due to load on the Fedora servers.

Comment 20 Lon Hohberger 2010-06-03 18:32:44 UTC
Created attachment 419476 [details]
Patch 4/4

Comment 21 Lon Hohberger 2010-06-04 20:08:51 UTC
Created attachment 421344 [details]
Status program specifically for RHEV (uses libcurl)

Comment 22 Lon Hohberger 2010-06-04 20:39:57 UTC
Without recompiling or introducing additional dependencies, you could insert the following into cluster.conf as the status program for the rhevm virtual machine:

status_program="[ "$(curl -sk https://RHEVM_IP_ADDRESS/RHEVManagerWeb/HealthStatus.aspx -D - | head -1 | cut -f2 -d' ')" = "200" ]"


With the status program attached in comment 21, this is simplified somewhat:

status_program="rhev-check RHEVM_IP_ADDRESS"
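
As a rough idea of what a wrapper with this calling convention might do, here is a shell sketch built from the curl pipeline in this comment. It is a hypothetical stand-in, not the attached rhev-check program or the attached shell script; the function names rhev_check and status_from_headers are assumptions.

```shell
#!/bin/sh
# Hypothetical stand-in for a "rhev-check RHEVM_IP_ADDRESS" style wrapper.
# Not the attached program; it only reuses the pipeline quoted in this comment.

# Read an HTTP response on stdin and print the status code from its first line,
# e.g. "HTTP/1.1 200 OK" yields "200".
status_from_headers() {
    head -n 1 | cut -f2 -d' '
}

# Query the RHEV Manager health page on the host given as $1.
# Returns 0 only for HTTP 200, and 2 if no address was supplied.
rhev_check() {
    if [ -z "$1" ]; then
        echo "usage: rhev_check RHEVM_IP_ADDRESS" >&2
        return 2
    fi
    # -s: silent; -k: skip certificate verification; -D -: dump headers to stdout
    code=$(curl -sk "https://$1/RHEVManagerWeb/HealthStatus.aspx" -D - | status_from_headers)
    [ "$code" = "200" ]
}
```

Keeping the address as the only argument leaves the cluster.conf entry short and avoids nesting quotes inside the status_program value.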

Comment 23 Lon Hohberger 2010-06-04 21:04:21 UTC
Created attachment 421354 [details]
Shell script replacing option 1, making it easy to use but not introducing a universal dependency on libcurl

Comment 25 Lon Hohberger 2010-06-04 22:23:58 UTC
Created attachment 421370 [details]
Updated status script with logging

Comment 30 Lon Hohberger 2010-07-01 18:28:48 UTC
*** Bug 591230 has been marked as a duplicate of this bug. ***

Comment 32 errata-xmlrpc 2011-01-13 23:25:22 UTC
An advisory has been issued which should help the problem
described in this bug report. This report is therefore being
closed with a resolution of ERRATA. For more information
on the solution and/or where to find the updated files,
please follow the link below. You may reopen this bug report
if the solution does not work for you.

http://rhn.redhat.com/errata/RHBA-2011-0134.html

