Bug 545229

Summary: rgmanager will hang on start if cman is not running and will not stop
Product: Red Hat Enterprise Linux 5 Reporter: Shane Bradley <sbradley>
Component: rgmanagerAssignee: Lon Hohberger <lhh>
Status: CLOSED ERRATA QA Contact: Toure Dunnon <tdunnon>
Severity: medium Docs Contact:
Priority: low    
Version: 5.6CC: cluster-maint, djansa, iannis, tao, tdunnon
Target Milestone: rcKeywords: Patch
Target Release: ---   
Hardware: All   
OS: Linux   
Whiteboard:
Fixed In Version: rgmanager-2.0.52-6.13.el5 Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2011-01-13 23:23:57 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 502912    
Attachments:
Description Flags
Patch that will allow rgmanager to shutdown
none
git-am clean patch none

Description Shane Bradley 2009-12-07 22:03:30 UTC
Created attachment 376792 [details]
Patch that will allow rgmanager to shutdown

Description of problem:

If cman is not running then rgmanager will spin and will not shutdown
if called to shutdown in this state.

$ /etc/init.d/cman stop
Stopping cluster:
 Stopping fencing... done
 Stopping cman... done
 Stopping ccsd... done
 Unmounting configfs... done
                                                         [  OK  ]
$ clustat
Could not connect to CMAN: Connection refused
$ service rgmanager start
Starting Cluster Service Manager:                          [  OK  ]
$ ps -ef | grep rg
root      4383     1  0 21:36 ?        00:00:00 clurgmgrd
root      4384  4383  0 21:36 ?        00:00:00 clurgmgrd
root      4388  4261  0 21:36 pts/0    00:00:00 grep rg
$ killall clurgmgrd
$ ps -ef | grep rg
root      4383     1  0 21:36 ?        00:00:00 clurgmgrd
root      4384  4383  0 21:36 ?        00:00:00 clurgmgrd
root      4392  4261  0 21:37 pts/0    00:00:00 grep rg
$ gdb /usr/sbin/clurgmgrd 4383
(gdb) thr a a bt

Thread 1 (Thread 0x2afb47649060 (LWP 16660)):
#0  0x0000003f7a00e005 in __libc_waitpid (pid=<value optimized out>, stat_loc=<value optimized out>, options=<value optimized out>) at ../sysdeps/unix/sysv/linux/waitpid.c:32
#1  0x000000000041a9b3 in watchdog_init () at watchdog.c:73
#2  0x000000000040c36f in main (argc=1, argv=0x7fff6346f288) at main.c:1020

Attached is a patch that will allow rgmanager to be cleanly shutdown
if it is in this state.

Version-Release number of selected component (if applicable):
rgmanager-2.0.52-1.el5_4.2

How reproducible:
Everytime

Steps to Reproduce:
1.stop all cluster services
2./etc/init.d/rgmanager start
3./etc/init.d/rgmanager stop
  
Actual results:
rgmanager is not stopped. It is stuck spinning.

Expected results:
rgmanager should stop cleanly.

Additional info:

A patch was added that was pulled from stable 3 to resolve this issue.

Comment 1 Lon Hohberger 2009-12-09 15:17:28 UTC
Actually, I had to rework parts of the patch in order for it to apply. ;)

Comment 2 Lon Hohberger 2009-12-09 15:24:29 UTC
Created attachment 377217 [details]
git-am clean patch

Comment 7 errata-xmlrpc 2011-01-13 23:23:57 UTC
An advisory has been issued which should help the problem
described in this bug report. This report is therefore being
closed with a resolution of ERRATA. For more information
on therefore solution and/or where to find the updated files,
please follow the link below. You may reopen this bug report
if the solution does not work for you.

http://rhn.redhat.com/errata/RHBA-2011-0134.html