Red Hat Bugzilla – Bug 172066
Infinite recovery loop with clusterfs due to status always returning 1
Last modified: 2009-04-16 16:18:25 EDT
Description of problem:
A cluster service involving clusterfs resources will be infinitely recovered.
Version-Release number of selected component (if applicable):
Steps to Reproduce:
1. Create a cluster service using a GFS filesystem
Actual results:
The service is considered malfunctioning, so rgmanager continuously relocates
and recovers it.
Expected results:
rgmanager should detect that the service is not malfunctioning.
In the status/monitor code of clusterfs.sh, there is a test that is expected to
fail when the service is OK ([ $? -ne $YES ] && exit $OCF_ERR_GENERIC). When
that test fails, it leaves $? set to 1, and the last line then exits with that
status (exit $?). So the status always returns 1, even for a healthy service.
This mini patch fixes this behaviour:
--- /usr/share/cluster/clusterfs.sh~ 2005-10-18 21:17:51.000000000 +0200
+++ /usr/share/cluster/clusterfs.sh 2005-10-30 14:57:06.000000000 +0100
@@ -846,7 +846,7 @@
[ $? -ne $YES ] && exit $OCF_ERR_GENERIC
- exit $?
+ exit 0
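The effect of this one-line change can be reproduced outside the agent. The following is only a minimal sketch: is_healthy, status_buggy and status_fixed are hypothetical names, and the values YES=0 and OCF_ERR_GENERIC=1 are assumed here rather than taken from the script's environment. Each function body runs in a subshell so that exit ends only that function.

```shell
#!/bin/sh
# Assumed values for this sketch; the real agent defines its own.
YES=0
OCF_ERR_GENERIC=1

# Hypothetical stand-in for the real check: always reports a healthy filesystem.
is_healthy() { return $YES; }

# Ending of the status path before the patch.
status_buggy() (
    is_healthy
    [ $? -ne $YES ] && exit $OCF_ERR_GENERIC
    exit $?    # $? is now the status of the failed [ ... ] test, i.e. 1
)

# Ending of the status path after the patch.
status_fixed() (
    is_healthy
    [ $? -ne $YES ] && exit $OCF_ERR_GENERIC
    exit 0     # reaching this line means the check passed
)

buggy_rc=0; status_buggy || buggy_rc=$?
fixed_rc=0; status_fixed || fixed_rc=$?
echo "buggy=$buggy_rc fixed=$fixed_rc"
```

With a healthy check, the unpatched ending still exits with 1 because the AND list passes on the status of the failed test, while the patched ending exits with 0.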
Patch has been applied to the stable, rhel4, and head branches.
This is addressed, so I'm setting state to modified for now -- it should come
out with the next update.