From Bugzilla Helper: User-Agent: Opera/9.01 (Windows NT 5.1; U; en) Description of problem: clusvcmgrd Service "Status" operation fails returning "ERROR: Memory fault", causing the stop of the cluster service: Aug 7 10:42:38 rtc2 clusvcmgrd: [18687]: <err> service error: User script '/ home/core/DSCP/cluster/bin/clu_Offline status' returned Aug 7 10:42:39 rtc2 clusvcmgrd: [18687]: <err> service error: LTRH Running Aug 7 10:42:39 rtc2 clusvcmgrd: [18687]: <err> service error: ERROR: Memory fault Aug 7 10:42:39 rtc2 clusvcmgrd: [18687]: <err> service error: Check status failed on user script for Offline Aug 7 10:42:39 rtc2 clusvcmgrd[18686]: <warning> Restarting locally failed service Offline Version-Release number of selected component (if applicable): clumanager-1.2.22-2 How reproducible: Sometimes Steps to Reproduce: 1.Configure a short monitor interval for this service 2.Wait for a failure (happens arount once each week) 3. Actual Results: Expected Results: Additional info:
[quote] Aug 7 10:42:39 rtc2 clusvcmgrd: [18687]: <err> service error: LTRH Running Aug 7 10:42:39 rtc2 clusvcmgrd: [18687]: <err> service error: ERROR: Memory fault Aug 7 10:42:39 rtc2 clusvcmgrd: [18687]: <err> service error: Check status failed on user script for Offline [/quote] Those messages are coming from the user service script when it does a 'status' and not from clusvcmgrd itself; rather, clusvcmgrd is simply reporting the error on behalf of the user service script.