Note: This bug is displayed in read-only format because the product is no longer active in Red Hat Bugzilla.

Bug 486484

Summary: condor_ha_scheduler configuration missing QMF_STOREFILE
Product: Red Hat Enterprise MRG Reporter: Matthew Farrellee <matt>
Component: gridAssignee: Robert Rati <rrati>
Status: CLOSED ERRATA QA Contact: Martin Kudlej <mkudlej>
Severity: high Docs Contact:
Priority: high    
Version: 1.1CC: iboverma, lans.carstensen, lbrindle, mkudlej, tao
Target Milestone: 1.2   
Target Release: ---   
Hardware: All   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Grid enhancement. Added SCHEDD.QMF_STOREFILE = $(SPOOL)/.schedd_storefile to the HA Scheduler configuration. Jobs and schedulers no longer appear in Cumin twice after a scheduler failover.
Story Points: ---
Clone Of: Environment:
Last Closed: 2009-12-03 09:18:25 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 527551    

Description Matthew Farrellee 2009-02-19 23:14:29 UTC
Description of problem:

In a HA scheduler setup the QMF plugins in the Schedd need to share a QMF_STOREFILE so that they appear to be a single Scheduler from QMF's perspective. This means SCHEDD.QMF_STOREFILE should be set to $SPOOL/.schedd_storefile in the HA configuration.

Version-Release number of selected component (if applicable):

condor-remote-configuration-server-1.0-12.el5


Actual results:

Fail over a HA schedd and you'll see a second Scheduler created in Cumin.


Expected results:

The Scheduler's name in Cumin may change, but a second should not be created.

Comment 1 Robert Rati 2009-03-02 17:10:08 UTC
Added

SCHEDD.QMF_STOREFILE = $(SPOOL)/.schedd_storefile

To the HA Scheduler configuration and tested in qpid-tool that duplicate jobs/schedulers no longer appear after the scheduler has been failed over.

Fixed in:
condor-remote-configuration-1.0-15

Comment 3 Irina Boverman 2009-10-28 18:06:01 UTC
Release note added. If any revisions are required, please set the 
"requires_release_notes" flag to "?" and edit the "Release Notes" field accordingly.
All revisions will be proofread by the Engineering Content Services team.

New Contents:
Added SCHEDD.QMF_STOREFILE = $(SPOOL)/.schedd_storefile to the HA Scheduler configuration. Jobs/schedulers no longer appear in Cumin second time after the scheduler has been failed over (486484)

Comment 4 Lana Brindley 2009-11-05 01:54:01 UTC
Release note updated. If any revisions are required, please set the 
"requires_release_notes"  flag to "?" and edit the "Release Notes" field accordingly.
All revisions will be proofread by the Engineering Content Services team.

Diffed Contents:
@@ -1 +1,3 @@
-Added SCHEDD.QMF_STOREFILE = $(SPOOL)/.schedd_storefile to the HA Scheduler configuration. Jobs/schedulers no longer appear in Cumin second time after the scheduler has been failed over (486484)+Grid enhancement.
+
+Added SCHEDD.QMF_STOREFILE = $(SPOOL)/.schedd_storefile to the HA Scheduler configuration. Jobs and schedulers no longer appear in Cumin twice after a scheduler failover.

Comment 5 Martin Kudlej 2009-11-10 12:25:53 UTC
Tested on RHEL 5.4/4.8 x i386/x86_64 with condor-remote-configuration-1.0-14 and it doesn't work.
Teste with condor-remote-configuration-1.0-23 and it works. -->VERIFIED

Comment 7 errata-xmlrpc 2009-12-03 09:18:25 UTC
An advisory has been issued which should help the problem
described in this bug report. This report is therefore being
closed with a resolution of ERRATA. For more information
on therefore solution and/or where to find the updated files,
please follow the link below. You may reopen this bug report
if the solution does not work for you.

http://rhn.redhat.com/errata/RHEA-2009-1633.html