Bug 852321 - Missing SPOOL settings in HAScheduler feature
Summary: Missing SPOOL settings in HAScheduler feature
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Enterprise MRG
Classification: Red Hat
Component: condor-wallaby-base-db
Version: 2.2
Hardware: Unspecified
OS: Unspecified
Priority: medium
Severity: medium
Target Milestone: 2.2
Target Release: ---
Assignee: Robert Rati
QA Contact: MRG Quality Engineering
URL:
Whiteboard:
Depends On:
Blocks: 828434
 
Reported: 2012-08-28 07:58 UTC by Lubos Trilety
Modified: 2012-09-25 08:40 UTC (History)

Fixed In Version: condor-wallaby-base-db-1.23-1
Doc Type: Bug Fix
Doc Text:
C: Using Remote Configuration to configure a High Availability Scheduler with the HAScheduler feature
C: The shadow could fail to start with the error:
ERROR "According to /var/lib/condor/spool/spool_version, the SPOOL directory is written in spool version 0, but I only support versions back to 1."
F: Added parameters to point the shadow at the same spool location used by the HAScheduler feature
R: The shadow will run jobs as usual using the spool pointed to by the node with the HAScheduler feature installed
Clone Of:
Environment:
Last Closed: 2012-09-19 17:45:33 UTC
Target Upstream Version:
Embargoed:


Links
System | ID | Private | Priority | Status | Summary | Last Updated
Red Hat Bugzilla | 852537 | 0 | unspecified | CLOSED | RHHAv2 won't run jobs | 2021-02-22 00:41:40 UTC
Red Hat Product Errata | RHSA-2012:1278 | 0 | normal | SHIPPED_LIVE | Moderate: Red Hat Enterprise MRG Grid 2.2 security update | 2012-09-19 21:40:26 UTC

Internal Links: 852537

Description Lubos Trilety 2012-08-28 07:58:54 UTC
Description of problem:
The HAScheduler feature sets SCHEDD.SPOOL, but that is not enough: Condor needs SPOOL set for all daemons, or at least for the shadow. With only SCHEDD.SPOOL set, the shadow daemon fails with this error:
ERROR "According to /var/lib/condor/spool/spool_version, the SPOOL directory is written in spool version 0, but I only support versions back to 1.
" at line 67 in file /builddir/build/BUILD/condor-7.6.4/src/condor_utils/spool_version.cpp

The HAScheduler feature should probably set SPOOL rather than SCHEDD.SPOOL, as it did in the previous version.
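
To illustrate why setting only SCHEDD.SPOOL leaves the shadow on a different spool, here is a simplified model of a daemon-prefixed parameter lookup. It is a sketch, not Condor's actual code: the `lookup` helper and the shared path `/mnt/shared/spool` are assumptions made up for illustration, and the default path is the one quoted in the error message.

```python
def lookup(config, subsystem, name):
    """Return SUBSYSTEM.NAME if it is set, otherwise fall back to the global NAME."""
    scoped = f"{subsystem}.{name}"
    if scoped in config:
        return config[scoped]
    return config.get(name)


# Hypothetical configuration: only the schedd is pointed at the shared spool.
config = {
    "SPOOL": "/var/lib/condor/spool",     # global default, as seen in the error message
    "SCHEDD.SPOOL": "/mnt/shared/spool",  # assumed shared location, for illustration only
}

print(lookup(config, "SCHEDD", "SPOOL"))  # /mnt/shared/spool
print(lookup(config, "SHADOW", "SPOOL"))  # /var/lib/condor/spool
```

In this model the schedd and the shadow resolve SPOOL to different directories, which matches the symptom reported here.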

Version-Release number of selected component (if applicable):
condor-wallaby-base-db-1.22-5

How reproducible:
100%

Steps to Reproduce:
1. Configure the HAScheduler feature on some machines.
2. Submit some jobs and check the shadow log.
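
The log check in step 2 could be scripted roughly as follows. This is a minimal sketch: the function name is hypothetical, the shadow log location is not given in this report so the function takes the log text directly, and the pattern is taken from the error message quoted in the description.

```python
import re

# Pattern built from the error text quoted in this report.
SPOOL_ERROR = re.compile(
    r"the SPOOL directory is written in spool version (\d+), "
    r"but I only support versions back to (\d+)"
)


def find_spool_version_errors(log_text):
    """Return (found_version, minimum_supported) for each spool version error."""
    return [(int(m.group(1)), int(m.group(2))) for m in SPOOL_ERROR.finditer(log_text)]


sample = (
    'ERROR "According to /var/lib/condor/spool/spool_version, '
    "the SPOOL directory is written in spool version 0, "
    "but I only support versions back to 1."
)
print(find_spool_version_errors(sample))  # [(0, 1)]
```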

  
Actual results:
An error about the spool version appears in the shadow log.

Expected results:
No error; Condor works correctly.

Additional info:

Comment 2 Jeff Needle 2012-08-29 11:47:46 UTC

*** This bug has been marked as a duplicate of bug 852537 ***

Comment 3 Jeff Needle 2012-08-29 11:53:51 UTC
Incorrectly marked as a dup.  Reopening.

Comment 4 Robert Rati 2012-08-30 17:11:09 UTC
Added GRIDMANAGER.SPOOL and SHADOW.SPOOL to the SharedJobQueueLocation feature.  This resolves this issue for the shadow and gridmanager.

Fixed upstream on:
master
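
As a rough illustration of the fix, the sketch below expands `$(NAME)` references the way the new SHADOW.SPOOL and GRIDMANAGER.SPOOL values rely on. The parameter values mirror the SharedJobQueueLocation listing in this report, except SCHEDD.SPOOL, which is empty there and is given an assumed path here. The `expand` helper is a simplified stand-in, not Condor's macro expander.

```python
import re

MACRO = re.compile(r"\$\(([^)]+)\)")


def expand(config, value, depth=0):
    """Recursively replace $(NAME) references with values from config."""
    if depth > 10:
        raise ValueError("macro expansion nested too deeply")

    def substitute(match):
        # Unknown names expand to an empty string in this simplified model.
        return expand(config, config.get(match.group(1), ""), depth + 1)

    return MACRO.sub(substitute, value)


params = {
    "SCHEDD.SPOOL": "/mnt/shared/spool",  # assumed path, for illustration only
    "SHADOW.SPOOL": "$(SCHEDD.SPOOL)",
    "GRIDMANAGER.SPOOL": "$(SCHEDD.SPOOL)",
    "SCHEDD.HISTORY": "$(SCHEDD.SPOOL)/history",
}

resolved = {name: expand(params, value) for name, value in params.items()}
print(resolved["SHADOW.SPOOL"])    # /mnt/shared/spool
print(resolved["SCHEDD.HISTORY"])  # /mnt/shared/spool/history
```

With these values the shadow and gridmanager follow whatever spool the schedd is given, so the location only has to be set once.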

Comment 5 Robert Rati 2012-08-30 17:17:09 UTC
    Technical note added. If any revisions are required, please edit the "Technical Notes" field
    accordingly. All revisions will be proofread by the Engineering Content Services team.
    
    New Contents:
C: Using Remote Configuration to configure a High Availability Scheduler with the HAScheduler feature
C: The shadow could fail to start with the error:
ERROR "According to /var/lib/condor/spool/spool_version, the SPOOL directory is written in spool version 0, but I only support versions back to 1."
F: Added parameters to point the shadow at the same spool location used by the HAScheduler feature
R: The shadow will run jobs as usual using the spool pointed to by the node with the HAScheduler feature installed

Comment 7 Lubos Trilety 2012-08-31 14:32:50 UTC
Tested with:
condor-wallaby-base-db-1.23-1

Tested on:
RHEL5 x86_64,i386
RHEL6 x86_64,i386

# wallaby show-feature SharedJobQueueLocation
Console Connection Established...
SharedJobQueueLocation
  name:  "SharedJobQueueLocation"
  params:  {"SHADOW.SPOOL"=>"$(SCHEDD.SPOOL)", "GRIDMANAGER.SPOOL"=>"$(SCHEDD.SPOOL)", "SCHEDD.HISTORY"=>"$(SCHEDD.SPOOL)/history", "SCHEDD.SPOOL"=>""}
  depends:  []
  conflicts:  []
  included_features:  []


All jobs run correctly, no error in logs.
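
The feature listing above can also be checked mechanically. The sketch below parses the `params:` line as printed in this comment and reports any daemon whose SPOOL does not reference `$(SCHEDD.SPOOL)`. The helper names are hypothetical, and the regular expression only assumes the `"KEY"=>"VALUE"` shape seen in that output.

```python
import re

PAIR = re.compile(r'"([^"]+)"=>"([^"]*)"')


def parse_params(line):
    """Turn a params line of "KEY"=>"VALUE" pairs into a dict."""
    return dict(PAIR.findall(line))


def daemons_missing_shared_spool(params, daemons=("SHADOW", "GRIDMANAGER")):
    """List daemons whose SPOOL is not set to $(SCHEDD.SPOOL)."""
    return [d for d in daemons if params.get(f"{d}.SPOOL") != "$(SCHEDD.SPOOL)"]


line = (
    'params:  {"SHADOW.SPOOL"=>"$(SCHEDD.SPOOL)", '
    '"GRIDMANAGER.SPOOL"=>"$(SCHEDD.SPOOL)", '
    '"SCHEDD.HISTORY"=>"$(SCHEDD.SPOOL)/history", "SCHEDD.SPOOL"=>""}'
)
params = parse_params(line)
print(daemons_missing_shared_spool(params))  # []
```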

>>> VERIFIED

Comment 9 errata-xmlrpc 2012-09-19 17:45:33 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

http://rhn.redhat.com/errata/RHSA-2012-1278.html

