Bug 837640 - hierarchical collectors invalidating ads
Summary: hierarchical collectors invalidating ads
Keywords:
Status: CLOSED NOTABUG
Alias: None
Product: Red Hat Enterprise MRG
Classification: Red Hat
Component: condor
Version: 2.2
Hardware: Unspecified
OS: Unspecified
medium
medium
Target Milestone: 2.2
: ---
Assignee: Matthew Farrellee
QA Contact: MRG Quality Engineering
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2012-07-04 14:23 UTC by Lubos Trilety
Modified: 2013-01-04 15:38 UTC (History)
1 user (show)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed: 2012-07-05 21:14:06 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)

Description Lubos Trilety 2012-07-04 14:23:09 UTC
Description of problem:
After stopping machine with sub collector the daemons are still present in master collector statistics.

Version-Release number of selected component (if applicable):
condor-7.6.5-0.14

How reproducible:
100%

Steps to Reproduce:
1. Set sub collector on remote machine
COLLECTOR0: $(COLLECTOR)
COLLECTOR0_ARGS = -f -p 12345
COLLECTOR0_ENVIRONMENT = "_CONDOR_COLLECTOR_LOG=$(LOG)/CollectorLog"
DAEMON_LIST = MASTER, STARTD, COLLECTOR0
CONDOR_HOST = <master.collector.machine>
STARTD.COLLECTOR_HOST = <sub.collector.machine>:12345
CONDOR_VIEW_HOST = $(COLLECTOR_HOST)

2. check running daemons using 'condor_status -any'
# condor_status -any
MyType               TargetType           Name                          
Collector            None                 Personal Condor at <sub.collector.machine>
Machine              Job                  <sub.collector.machine>
Machine              Job                  <sub.collector.machine>
DaemonMaster         None                 <sub.collector.machine>
Negotiator           None                 <master.collector.machine>
Collector            None                 Personal Condor at <master.collector.machine>
...

3. stop condor on machine with sub collector, check the daemons again
# condor_status -any
MyType               TargetType           Name                          
Collector            None                 Personal Condor at <sub.collector.machine>
Machine              Job                  <sub.collector.machine>
Machine              Job                  <sub.collector.machine>
Negotiator           None                 <master.collector.machine>
Collector            None                 Personal Condor at <master.collector.machine>
...
  
Actual results:
Only ads from directly connected daemons were removed, DaemonMaster in this case. Sub collector and daemons connected to it are still present. They are removed after 15 minutes.

Expected results:
All daemons should be removed immediately after stopping condor on machine with sub collector.

Additional info:

Comment 1 Matthew Farrellee 2012-07-05 21:14:06 UTC
FYI, Version should likely be 1.0.

This is by design.

Consider, TOP-Collector <- { SUB-Collector-A, SUB-Collector-B } <- startd. Shutdown of SUB-Collector-A should not erase knowledge of startd from TOP-Collector.


Note You need to log in before you can comment on or make changes to this bug.