Bug 837640

Summary: hierarchical collectors invalidating ads
Product: Red Hat Enterprise MRG
Reporter: Lubos Trilety <ltrilety>
Component: condor
Assignee: Matthew Farrellee <matt>
Status: CLOSED NOTABUG
QA Contact: MRG Quality Engineering <mrgqe-bugs>
Severity: medium
Priority: medium
Version: 2.2
CC: matt
Target Milestone: 2.2
Hardware: Unspecified
OS: Unspecified
Doc Type: Bug Fix
Last Closed: 2012-07-05 21:14:06 UTC
Type: Bug

Description Lubos Trilety 2012-07-04 14:23:09 UTC
Description of problem:
After stopping condor on a machine that runs a sub collector, the daemons from that machine are still present in the master collector's statistics.

Version-Release number of selected component (if applicable):
condor-7.6.5-0.14

How reproducible:
100%

Steps to Reproduce:
1. Set up a sub collector on a remote machine:
COLLECTOR0: $(COLLECTOR)
COLLECTOR0_ARGS = -f -p 12345
COLLECTOR0_ENVIRONMENT = "_CONDOR_COLLECTOR_LOG=$(LOG)/CollectorLog"
DAEMON_LIST = MASTER, STARTD, COLLECTOR0
CONDOR_HOST = <master.collector.machine>
STARTD.COLLECTOR_HOST = <sub.collector.machine>:12345
CONDOR_VIEW_HOST = $(COLLECTOR_HOST)

2. Check the running daemons using 'condor_status -any':
# condor_status -any
MyType               TargetType           Name                          
Collector            None                 Personal Condor at <sub.collector.machine>
Machine              Job                  <sub.collector.machine>
Machine              Job                  <sub.collector.machine>
DaemonMaster         None                 <sub.collector.machine>
Negotiator           None                 <master.collector.machine>
Collector            None                 Personal Condor at <master.collector.machine>
...

3. Stop condor on the machine with the sub collector, then check the daemons again:
# condor_status -any
MyType               TargetType           Name                          
Collector            None                 Personal Condor at <sub.collector.machine>
Machine              Job                  <sub.collector.machine>
Machine              Job                  <sub.collector.machine>
Negotiator           None                 <master.collector.machine>
Collector            None                 Personal Condor at <master.collector.machine>
...
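
The two listings above can be compared mechanically. The following is a minimal Python sketch, not part of condor: the helper names parse_ads and removed_ads are hypothetical, and the parser assumes the three-column layout shown in this report (MyType, TargetType, Name), with the header row and '...' elisions skipped.

```python
from collections import Counter

# Listings copied from steps 2 and 3 of this report.
BEFORE = """\
MyType               TargetType           Name
Collector            None                 Personal Condor at <sub.collector.machine>
Machine              Job                  <sub.collector.machine>
Machine              Job                  <sub.collector.machine>
DaemonMaster         None                 <sub.collector.machine>
Negotiator           None                 <master.collector.machine>
Collector            None                 Personal Condor at <master.collector.machine>
...
"""

AFTER = """\
MyType               TargetType           Name
Collector            None                 Personal Condor at <sub.collector.machine>
Machine              Job                  <sub.collector.machine>
Machine              Job                  <sub.collector.machine>
Negotiator           None                 <master.collector.machine>
Collector            None                 Personal Condor at <master.collector.machine>
...
"""


def parse_ads(listing):
    """Return (MyType, Name) pairs from 'condor_status -any' style text."""
    ads = []
    for line in listing.splitlines():
        # Split into MyType, TargetType, and the rest of the line (Name).
        parts = line.split(None, 2)
        if len(parts) < 3 or parts[0] == "MyType":
            continue  # header row, blank line, or '...' elision
        ads.append((parts[0], parts[2].strip()))
    return ads


def removed_ads(before, after):
    """Ads present in 'before' but missing from 'after' (duplicates counted)."""
    gone = Counter(parse_ads(before)) - Counter(parse_ads(after))
    return sorted(gone.elements())


if __name__ == "__main__":
    for my_type, name in removed_ads(BEFORE, AFTER):
        print(my_type, name)
```

A Counter is used rather than a set because the listing contains two identical Machine rows, which a set would collapse into one.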
  
Actual results:
Only the ads from daemons connected directly to the master collector were removed (DaemonMaster in this case). The ads for the sub collector and for the daemons connected to it are still present; they are removed only after 15 minutes.
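
The difference between immediate removal and removal after a delay can be pictured with a toy model. This is a hedged Python sketch, not condor's implementation: ToyCollector and AD_LIFETIME_SECONDS are hypothetical names, and the 900-second lifetime is simply the 15 minutes observed in this report.

```python
# Assumption: ads that are not explicitly invalidated age out after a fixed
# lifetime. 15 minutes is the delay observed in this report.
AD_LIFETIME_SECONDS = 15 * 60


class ToyCollector:
    """Toy ad store: an ad leaves by explicit invalidation or by aging out."""

    def __init__(self, lifetime=AD_LIFETIME_SECONDS):
        self.lifetime = lifetime
        self.ads = {}  # ad name -> time (seconds) of the last update

    def update(self, name, now):
        """Record a fresh ad, or refresh an existing one."""
        self.ads[name] = now

    def invalidate(self, name):
        """Explicit removal, as when a daemon shuts down cleanly."""
        self.ads.pop(name, None)

    def names(self, now):
        """Drop ads older than the lifetime, then list what remains."""
        stale = [n for n, t in self.ads.items() if now - t >= self.lifetime]
        for n in stale:
            del self.ads[n]
        return sorted(self.ads)


if __name__ == "__main__":
    top = ToyCollector()
    top.update("DaemonMaster", now=0)  # reports directly to the top collector
    top.update("Machine", now=0)       # arrives via the sub collector
    top.invalidate("DaemonMaster")     # direct daemon says goodbye
    print(top.names(now=60))           # Machine is still listed
    print(top.names(now=900))          # Machine has aged out
```

In this model the directly connected daemon disappears at once, while the forwarded ad lingers until its lifetime runs out, which matches the listing in step 3.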

Expected results:
All daemons should be removed immediately after stopping condor on the machine with the sub collector.

Additional info:

Comment 1 Matthew Farrellee 2012-07-05 21:14:06 UTC
FYI, Version should likely be 1.0.

This is by design.

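Consider TOP-Collector <- { SUB-Collector-A, SUB-Collector-B } <- startd, where the startd reports to both sub collectors and both forward to TOP-Collector. Shutting down SUB-Collector-A should not erase TOP-Collector's knowledge of the startd, which is still running and still reachable through SUB-Collector-B.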
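
The topology argument can be made concrete with a small Python sketch. It is an illustration under stated assumptions, not condor's code: the TopCollector class, its methods, and the bookkeeping of which sub collector forwarded an ad are all hypothetical, introduced only to contrast the behaviour requested in this bug with the behaviour described as by design.

```python
class TopCollector:
    """Toy top-level collector fed by several sub collectors."""

    def __init__(self):
        # ad name -> set of sub collectors that forwarded it (hypothetical
        # bookkeeping, used here only to express the 'naive' policy).
        self.ads = {}

    def forward(self, ad, via):
        """A sub collector forwards an ad it received from a daemon."""
        self.ads.setdefault(ad, set()).add(via)

    def sub_shutdown_naive(self, sub):
        """Requested behaviour: drop every ad that came through 'sub'."""
        for ad in [a for a, subs in self.ads.items() if sub in subs]:
            del self.ads[ad]

    def sub_shutdown_by_design(self, sub):
        """Behaviour described in the comment: forwarded ads are kept."""
        # Nothing is removed here; in this model ads leave only when their
        # owner invalidates them or when they expire.


def demo_top():
    """Build the topology from the comment: one startd, two sub collectors."""
    top = TopCollector()
    top.forward("startd", via="SUB-Collector-A")
    top.forward("startd", via="SUB-Collector-B")
    return top


if __name__ == "__main__":
    naive = demo_top()
    naive.sub_shutdown_naive("SUB-Collector-A")
    print("naive keeps startd:", "startd" in naive.ads)

    designed = demo_top()
    designed.sub_shutdown_by_design("SUB-Collector-A")
    print("by-design keeps startd:", "startd" in designed.ads)
```

Under the naive policy the startd vanishes from the top collector even though it is still running and still reporting through SUB-Collector-B, which is the outcome the comment argues against.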