Red Hat Bugzilla – Bug 808142
condor_startd reconfig from static slots to partitionable slots causes ERROR and exit if job is running
Last modified: 2016-05-26 15:57:07 EDT
Description of problem:
The condor_startd allows a reconfig to change its slot configuration. If a job
is running during that reconfiguration, the startd will ERROR and exit.
Version-Release number of selected component (if applicable):
Likely all, definitely 7.6.7-0.8.
Steps to Reproduce:
1. service condor start
2. printf 'cmd=/bin/sleep\nargs=1d\nqueue\n' | condor_submit
3. Wait for job to start running
4. Enable partitionable slots
NUM_SLOTS_TYPE_1 = 1
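Only NUM_SLOTS_TYPE_1 = 1 is recorded for step 4. As a rough sketch of what a complete partitionable-slot change could look like, the fragment below adds the slot type definition and the partitionable flag. The SLOT_TYPE_1 and SLOT_TYPE_1_PARTITIONABLE lines are assumptions for illustration, not the reporter's actual settings.

```
# Assumed local configuration fragment; only NUM_SLOTS_TYPE_1 comes from this report.
# Give slot type 1 all of the machine's resources (assumed values).
SLOT_TYPE_1 = cpus=100%,disk=100%,swap=100%
# Mark slot type 1 as partitionable (assumed; implied by "Enable partitionable slots").
SLOT_TYPE_1_PARTITIONABLE = TRUE
# From the report: advertise one slot of type 1.
NUM_SLOTS_TYPE_1 = 1
```

The SIGHUP recorded in MasterLog is consistent with the change having been applied with condor_reconfig while the job was running, though the report does not list that step explicitly.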
Actual results:
In MasterLog -
Sent SIGHUP to STARTD (pid 10522)
The STARTD (pid 10522) exited with status 0
In StartLog -
ERROR "Unknown state in ResState::leave_action" at line 604 in file .../src/condor_startd.V6/ResState.cpp
Expected results:
No ERROR, no exit.
MRG-Grid is in maintenance and only customer escalations will be considered. This issue can be reopened if a customer escalation associated with it occurs.