Bug 699723
Summary: | [RFE] Make shadow worklife extend past CLAIM_WORKLIFE | ||
---|---|---|---|
Product: | Red Hat Enterprise MRG | Reporter: | Matthew Farrellee <matt> |
Component: | condor | Assignee: | Timothy St. Clair <tstclair> |
Status: | CLOSED WONTFIX | QA Contact: | MRG Quality Engineering <mrgqe-bugs> |
Severity: | medium | Docs Contact: | |
Priority: | medium | ||
Version: | 1.3 | CC: | dan, matt, tstclair |
Target Milestone: | 2.1 | ||
Target Release: | --- | ||
Hardware: | Unspecified | ||
OS: | Unspecified | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | Bug Fix | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2011-08-03 18:21:43 UTC | Type: | --- |
Regression: | --- | Mount Type: | --- |
Documentation: | --- | CRM: | |
Verified Versions: | Category: | --- | |
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
Cloudforms Team: | --- | Target Upstream Version: | |
Embargoed: | |||
Bug Depends On: | |||
Bug Blocks: | 699726 |
Description
Matthew Farrellee
2011-04-26 13:24:21 UTC
Now tracking upstream https://condor-wiki.cs.wisc.edu/index.cgi/tktview?tn=2229 Dev notes: Original patch would not work because of sequence of events which the schedd follows when starting a claim. If the CLAIM_WORKLIFE expires then the schedd has no method of blocking to request a new claim for a given match_rec. Presently, it enqueues a claim_request for a given match which then waits for a response. Once the response has been received, it then triggers another timer to start the job. Everything is 1:1 as it comes in from negotiation. I will consult with upstream, but it appears that if one wants to do this it would require rearchitecting how we handle claim_request and how jobs are spun. It does however seem possible to reuse other transient error conditions (107 & 108). Dev notes: Consulted with DanB, and he is in agreement with the assessment. There is little to be gained with trying to recycle a shadow once a claim has expired because you would need to request a new claim, which is a high latency event. Will pursue the option of reusing for other error conditions. |