Bug 69595

Summary: Atd fails to properly start jobs
Product: [Retired] Red Hat Linux Reporter: Mike Gahagan <mgahagan>
Component: atAssignee: Jens Petersen <petersen>
Status: CLOSED RAWHIDE QA Contact: Mike McLean <mikem>
Severity: medium Docs Contact:
Priority: medium    
Version: 7.3CC: tao
Target Milestone: ---   
Target Release: ---   
Hardware: i386   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2002-07-23 16:17:09 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Attachments:
Description Flags
patch to fix problems noted above. none

Description Mike Gahagan 2002-07-23 16:16:01 UTC
Description of Problem:

1. It deletes entries in the "=" queue after one hour, even if they are
not technically stale, i.e. the job is still running. We haven't changed
this since it does not affect correctness; the user will just see the job
dissapear from the "=" queue even if it's still runnning.

2. There is a race condition between at and atd that can delay job
execution by one hour. We have changed this to 5 minutes since a proper
fix would involve some serious changes.

3. There is a race condition that allows "atd" to miss starting jobs. We
have fixed this by making it such that it will check the job directory
every time it is woken up; We are now testing this fix.


Version-Release number of selected component (if applicable):

3.1.8-23 with the patches that fixed problems reported in #67414

How Reproducible:

Somewhat intermittent, submitting jobs which take a long time (> 1hr) to
complete is the best way to reproduce these. An SMP system also seems to help.


Steps to Reproduce:
1. 
2. See above
3. 

Actual Results:

See above

Expected Results:

Jobs should complete properly, and not be removed from the queue until acutally
complete.

Additional Information:
	

attached patch applies cleanly to 7.3 at package and should fix the problems,
but a little more testing is needed.

Comment 1 Mike Gahagan 2002-07-23 16:17:05 UTC
Created attachment 66624 [details]
patch to fix problems noted above.

Comment 2 Bill Huang 2002-07-25 02:32:47 UTC
It is fixed.(3.1.8-31)