Hide Forgot
Description of problem: We had some jobs killed by external watchdog the last outage - see Bug 637186#c6 and following comment(s). We need to make sure after outage taking longer than 30 minutes all the watchdogs are updated to allow for machines to sync up with the host. Version-Release number of selected component (if applicable): 0.7.2 How reproducible: Not easy to reproduce. Steps to Reproduce: 1. schedule a task taking X minutes 1. perform an outage longer than X + 1 hour Actual results: Job killed by external watchdog Expected results: Job resumes after the delay, submits all results and continues execution. Additional info:
How about a command line which would allow the admins to extend current watchdogs by the length of the outage? bkr watchdogs --add 30m
of course they would need to run that before starting beaker-watchdog services.
Yes please. I think that would do.
moving to 0.8.2
pushed to gerrit for review