Bug 614747

Summary: External watchdog aborting recipes for no reason
Product: [Retired] Beaker Reporter: Petr Šplíchal <psplicha>
Component: beahAssignee: Marian Csontos <mcsontos>
Status: CLOSED CURRENTRELEASE QA Contact: Petr Šplíchal <psplicha>
Severity: high Docs Contact:
Priority: high    
Version: 0.5CC: bpeck, gozen, kbaker, mcsontos, ohudlick, rmancy
Target Milestone: ---   
Target Release: ---   
Hardware: All   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2010-07-23 12:29:58 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description Petr Šplíchal 2010-07-15 07:04:27 UTC
Description of problem:

It seems that external watchdog is (randomly) killing (passing)
tasks. In two test tier jobs I've observed four such aborts. Test
finishes ok (in a few minutes, sometimes even seconds) but only
after 30 minutes it's killed and the rest of the job is aborted.

This is very badly hurting our tier testing, suggesting as a
high priority blocker.

Version-Release number of selected component (if applicable):
0.5.49

Actual results:
https://beaker.engineering.redhat.com/recipes/13124#task156594
https://beaker.engineering.redhat.com/recipes/13125#task156926
https://beaker.engineering.redhat.com/recipes/13126#task157165
https://beaker.engineering.redhat.com/recipes/13144#task158349

Comment 1 Bill Peck 2010-07-19 13:25:11 UTC
*** Bug 609538 has been marked as a duplicate of this bug. ***

Comment 2 Marian Csontos 2010-07-20 13:39:25 UTC
*** Bug 609229 has been marked as a duplicate of this bug. ***

Comment 3 Bill Peck 2010-07-22 18:56:13 UTC
I made one change in Today's upgrade which should help this, it may fix it entirely.

Marian is going to make one more change on the harness side.

Please report back here any additional watchdog expires that should not happen.

Thanks!

Comment 4 Petr Šplíchal 2010-07-23 08:06:07 UTC
Seems fine in the current Beaker. I've checked with those tests
previously aborted:

    /CoreOS/bash/Regression/bz190350-vi-mode-redo-insert
    /CoreOS/python/Security/CVE-2008-5983-untrusted-modules-search-path
    /CoreOS/perl/Regression/bz203357-buggy-regexp-in-Encode-Alias

and everything passed without any issue:

    https://beaker.engineering.redhat.com/jobs/8607

Thanks for the fix!