1262990 – [bug] watchdog can be pushed out after a panic is detected

Bug 1262990 - [bug] watchdog can be pushed out after a panic is detected

Summary: [bug] watchdog can be pushed out after a panic is detected

Keywords:
Status:	CLOSED WONTFIX
Alias:	None
Product:	Beaker
Classification:	Retired
Component:	lab controller
Sub Component:
Version:	20
Hardware:	Unspecified
OS:	Unspecified
Priority:	unspecified
Severity:	unspecified
Target Milestone:	---
Assignee:	beaker-dev-list
QA Contact:	tools-bugs
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+	depends on / blocked

Reported:	2015-09-14 20:19 UTC by Bill Peck
Modified:	2020-10-21 14:13 UTC (History)
CC List:	3 users (show)
Fixed In Version:
Clone Of:
Environment:
Last Closed:	2020-10-21 14:13:33 UTC
Embargoed:

Attachments	(Terms of Use)

Description Bill Peck 2015-09-14 20:19:22 UTC

Description of problem:
When a panic is detencted by the lab controller it sets the current watchdog to ten minutes from the current time to give the system a chance to kdump if configured.

The problem is if kdump default configuration is to reboot the system.  This means its possible that the system under test could push the watchdog out again and attempt to run the test and possibly panic again.

Version-Release number of selected component (if applicable):
20.2

How reproducible:
Sun, moon and stars need to align

Actual results:
We had a bad kernel go through beaker which ended up filling the netdump server disk and kept the systems busy until someone manually canceled the job.

Several things had to go wrong.
1. bad kernel that caused a panic
2. bug in kdump that kept the system from halting
3. fsck? rolled changes back in filesystem which made the harness think it was running for the first time (rebootcount==0)

Expected results:
Once initial 10 minute watchdog timeout is set because of a panic it should not be changeable.

Additional info:
There is some other code in the abort code which looks like it can modify the watchdog time as well if WATCHDOG_SCRIPT is defined. So this could be tricky.

Comment 1 Pavel Cahyna 2019-07-18 16:15:56 UTC

Bill Peck: how will one be able to collect the vmcore generated by kdump? Ten minutes is not enough.

Note You need to log in before you can comment on or make changes to this bug.