Bug 893075 - [RFE] we need better integration/communication of anaconda and the harness
Summary: [RFE] we need better integration/communication of anaconda and the harness
Keywords:
Status: CLOSED EOL
Alias: None
Product: Beaker
Classification: Retired
Component: beah
Version: 0.10
Hardware: Unspecified
OS: Unspecified
Priority: unspecified
Severity: unspecified
Target Milestone: ---
Assignee: beaker-dev-list
QA Contact:
URL:
Whiteboard: Kickstart
Depends On:
Blocks:
Reported: 2013-01-08 14:23 UTC by Endre "Hrebicek" Balint-Nagy
Modified: 2020-02-11 12:10 UTC (History)

Fixed In Version:
Clone Of:
Environment:
Last Closed: 2020-02-11 12:08:18 UTC
Embargoed:


Links
System ID Private Priority Status Summary Last Updated
Red Hat Bugzilla 1588895 0 unspecified CLOSED Make the installation error detection more reliable 2021-02-22 00:41:40 UTC

Internal Links: 1588895

Description Endre "Hrebicek" Balint-Nagy 2013-01-08 14:23:48 UTC
Description of problem:
There are cases where an error detected by Anaconda ends up as a long timeout in Beaker. It would be much better if Anaconda reported such problems to the test harness when running under Beaker. Anaconda would probably need a boot option to turn on this behavior. (Maybe the Anaconda Bugzilla would be a better place for this RFE, but it requires work on the Beaker side anyway.)

Version-Release number of selected component (if applicable):


How reproducible:


Steps to Reproduce:
1.
2.
3.
  
Actual results:


Expected results:


Additional info:

Comment 1 Endre "Hrebicek" Balint-Nagy 2013-01-08 14:46:23 UTC
The Anaconda BZ for this issue is 893084. Please keep in touch with its assignee to work out a solution.

Comment 2 Dan Callaghan 2013-01-09 22:50:49 UTC
I do not expect the Anaconda folks will want to add Beaker-specific code, which is fair. But maybe we could work with them to figure out a more generally useful way of reporting errors which Beaker can plug in to.

The other thing to bear in mind is that although there are many kinds of errors that can make the installation hang, Anaconda often shows a prompt to allow the problem to be fixed with manual intervention. For example, when the kickstart can't be fetched you will be prompted to enter a different URL. And this is useful even in Beaker, because it gives us a chance to get onto the serial console and investigate the problem. If the system were returned immediately we would not have a chance to do that.

A better approach might be to have a shorter initial watchdog time (I think we allow 3 hours for installation right now) and add something to the anamon script to periodically extend the watchdog by 10 minutes as long as Anaconda is making progress. The hard part might be figuring out whether Anaconda is actually making progress, but a rudimentary way might be just to check for new log entries in any of the monitored Anaconda logs.
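The progress check described above could be sketched roughly as follows. This is only an illustration, not anamon's actual code: it treats "new log entries" as "a monitored file grew", and `extend_watchdog` is a hypothetical callback standing in for whatever call would really extend the Beaker watchdog.

```python
import os

# The 10-minute extension suggested in the comment above.
EXTENSION_SECONDS = 10 * 60


def snapshot(paths):
    """Return {path: size in bytes}; a log that does not exist yet counts as 0."""
    sizes = {}
    for path in paths:
        try:
            sizes[path] = os.path.getsize(path)
        except OSError:
            sizes[path] = 0
    return sizes


def made_progress(previous, current):
    """True if any monitored log grew since the previous snapshot."""
    return any(size > previous.get(path, 0) for path, size in current.items())


def poll_once(paths, previous, extend_watchdog):
    """Take a new snapshot and extend the watchdog only if a log grew.

    extend_watchdog is a hypothetical callable taking a number of seconds.
    Returns the new snapshot, to be passed in on the next poll.
    """
    current = snapshot(paths)
    if made_progress(previous, current):
        extend_watchdog(EXTENSION_SECONDS)
    return current
```

A caller would run `poll_once` on a timer shorter than the extension. If Anaconda stops writing to its logs, for example because it is waiting at a prompt, no further extension is requested and the shorter watchdog eventually expires.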

Comment 3 Endre "Hrebicek" Balint-Nagy 2013-02-01 02:57:22 UTC
My last comment on the Anaconda counterpart of this bug, unanswered at the moment:

The most important point is to make the job submitter aware when Anaconda is asking questions on the console. This is an issue for unattended installations in general, so a more generic solution is desirable. That solution can then be implemented in Beaker.
A minimalist solution would probably be to specify an rsyslog server on the boot command line, but this is not necessarily the Beaker way.
The second point is to agree on unique patterns signalling fatal conditions.
(Maybe the rsyslog solution is unavoidable as long as we have test systems lacking a serial console.)
The third point is to include an opaque string in each log line that identifies the actual installation; in Beaker's case this would be the taskid.
I hope this is not too much to implement, as rsyslog support is already present.
(For the Beaker side this means a mandatory rsyslog port on the lab controller, open to the whole lab, whose content is monitored for relevant events.)
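To illustrate the second and third points, here is a minimal sketch of how the receiving side might pick fatal events for one installation out of the incoming log lines. Everything specific in it is an assumption made for the example: the `taskid=<id>` prefix, the `FATAL` and `Traceback` markers, and the function names are hypothetical, since the actual patterns would still have to be agreed on with the Anaconda developers.

```python
import re

# Hypothetical markers; the real patterns would need to be agreed on.
FATAL_PATTERNS = [re.compile(r"\bFATAL\b"), re.compile(r"\bTraceback\b")]

# Assumed line format: "taskid=<opaque id> <message>".
TAGGED_LINE = re.compile(r"taskid=(\S+)\s+(.*)")


def parse_line(line):
    """Return (task_id, message), or None if the line carries no taskid tag."""
    match = TAGGED_LINE.match(line)
    if match is None:
        return None
    return match.group(1), match.group(2)


def fatal_events(lines, task_id):
    """Collect messages for task_id that match any fatal pattern."""
    events = []
    for line in lines:
        parsed = parse_line(line)
        if parsed is None:
            continue  # untagged lines cannot be attributed to an installation
        line_task, message = parsed
        if line_task == task_id and any(p.search(message) for p in FATAL_PATTERNS):
            events.append(message)
    return events
```

With the opaque string on every line, a single shared log port can serve the whole lab, because each line can be attributed to its installation before any pattern is applied.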

Are my wishes realistic?

Cheers
Endre.

Comment 6 Roman Joost 2018-03-08 02:39:21 UTC
We've just tried to fix Bug 1413827 and, while doing the QA part, found Bug 1552401. This feels like we're fighting a losing battle. I think the idea behind this RFE is sound, and we should use this bug to investigate Anaconda and to come up with an approach that perhaps does not rely on pattern matching against the log.

Comment 7 Martin Styk 2020-02-11 12:08:18 UTC
Beah is no longer supported by the Beaker development team.
Instead, we are working on the Restraint test harness. You can find all the features of Restraint here:

https://restraint.readthedocs.io/en/latest/

If you think your RFE should still be implemented as part of Restraint, feel free to create a new BZ ticket:

https://bugzilla.redhat.com/enter_bug.cgi?product=Restraint

If you have any questions, feel free to reach out to me.
Thank you,
Martin Styk <martin.styk>

