Red Hat Satellite engineering is moving the tracking of its product development work on Satellite to Red Hat Jira (issues.redhat.com). If you're a Red Hat customer, please continue to file support cases via the Red Hat customer portal. If you're not, please head to the "Satellite project" in Red Hat Jira and file new tickets here. Individual Bugzilla bugs will be migrated starting at the end of May. If you cannot log in to RH Jira, please consult article #7032570. That failing, please send an e-mail to the RH Jira admins at rh-issues@redhat.com to troubleshoot your issue as a user management inquiry. The email creates a ServiceNow ticket with Red Hat. Individual Bugzilla bugs that are migrated will be moved to status "CLOSED", resolution "MIGRATED", and set with "MigratedToJIRA" in "Keywords". The link to the successor Jira issue will be found under "Links", have a little "two-footprint" icon next to it, and direct you to the "Satellite project" in Red Hat Jira (issue links are of type "https://issues.redhat.com/browse/SAT-XXXX", where "X" is a digit). This same link will be available in a blue banner at the top of the page informing you that that bug has been migrated.
Bug 1628638 - The termination procedure after memory threshold exceeded can get stuck, waiting infinitely for some events to occur
Summary: The termination procedure after memory threshold exceeded can get stuck, wait...
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Satellite
Classification: Red Hat
Component: Tasks Plugin
Version: Unspecified
Hardware: Unspecified
OS: Unspecified
unspecified
high
Target Milestone: 6.5.0
Assignee: satellite6-bugs
QA Contact: Jan Hutař
URL:
Whiteboard:
Depends On: 1654217 1665461
Blocks:
TreeView+ depends on / blocked
 
Reported: 2018-09-13 15:41 UTC by Ivan Necas
Modified: 2023-03-24 14:14 UTC (History)
12 users (show)

Fixed In Version: tfm-rubygem-dynflow-1.1.1
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
: 1661291 1684687 (view as bug list)
Environment:
Last Closed: 2019-05-14 12:38:03 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Foreman Issue Tracker 25021 0 Normal Closed The termination procedure after memory threshold exceeded can get stuck, waiting infinitely for some events to occur 2020-10-26 08:41:36 UTC
Red Hat Bugzilla 1654975 0 unspecified CLOSED Dynflow executor termination may hang if there is an action which keeps the executor occupied 2021-02-22 00:41:40 UTC
Red Hat Knowledge Base (Solution) 3891181 0 None None None 2019-02-06 14:25:35 UTC
Red Hat Product Errata RHSA-2019:1222 0 None None None 2019-05-14 12:38:10 UTC

Internal Links: 1654975

Description Ivan Necas 2018-09-13 15:41:00 UTC
Description of problem:


Version-Release number of selected component (if applicable):


How reproducible:
Occasionally

Steps to Reproduce:
1. set a memory threshold on dynflow executor (EXECUTOR_MEMORY_LIMIT=2gb in /etc/sysconfig/foreman-tasks in sat 6.3, /etc/sysconfig/dynflowd in 6.4

2. run some substantial load on the tasking system continuous resyncing and publishing of CVs might be a good start
3. wait for the memory limit of the dynflow process to cross the limit - watch production.log for 'Memory level exceeded' message

Actual results:
The process still keeps running, termination is not finished, the tasks are not proceeding anymore

Expected results:
The process exits in timely manner and new one gets started

Additional info:
I've written the reproduced based on customer observations, haven't reproducer it locally in production just yet, we however know about places where we wait in the termination phase without any timeout

Comment 2 sthirugn@redhat.com 2018-09-14 15:31:56 UTC
Increasing the severity.

Comment 4 Ivan Necas 2018-09-24 12:23:47 UTC
Created redmine issue https://projects.theforeman.org/issues/25021 from this bug

Comment 5 Ivan Necas 2018-10-11 16:22:43 UTC
Fixed upstream in https://github.com/Dynflow/dynflow/pull/297

Comment 28 Mike McCune 2019-03-01 22:16:01 UTC
This bug was cloned and is still going to be included in the 6.4.3 release. It no longer has the sat-6.4.z+ flag and 6.4.3 Target Milestone Set which are now on the 6.4.z cloned bug. Please see the Clones field to track the progress of this bug in the 6.4.3 release.

Comment 31 errata-xmlrpc 2019-05-14 12:38:03 UTC
Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory, and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2019:1222


Note You need to log in before you can comment on or make changes to this bug.