Bug 577365
Summary: | python-linux-procfs: python traceback while monitoring system | ||
---|---|---|---|
Product: | Red Hat Enterprise MRG | Reporter: | Clark Williams <williams> |
Component: | realtime-utilities | Assignee: | Arnaldo Carvalho de Melo <acme> |
Status: | CLOSED ERRATA | QA Contact: | David Sommerseth <davids> |
Severity: | medium | Docs Contact: | |
Priority: | low | ||
Version: | 1.2 | CC: | bhu, lgoncalv, ovasik |
Target Milestone: | --- | ||
Target Release: | --- | ||
Hardware: | All | ||
OS: | Linux | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | Bug Fix | |
Doc Text: |
On systems with a large number of CPUs (24 and more), Tuna may have attempted to read procfs data for a terminated process and terminate unexpectedly. With this update, Tuna has been modified to catch an exception and remove the terminated process from its data structures.
|
Last Closed: | 2010-10-11 15:10:29 UTC | Type: | --- |
Description
Clark Williams
2010-03-26 19:11:46 UTC
The problem is in python-linux-procfs. Committed a fix upstream and will provide a package to test on this machine.

Tested with the istambul machine, couldn't reproduce. Also tested locally with a machine with 10 Gbit/s cards, brew build at: https://brewweb.devel.redhat.com/taskinfo?taskID=2433122

Will tag after some more testing.

Technical note added. If any revisions are required, please edit the "Technical Notes" field accordingly. All revisions will be proofread by the Engineering Content Services team.

New Contents:
* Cause:
* Consequence: Python backtrace
* Fix:
* Result: Works properly on large (>=24 core) cpu systems

Technical note updated.

Diffed Contents:
@@ -1,4 +1,4 @@
-* Cause:
+* Cause: Processes can terminate while its procfs data is being read
 * Consequence: Python backtrace
-* Fix:
+* Fix: Catch exception and remove dead process from data structures
 * Result: Works properly on large (>=24 core) cpu systems

Tried running tuna-0.9.2-1 and python-linux-procfs-0.4.2-1 on a 32-core box for 30 minutes without triggering this bug. Ran tuna-0.9.4-1 and python-linux-procfs-0.4.5-1 for over 1 hour without any issues. As it seems to work reliably -> moving to VERIFIED.

Technical note updated.

Diffed Contents:
@@ -1,4 +1 @@
-* Cause: Processes can terminate while its procfs data is being read
+On systems with a large number of CPUs (24 and more), Tuna may have attempted to read procfs data for a terminated process and terminate unexpectedly. With this update, Tuna has been modified to catch an exception and remove the terminated process from its data structures.
-* Consequence: Python backtrace
-* Fix: Catch exception and remove dead process from data structures
-* Result: Works properly on large (>=24 core) cpu systems

An advisory has been issued which should help the problem described in this bug report. This report is therefore being closed with a resolution of ERRATA. For more information on the solution and/or where to find the updated files, please follow the link below. You may reopen this bug report if the solution does not work for you.

http://rhn.redhat.com/errata/RHBA-2010-0762.html
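
For readers unfamiliar with the race described in the technical note (a process can exit between being listed and having its procfs data read), the general shape of the fix, "catch exception and remove dead process from data structures", might look roughly like the sketch below. This is not the actual python-linux-procfs patch: the function name refresh_processes, the proc_root parameter, and the dict layout are hypothetical and chosen only for illustration, and the code is written for Python 3 rather than the Python version shipped at the time.

```python
import os


def refresh_processes(processes, proc_root="/proc"):
    """Re-read the stat line of every tracked PID, dropping PIDs that exited.

    `processes` is a hypothetical dict mapping PID -> last stat line read.
    `proc_root` is an illustrative parameter so the sketch can be tested
    against a fake directory tree instead of the real /proc.
    """
    # Iterate over a copy of the keys because entries are deleted in the loop.
    for pid in list(processes):
        stat_path = os.path.join(proc_root, str(pid), "stat")
        try:
            with open(stat_path) as stat_file:
                processes[pid] = stat_file.read().strip()
        except (FileNotFoundError, ProcessLookupError):
            # The process terminated after it was listed: instead of letting
            # the exception propagate as a traceback, forget the dead PID.
            del processes[pid]
    return processes
```

Calling refresh_processes(tracked) on each monitoring pass would leave only live PIDs in tracked. The race window is more likely to be hit on machines with many CPUs and many short-lived threads, which is consistent with the report only reproducing on large systems.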