Bug 526847
| Summary: | condor_startd SEGV when deleting dynamic slots | ||
|---|---|---|---|
| Product: | Red Hat Enterprise MRG | Reporter: | Matthew Farrellee <matt> |
| Component: | condor | Assignee: | Matthew Farrellee <matt> |
| Status: | CLOSED ERRATA | QA Contact: | Luigi Toscano <ltoscano> |
| Severity: | high | Docs Contact: | |
| Priority: | high | ||
| Version: | 1.1 | CC: | lbrindle, ltoscano |
| Target Milestone: | 1.2 | ||
| Target Release: | --- | ||
| Hardware: | All | ||
| OS: | Linux | ||
| Whiteboard: | |||
| Fixed In Version: | Doc Type: | Bug Fix | |
| Doc Text: |
Grid bug fix
C: An update timer is registered for a resource (a dynamic slot) that has been deleted.
C: The condor_startd crashes when the timer fires.
F: The timer is no longer registered.
R: Dynamic slots can be deleted without creating a crash.
When an update timer was registered for a resource (a dynamic slot) that had been deleted, the condor_startd crashed when the timer fired. The timer is no longer registered, which means that dynamic slots can be deleted without creating a crash.
|
Story Points: | --- |
| Clone Of: | Environment: | ||
| Last Closed: | 2009-12-03 09:18:03 UTC | Type: | --- |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | Category: | --- | |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | |||
| Bug Depends On: | |||
| Bug Blocks: | 527551 | ||
|
Description
Matthew Farrellee
2009-10-02 02:58:04 UTC
Cause of the SEGV happens before the stack shown, with the registration of an update timer on a Resource (a slot) that has been deleted. The eval_and_update_all timer calls Resource::eval_and_update which calls eval_state resulting in the Resource being deleted and then update, which registers the timer in the stack. This is fixed upstream and will be built into 7.4.0-0.6 commit 0d5e3ad8fc85f0cd0dc58f73b503c76c0ad49bc4 Author: Matthew Farrellee <matt> Date: Thu Oct 1 22:22:08 2009 -0400 Release note added. If any revisions are required, please set the "requires_release_notes" flag to "?" and edit the "Release Notes" field accordingly. All revisions will be proofread by the Engineering Content Services team. New Contents: please see bug summary. The crash does often show itself running the said configuration on condor-7.2.2-0.9, RHEL5.4 and 4.8, both i386 and x86_64. On the same machines that crash has disappeared with condor-7.4.1-0.2. Changing the state to VERIFIED. Release note updated. If any revisions are required, please set the "requires_release_notes" flag to "?" and edit the "Release Notes" field accordingly. All revisions will be proofread by the Engineering Content Services team. Diffed Contents: @@ -1 +1,8 @@ -please see bug summary.+Grid bug fix + +C: And update timer is registered on a resource (a dynamic slot) that has been deleted. +C: The condor_startd crashes +F: +R: Dynamic slots can be deleted without creating a crash. + +MORE INFORMATION REQUIRED FOR RELNOTE. Release note updated. If any revisions are required, please set the "requires_release_notes" flag to "?" and edit the "Release Notes" field accordingly. All revisions will be proofread by the Engineering Content Services team. Diffed Contents: @@ -1,8 +1,6 @@ Grid bug fix -C: And update timer is registered on a resource (a dynamic slot) that has been deleted. +C: And update timer is registered for a resource (a dynamic slot) that has been deleted. -C: The condor_startd crashes +C: The condor_startd crashes when the timer fires. -F: +F: The timer is no longer registered. -R: Dynamic slots can be deleted without creating a crash. +R: Dynamic slots can be deleted without creating a crash.- -MORE INFORMATION REQUIRED FOR RELNOTE. Release note updated. If any revisions are required, please set the "requires_release_notes" flag to "?" and edit the "Release Notes" field accordingly. All revisions will be proofread by the Engineering Content Services team. Diffed Contents: @@ -1,6 +1,8 @@ Grid bug fix -C: And update timer is registered for a resource (a dynamic slot) that has been deleted. +C: An update timer is registered for a resource (a dynamic slot) that has been deleted. C: The condor_startd crashes when the timer fires. F: The timer is no longer registered. -R: Dynamic slots can be deleted without creating a crash.+R: Dynamic slots can be deleted without creating a crash. + +When an update timer was registered for a resource (a dynamic slot) that had been deleted, the condor_startd crashed when the timer fired. The timer is no longer registered, which means that dynamic slots can be deleted without creating a crash. An advisory has been issued which should help the problem described in this bug report. This report is therefore being closed with a resolution of ERRATA. For more information on therefore solution and/or where to find the updated files, please follow the link below. You may reopen this bug report if the solution does not work for you. http://rhn.redhat.com/errata/RHEA-2009-1633.html |