Bug 490099 - Checkpoint service incorrectly calculates reference counts on checkpoints from leaving node
Summary: Checkpoint service incorrectly calculates reference counts on checkpoints fro...
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Enterprise Linux 5
Classification: Red Hat
Component: openais
Version: 5.3
Hardware: All
OS: Linux
urgent
medium
Target Milestone: rc
: ---
Assignee: Steven Dake
QA Contact: Cluster QE
URL:
Whiteboard:
Depends On:
Blocks: 491395
TreeView+ depends on / blocked
 
Reported: 2009-03-13 09:25 UTC by Steven Dake
Modified: 2016-04-26 15:49 UTC (History)
5 users (show)

Fixed In Version:
Doc Type: Bug Fix
Doc Text:
Clone Of:
Environment:
Last Closed: 2009-09-02 11:29:37 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Red Hat Product Errata RHBA-2009:1366 0 normal SHIPPED_LIVE openais bug-fix and enhancement update 2009-09-01 11:00:17 UTC

Description Steven Dake 2009-03-13 09:25:26 UTC
Description of problem:
Nodes that depart the cluster are incorrectly tracked in the refcount.  This means the checkpoint duration timer is never fired because reference counts are always > 1.

Version-Release number of selected component (if applicable):
openai-0.80.3-22 and below

How reproducible:
100%

Steps to Reproduce:
1.start two nodes, open checkpoint on each with test app, sleep in app continually
2.kill second node
3.white box instrumentation shows refcount at 2, when it should be at 1
  
Actual results:
refcount is higher then number of checkpoint opens in system and checkpoint is never garbage collected when all checkpoint opens are closed.

Expected results:
refcount should match number of checkpoint opens

Additional info:

Comment 1 Steven Dake 2009-03-13 09:25:49 UTC
patch available.

Comment 2 Irina Boverman 2009-03-20 18:29:48 UTC
please add this bug to the rhel 5.3.x errata currently in testing.

Comment 6 errata-xmlrpc 2009-09-02 11:29:37 UTC
An advisory has been issued which should help the problem
described in this bug report. This report is therefore being
closed with a resolution of ERRATA. For more information
on therefore solution and/or where to find the updated files,
please follow the link below. You may reopen this bug report
if the solution does not work for you.

http://rhn.redhat.com/errata/RHBA-2009-1366.html


Note You need to log in before you can comment on or make changes to this bug.