Bug 588898

Summary: if a node originates more then 512 messages in recovery it will sigabort (assert)
Product: Red Hat Enterprise Linux 5 Reporter: Benjamin Kahn <bkahn>
Component: openaisAssignee: Steven Dake <sdake>
Status: CLOSED ERRATA QA Contact: Cluster QE <mspqa-list>
Severity: urgent Docs Contact:
Priority: urgent    
Version: 5.5CC: cluster-maint, edamato, jwest, pm-eus, sdake
Target Milestone: rcKeywords: ZStream
Target Release: ---   
Hardware: All   
OS: Linux   
Whiteboard:
Fixed In Version: openais-0.80.6-8.el5_4.6 Doc Type: Bug Fix
Doc Text:
In high-loss networks, an assert based on a constant value for the retransmit message queue size could have caused a some nodes to receive SIGABRT signals, and therefore terminate. This constant value has been increased to correspond to the maximum number of entries, thus resolving the issue.
Story Points: ---
Clone Of: Environment:
Last Closed: 2010-06-16 10:17:49 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On: 588500    
Bug Blocks:    
Attachments:
Description Flags
revision 2129 backported none

Description Benjamin Kahn 2010-05-04 19:18:32 UTC
This bug has been copied from bug #588500 and has been proposed
to be backported to 5.4 z-stream (EUS).

Comment 3 Steven Dake 2010-05-04 20:15:23 UTC
Created attachment 411398 [details]
revision 2129 backported

Comment 6 errata-xmlrpc 2010-06-16 10:17:49 UTC
An advisory has been issued which should help the problem
described in this bug report. This report is therefore being
closed with a resolution of ERRATA. For more information
on therefore solution and/or where to find the updated files,
please follow the link below. You may reopen this bug report
if the solution does not work for you.

http://rhn.redhat.com/errata/RHBA-2010-0485.html

Comment 7 Steven Dake 2010-12-21 07:11:46 UTC
*** Bug 647291 has been marked as a duplicate of this bug. ***

Comment 8 Douglas Silas 2011-01-11 23:16:54 UTC
    Technical note added. If any revisions are required, please edit the "Technical Notes" field
    accordingly. All revisions will be proofread by the Engineering Content Services team.
    
    New Contents:
In high-loss networks, an assert based on a constant value for the retransmit message queue size could have caused a some nodes to receive SIGABRT signals, and therefore terminate. This constant value has been increased to correspond to the maximum number of entries, thus resolving the issue.