Description of problem: A regression was introduced in the 5_3.4 z stream which causes the first message of a totem network to be thrown away when it shouldn't be. This causes sync not to finish and blocks the cluster with an error ERR_TRY_AGAIN Version-Release number of selected component (if applicable): openais-0.80.3-22.5_3.4 How reproducible: run cpgbench on 3 nodes. Kill a node. happens 20% of the time Steps to Reproduce: 1.run cpgbench on 3 nodes. 2. Kill 1 node. 3. system blocks indefinately. Actual results: system blocks. Expected results: system shouldn't block. Additional info: patch available fixed upstream.
~~ Attention - RHEL 5.4 Beta Released! ~~ RHEL 5.4 Beta has been released! There should be a fix present in the Beta release that addresses this particular request. Please test and report back results here, at your earliest convenience. RHEL 5.4 General Availability release is just around the corner! If you encounter any issues while testing Beta, please describe the issues you have encountered and set the bug into NEED_INFO. If you encounter new issues, please clone this bug to open a new issue and request it be reviewed for inclusion in RHEL 5.4 or a later update, if it is not of urgent severity. Please do not flip the bug status to VERIFIED. Only post your verification results, and if available, update Verified field with the appropriate value. Questions can be posted to this bug or your customer or partner representative.
An advisory has been issued which should help the problem described in this bug report. This report is therefore being closed with a resolution of ERRATA. For more information on therefore solution and/or where to find the updated files, please follow the link below. You may reopen this bug report if the solution does not work for you. http://rhn.redhat.com/errata/RHBA-2009-1366.html