Red Hat Bugzilla – Bug 597362
Sporadic failure of check-long in cluster_tests.py test_failover
Last modified: 2010-10-14 12:04:22 EDT
Description of problem:
Version-Release number of selected component (if applicable):
How reproducible: easy
Steps to Reproduce:
1. while make check-long LONG_TESTS=run_long_cluster_tests; do true; done
Eventually halts with an error
Based on the logs it appears that a new broker was joining the cluster when an error caused the updater to retract its update. The new broker exits with an error "update connection closed prematurely"
Fixed on trunk r949767 and ported to mrg_1.3.x branch:
Run indefinite loop with the test on both RHEL5 (i386, x86_64) machines.
Tested with (version):
RHEL5 i386,x86_64 - passed
Technical note added. If any revisions are required, please edit the "Technical Notes" field
accordingly. All revisions will be proofread by the Engineering Content Services team.
C: Bug in code handling cluster update.
C: Broker joining a cluster at the same time that the cluster is handling an error failed with "update connection closed prematurely"
F: Fixed the update code.
R: Brokers can join a cluster successfully even when the cluster is handling an error.
An advisory has been issued which should help the problem
described in this bug report. This report is therefore being
closed with a resolution of ERRATA. For more information
on therefore solution and/or where to find the updated files,
please follow the link below. You may reopen this bug report
if the solution does not work for you.