Hide Forgot
Description of problem: 1) There is need for corosync RRP to have different mcast address for all NICs 2) Default threshold should be set to more reasonable value 3 Version-Release number of selected component (if applicable): Newest for 6.2 How reproducible: See https://bugzilla.redhat.com/show_bug.cgi?id=722469 Additional info: Threshold key is totem.rrp_problem_count_threshold Expected results: corosync-objctl -a | grep rrp_proble -> 3
Created attachment 522060 [details] proposed patch Unit test results: 0) no altname, no <cman/> Multicast addresses: 239.192.99.73 [root@clusternet-node2 ~]# corosync-objctl |grep rrp totem.rrp_mode=none 1) autoselection: no <cman/> <altname name="clusternet-node1-eth2"/> <altname name="clusternet-node2-eth2"/> Multicast addresses: 239.192.99.73 239.192.99.74 [root@clusternet-node2 ~]# corosync-objctl |grep rrp totem.rrp_mode=passive totem.rrp_problem_count_threshold=3 2) force cman main multicast on primary interface <cman> <multicast addr="239.192.100.1"/> </cman> <clusternodes> <clusternode name="clusternet-node1-eth1" votes="1" nodeid="1"> <altname name="clusternet-node1-eth2"/> [root@clusternet-node2 daemon]# cman_tool status Multicast addresses: 239.192.100.1 239.192.99.74 [root@clusternet-node2 ~]# corosync-objctl |grep rrp totem.rrp_mode=passive totem.rrp_problem_count_threshold=3 3) force mcast all over (note that altname need mcast per node!) <cman> <multicast addr="239.192.100.1"/> </cman> <altname name="clusternet-node1-eth2" mcast="239.192.100.2"/> <altname name="clusternet-node2-eth2" mcast="239.192.100.2"/> [root@clusternet-node2 daemon]# cman_tool status Multicast addresses: 239.192.100.1 239.192.100.2 [root@clusternet-node2 ~]# corosync-objctl |grep rrp totem.rrp_mode=passive totem.rrp_problem_count_threshold=3
Created attachment 522068 [details] new patch The previous patch had an error when <totem tag was present in cluster.conf. Previous unit test is still valid. Add the following cases: <totem rrp_mode="active"/> [root@clusternet-node2 xml]# corosync-objctl |grep rrp cluster.totem.rrp_mode=active totem.rrp_mode=active totem.rrp_problem_count_threshold=3 <totem rrp_mode="active" rrp_problem_count_threshold="10"/> [root@clusternet-node2 xml]# corosync-objctl |grep rrp cluster.totem.rrp_mode=active cluster.totem.rrp_problem_count_threshold=10 totem.rrp_mode=active totem.rrp_problem_count_threshold=10
The "today I just don´t know how to copy/paste" <totem rrp_problem_count_threshold="10"/> [root@clusternet-node2 xml]# corosync-objctl |grep rrp cluster.totem.rrp_problem_count_threshold=10 totem.rrp_problem_count_threshold=10 totem.rrp_mode=passive
Technical note added. If any revisions are required, please edit the "Technical Notes" field accordingly. All revisions will be proofread by the Engineering Content Services team. New Contents: Improve integration between cman and corosync for Redundant Ring Protocol
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory, and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. http://rhn.redhat.com/errata/RHBA-2011-1516.html