Bug 164577 - Piranha does not detect link-down events which would prevent proper operation
Summary: Piranha does not detect link-down events which would prevent proper operation
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Cluster Suite
Classification: Retired
Component: piranha
Version: 4
Hardware: All
OS: Linux
Priority: medium
Severity: medium
Target Milestone: ---
Assignee: Lon Hohberger
QA Contact: Cluster QE
URL:
Whiteboard:
Depends On:
Blocks:
Reported: 2005-07-28 20:34 UTC by Lon Hohberger
Modified: 2009-04-16 20:17 UTC (History)

Fixed In Version: RHBA-2005-746
Clone Of:
Environment:
Last Closed: 2005-10-07 16:47:11 UTC
Embargoed:


Attachments
Implementation for 0.8.0 (RHCS4) (12.80 KB, patch)
2005-07-28 20:35 UTC, Lon Hohberger


Links
System: Red Hat Product Errata
ID: RHBA-2005:746
Private: 0
Priority: normal
Status: SHIPPED_LIVE
Summary: piranha bug fix update
Last Updated: 2005-10-07 04:00:00 UTC

Description Lon Hohberger 2005-07-28 20:34:46 UTC
+++ This bug was initially created as a clone of Bug #164576 +++

Description of problem:

When piranha is used with a public + private NIC, any loss of a network path on
the master director constitutes a service outage.

For example, consider a normal NAT piranha cluster.  The NICs on the left side
are on the public network, and those on the right side are on the private
network.  The primary node is active:

                        +--> rs0
   <---> primary <----> |
          |   |         +--> rs1
   <---> backup  <----> |
                        +--> rs2

If we disconnect the public NIC on the primary node, heartbeats are still being
received over the private network, and thus no failover occurs.  This causes an
outage.

                        +--> rs0
   X---X primary <----> |
          X   |         +--> rs1
   <---> backup  <----> |
                        +--> rs2

The same thing happens if we disconnect the private NIC: heartbeats are still
being sent over the public interface, so no failover occurs.  

                        +--> rs0
   <---> primary X----X |
          |   X         +--> rs1
   <---> backup  <----> |
                        +--> rs2

This reduces the availability of the piranha director cluster.
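
The gap between the two checks can be illustrated with a small sketch.  This is
not the attached patch; the function names and the "public"/"private" labels
are hypothetical and exist only to model the three diagrams above.  It assumes
that heartbeats get through whenever at least one network path survives, while
a director can only serve traffic when every required link is up.

```python
from typing import Mapping, Sequence


def heartbeat_reaches_backup(link_up: Mapping[str, bool]) -> bool:
    """Model of the heartbeat-only check: one surviving path is enough."""
    return any(link_up.values())


def all_required_links_up(link_up: Mapping[str, bool],
                          required: Sequence[str]) -> bool:
    """True only if every required link is up; unknown links count as down."""
    return all(link_up.get(name, False) for name in required)


def should_fail_over(link_up: Mapping[str, bool],
                     required: Sequence[str]) -> bool:
    """Hypothetical rule: fail over as soon as any required link is lost."""
    return not all_required_links_up(link_up, required)


if __name__ == "__main__":
    required = ("public", "private")
    scenarios = {
        "all links up":     {"public": True,  "private": True},
        "public NIC down":  {"public": False, "private": True},
        "private NIC down": {"public": True,  "private": False},
    }
    for name, links in scenarios.items():
        print(name,
              "| heartbeat seen:", heartbeat_reaches_backup(links),
              "| should fail over:", should_fail_over(links, required))
```

In both single-NIC failure cases the heartbeat check still passes while the
link check asks for a failover, which is the mismatch this bug describes.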


Version-Release number of selected component (if applicable): 0.7.10, 0.8.0


How reproducible: 100%


Steps to Reproduce:
1. Create a piranha cluster with a backup.  Configure the IPs for the public
NICs and private NICs.
2. Start piranha on both nodes.
3. Unplug either the public or private NIC on the master server.
  
Actual results:
No failover.  Any virtual services will malfunction.

Expected results:
Failover.  Virtual services should be made available.
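
To get the expected failover, the director has to notice the link-down event
itself.  One possible way to do that, sketched below, is to read the carrier
attribute that Linux kernels providing /sys/class/net/<iface>/carrier expose.
This is an assumption for illustration only: the attached patch may use a
different mechanism, and the helper names and the interface names eth0/eth1 are
hypothetical.

```python
from pathlib import Path
from typing import Iterable, List, Optional

# Standard sysfs location on Linux; passed as a parameter so that the
# helpers can be exercised against a fake directory tree as well.
SYSFS_NET = Path("/sys/class/net")


def link_is_up(iface: str, sysfs_root: Path = SYSFS_NET) -> Optional[bool]:
    """Return True/False for carrier present/absent, None if unknown."""
    try:
        text = (sysfs_root / iface / "carrier").read_text().strip()
    except OSError:
        # The interface does not exist, or the kernel refused the read
        # (for example because the interface is administratively down).
        return None
    if text == "1":
        return True
    if text == "0":
        return False
    return None


def down_links(ifaces: Iterable[str],
               sysfs_root: Path = SYSFS_NET) -> List[str]:
    """Names of interfaces whose link is not positively known to be up."""
    return [i for i in ifaces if link_is_up(i, sysfs_root) is not True]


if __name__ == "__main__":
    # eth0/eth1 stand in for the public and private NICs (hypothetical).
    lost = down_links(["eth0", "eth1"])
    if lost:
        print("link down or unknown on:", ", ".join(lost))
    else:
        print("all monitored links are up")
```

Treating an unknown state as down is a conservative choice: it favours a
possibly unnecessary failover over a silent outage.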


Additional info:

Comment 1 Lon Hohberger 2005-07-28 20:35:55 UTC
Created attachment 117243
Implementation for 0.8.0 (RHCS4)

Comment 4 Red Hat Bugzilla 2005-10-07 16:47:11 UTC
An advisory has been issued which should help the problem
described in this bug report. This report is therefore being
closed with a resolution of ERRATA. For more information
on the solution and/or where to find the updated files,
please follow the link below. You may reopen this bug report
if the solution does not work for you.

http://rhn.redhat.com/errata/RHBA-2005-746.html


