Bug 108827 - RHEL4: Infiniband support
Summary: RHEL4: Infiniband support
Keywords:
Status: CLOSED ERRATA
Alias: None
Product: Red Hat Enterprise Linux 4
Classification: Red Hat
Component: kernel
Version: 4.0
Hardware: All
OS: Linux
high
medium
Target Milestone: ---
: ---
Assignee: Doug Ledford
QA Contact:
URL: IT_10188 (Dell); IT_14002, IT_14003, ...
Whiteboard: Kernel
: 108541 109127 113652 (view as bug list)
Depends On: 158747
Blocks: 130530 168430 172741 173386 175120
TreeView+ depends on / blocked
 
Reported: 2003-11-02 17:22 UTC by Susan Denham
Modified: 2018-11-14 18:20 UTC (History)
11 users (show)

Fixed In Version: RHSA-2006-0132
Doc Type: Enhancement
Doc Text:
Clone Of:
Environment:
Last Closed: 2006-03-07 18:29:07 UTC
Target Upstream Version:
Embargoed:


Attachments (Terms of Use)


Links
System ID Private Priority Status Summary Last Updated
Red Hat Product Errata RHBA-2006:0013 0 qe-ready SHIPPED_LIVE udev bug fix update 2006-03-07 05:00:00 UTC
Red Hat Product Errata RHEA-2006:0136 0 qe-ready SHIPPED_LIVE OpenIB Infiniband networking technology preview 2006-03-06 05:00:00 UTC
Red Hat Product Errata RHSA-2005:808 0 normal SHIPPED_LIVE Important: kernel security update 2005-10-27 04:00:00 UTC
Red Hat Product Errata RHSA-2006:0016 0 qe-ready SHIPPED_LIVE Moderate: initscripts security update 2006-03-07 05:00:00 UTC
Red Hat Product Errata RHSA-2006:0132 0 qe-ready SHIPPED_LIVE Moderate: Updated kernel packages available for Red Hat Enterprise Linux 4 Update 3 2006-03-09 16:31:00 UTC

Description Susan Denham 2003-11-02 17:22:12 UTC
Arjan noted (and Steve Carbonari of Intel didn't object) that Infiniband is not
on the Intel chipset disclosure documents for CY04.  Arjan feels that 10GB
ethernet and similar technologies will supersede Infiniband.

Comment 2 Susan Denham 2003-11-02 21:48:28 UTC
*** Bug 108541 has been marked as a duplicate of this bug. ***

Comment 3 Susan Denham 2003-11-02 21:50:18 UTC
IBM specific info:  Priority: 1
Status: low-latency connection.  RAC interconnect and middle-tier to DB
connections are sensitive to latency and speed.
Description: Include Infiniband drivers
Business Justification: Provides a faster interconnect.

Comment 5 Bob Johnson 2003-11-05 00:19:51 UTC
*** Bug 109127 has been marked as a duplicate of this bug. ***

Comment 6 Tim Burke 2004-01-19 01:53:02 UTC
*** Bug 113652 has been marked as a duplicate of this bug. ***

Comment 8 Susan Denham 2004-03-03 17:49:29 UTC
Sun wants Infiniband as a backplane interconnect rather than as a replacement
for 10GB ethernet.  

The request for Infiniband doesn't seem to be going away from any of the OEMs....

Comment 11 Larry Troan 2004-05-12 17:59:36 UTC
Adding HP

Comment 12 Arjan van de Ven 2004-05-19 11:31:12 UTC
significantly doubtful for RHEL4. There are like 3 codebases, all really really bad.

Comment 13 Larry Troan 2004-05-20 18:29:45 UTC
Adding SLAC

Comment 14 Larry Troan 2004-05-20 18:41:41 UTC
From PM:

Based on Arjan's comments, this one looks like it's too ill-defined and the code
is in too poor a shape for RHEL4 consideration. If it turns out that Topspin and
its 10G patch are accepted upstream and it passes Community code inspection
quality, then we should reconsifder.

For now , setting status to CLOSE=DEFERRED. Suggest we consider again for RHEL5.

Comment 15 Ben Woodard 2004-05-20 19:52:16 UTC
I believe Larry's comment is unacceptable in this case. In this case the
business justification is so compelling in the time frame of RHEL4 that we need
to take a proactive stance on the issue and work with the people at TopSpin and
the other OpenIB partners to bring their code into shape.

For this issue Red Hat must act not as an aggregator of already exant technology
but a contributor to the technology to bring it in line with acceptable upstream
coding standards. Virtually every big partner that we have is asking for this
feature and we have lost significant market advantage if we try to put this
feature out to RHEL5. 

Comment 16 Ben Woodard 2004-05-20 20:14:26 UTC
I think that all parties will agree that Infiniband is not ready yet. The big
thing that we need to decide as a company is are we going to be a market leader
deploying innovative technology in response to customer needs or are we going to
be a follower and just gather together the already extant technology and resell it.

There is a very notable advantage in being the first to market and this should
not be discounted. At the moment SuSE wins every single time in those races.

Comment 25 Robert Perkins 2004-06-22 15:53:04 UTC
Right now this is stuck for RHEL4 and will not be delivered.
We need hardware.
We need engineers to implement the IB upstream, and port it back to RHEL3.
This is a good candidate for a separate project for Red Hat to take on, but it
needs a Marketing Reqts Doc or some other justification as well as a project
plan and schedule.  Will mark this as a key feature for RHEL4 timeframe, perhaps
as a layered add-on.  

Could this be released as a vendor-supported driver?


Comment 27 Ben Woodard 2004-07-08 00:45:31 UTC
First of all we need the low level drivers that support the hardware.

We would like IP networking over IB. We run NFS and other protocols over this
network interface.

We also need text based IB configuration tools sort of like ifconfig, ethtool
and ethdiag.

MPI over IB is really important.

low level kernel data transfer capabilities are also necessary. On top of this
the CFS people will build the interface that lustre uses.

API that the batch job scheduler will use to initialize the MPI interfaces for
the job.

Comment 28 Ben Woodard 2004-07-08 01:03:35 UTC
Along with the development of the software stack, they want an integrated
testing suite and regression tests.

http://www.llnl.gov/asci/pathforward_trilab/iba_rfp/OSSODA_IBA_RFPSOW_DRAFT.pdf
http://www.llnl.gov/asci/pathforward_trilab/iba_rfp/OSSODA_IBA_RFP_EVAL_PREP_DRAFT.pdf

The way that they put it in the RFP on page 5 is:

1) complete InfiniBand access layer including host channel adapter (HCA) device
drivers, user and kernel space Verbs interface, communication management,
resource management; 
2) subnet management and subnet administration; and 
3) HPC upper layer protocols for MPI, IPoIB, SDP, and Sandia Portals.

Comment 31 Ben Woodard 2004-07-12 16:18:52 UTC
For reference regarding SDP:

To: Ben Woodard <woodard>
Subject: Re: [Fwd: (UPDATED) Issue #36125 ([RHEL 4] Implement Infiniband
        Support)[FEG-RHEL4/Feature Escalations Group/LLNL (HPC)]]
X-Message-Flag: Warning: May contain useful information
References: <1089328133.16745.1512.camel.gov>
From: Roland Dreier <roland>
Date: Thu, 08 Jul 2004 21:42:08 -0700
In-Reply-To: <1089328133.16745.1512.camel.gov> (Ben Woodard's
message of "Thu, 08 Jul 2004 16:08:53 -0700")
Message-ID: <52vfgxbxlr.fsf>
User-Agent: Gnus/5.1006 (Gnus v5.10.6) XEmacs/21.4 (Security Through
        Obscurity, linux)
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
X-OriginalArrivalTime: 09 Jul 2004 04:42:10.0152 (UTC)
        FILETIME=[1896AE80:01C4656F]
X-RedHat-Spam-Score: 0
X-Evolution-Source: imap://woodard.redhat.com/
Content-Transfer-Encoding: 8bit
 
This is something I wrote up a while ago about SDP:
 
SDP intellectual property
 
  Microsoft believes that they own certain intellectual property
  relating to the Sockets Direct Protocol (SDP)[2].  Therefore, we are
  including the following disclaimer required by Microsoft's license
  in SDP source that relates to the implementation of the protocol:
 
    "This source code may incorporate intellectual property owned by
    Microsoft Corporation. Our provision of this source code does not
    include any licenses or any other rights to you under any
    Microsoft intellectual property. If you would like a license from
    Microsoft (e.g., to rebrand, redistribute), you need to contact
    Microsoft directly."
 
  We realize that this is incompatible with open source licensing, and
  we are working to find a more satisfactory solution, but for the
  time being we are forced to comply with Microsoft's license.
 
  Please make sure you have fully understood the implications of
  Microsoft's claims before you redistribute any of the SDP source
  that contains the above disclaimer.
 
  [2]  <http://www.microsoft.com/mscorp/ip/standards/>


Comment 35 Robert Perkins 2004-08-04 12:08:47 UTC
PM is managing this from an overall program standpoint, in/out/when.  Greg is
project prime and will report on the Infiniband plans.

Comment 38 Robert Perkins 2004-08-17 15:49:55 UTC
Greg, please update with:
1.  What we are doing for RHEL4 or RHEL4 U1 (specify release)
    Please provide a synopsis of the functionality we are delivering as part of
the RFP.

2.  If we are delivering on RHEL3 U4, please provide a new FZ and provide the
same sort of info.

Thanks, this is a hot item for LLNL as you know and we have a SLA to meet to
respond to feature queries.  Also, many other vendors are interested in this.

Comment 49 Larry Troan 2004-10-06 19:53:05 UTC
Per Rob Perkins, plan is to support Infiniband via Vendor Provided "top spin"
driver.

Comment 57 Tim Burke 2005-03-02 15:53:54 UTC
Not in U1.

Being reviewed for a later update.  Still not mature enough upstream.  Unlikely
to be in U2.

Comment 63 Tim Burke 2005-07-07 21:55:55 UTC
This is not in U2. We are pursuing initial IB support in the context of U3.

Comment 97 Red Hat Bugzilla 2006-03-07 16:43:58 UTC
An advisory has been issued which should help the problem
described in this bug report. This report is therefore being
closed with a resolution of ERRATA. For more information
on the solution and/or where to find the updated files,
please follow the link below. You may reopen this bug report
if the solution does not work for you.

http://rhn.redhat.com/errata/RHBA-2006-0013.html


Comment 98 Red Hat Bugzilla 2006-03-07 18:18:04 UTC
An advisory has been issued which should help the problem
described in this bug report. This report is therefore being
closed with a resolution of ERRATA. For more information
on the solution and/or where to find the updated files,
please follow the link below. You may reopen this bug report
if the solution does not work for you.

http://rhn.redhat.com/errata/RHEA-2006-0136.html


Comment 99 Red Hat Bugzilla 2006-03-07 18:18:26 UTC
An advisory has been issued which should help the problem
described in this bug report. This report is therefore being
closed with a resolution of ERRATA. For more information
on the solution and/or where to find the updated files,
please follow the link below. You may reopen this bug report
if the solution does not work for you.

http://rhn.redhat.com/errata/RHSA-2006-0016.html


Comment 100 Red Hat Bugzilla 2006-03-07 18:29:08 UTC
An advisory has been issued which should help the problem
described in this bug report. This report is therefore being
closed with a resolution of ERRATA. For more information
on the solution and/or where to find the updated files,
please follow the link below. You may reopen this bug report
if the solution does not work for you.

http://rhn.redhat.com/errata/RHSA-2006-0132.html



Note You need to log in before you can comment on or make changes to this bug.