Bug 1614452
Summary: | [Intel OSP16][RSD] Pooled FPGA over PCIe | ||
---|---|---|---|
Product: | Red Hat OpenStack | Reporter: | Krish Raghuram <krishnan.raghuram> |
Component: | openstack-nova | Assignee: | OSP DFG:Compute <osp-dfg-compute> |
Status: | CLOSED DEFERRED | QA Contact: | OSP DFG:Compute <osp-dfg-compute> |
Severity: | high | Docs Contact: | |
Priority: | medium | ||
Version: | 16.0 (Train) | CC: | dasmith, egallen, eglynn, jhakimra, kchamart, krishnan.raghuram, lin.a.yang, lyarwood, pchavva, pragyansri.pathi, sbauza, sgordon, smooney, srevivo, sundar.nadathur, vromanso |
Target Milestone: | --- | Keywords: | FutureFeature, Triaged |
Target Release: | --- | ||
Hardware: | x86_64 | ||
OS: | Linux | ||
Whiteboard: | |||
Fixed In Version: | Doc Type: | If docs needed, set a value | |
Doc Text: | Story Points: | --- | |
Clone Of: | Environment: | ||
Last Closed: | 2019-02-26 17:47:18 UTC | Type: | Bug |
Bug Depends On: | 1562173 | ||
Bug Blocks: | 1595325, 1636090, 1414581 |
Description
Krish Raghuram
2018-08-09 15:27:34 UTC
Hi Krish.
This feature request has several unresolved dependencies.

First, Cyborg is not currently a supported project in OSP and is not currently targeted to be added in OSP 15. Can you open a separate Bugzilla to track that request and add it as a dependency of this request?

Adding Cyborg as a supported project is not trivial: it would require packaging the project as an RPM, adding a set of Cyborg containers to Kolla, and then integrating the deployment of those containers with TripleO/director.

In addition to the generic Cyborg support above, OSP director would have to be enhanced to configure the Cyborg agent with the credentials for the PODM to enable this feature.

With that in mind, this is likely an OSP-next-next intercept, not OSP 15.

As you indicated, this feature depends on upstream changes to Nova and Cyborg that are yet to be implemented. When you have that information available, please update this ticket with the relevant blueprints/reviews.

Finally, from my reading of the request, we would require a specific hardware configuration to develop and validate this feature request. In particular, a minimum of the following:
- 1 network switch for management/provisioning.
- 1 RSD 2.4 compatible PODM (could be deployed in a VM if reference code is used; otherwise this is an appliance).
- 1 RSD 2.4 compatible PCIe switch with PSME.
- 1+ RSD 2.4 compatible compute drawer with external PCIe backplane support.
- 1+ RSD 2.4 compatible FPGA drawer with external PCIe backplane support.
- 1+ FPGAs that are compatible with both RSD 2.4 and the Cyborg agent.
- 1+ standard servers for the OSP control plane and standard compute nodes.

Can you provide a detailed description of the hardware and topology required to deploy and test this feature, and indicate whether Intel would be able to provide a minimal RSD system as described above, or access to one in a lab, for the development and validation of this feature request?

(In reply to smooney from comment #1)

Sean, the basic Cyborg request is already at https://bugzilla.redhat.com/show_bug.cgi?id=1562173

Lin Yang will add links to the BPs or specs as they are submitted.

I will have to discuss the hardware availability with the team and get back to you. I believe Red Hat has had access to an RSD rack in one of our labs in the past and probably still does. I'll investigate.

Thanks, Krish. I did not see that in my Bugzilla query. I have added it as a dependency.

As discussed in the Sep 27th engineering meeting: We understand Red Hat has moved this to RH OSP16. FYI - Intel continues to work on this.

Red Hat has moved to OSP16 (enhancement work of Cyborg). Revisit based on upstream status/customer use case.

We are de-prioritizing this in favor of FPGA pooling over Ethernet fabric. Will open a separate BZ for the latter.

Closed based on the feedback from Intel. BZ can be closed, as this project is being changed.