Bug 1892128
| Summary: | [RHEL-8.4][RDMA] rebuild openmpi against latest ucx package | ||
|---|---|---|---|
| Product: | Red Hat Enterprise Linux 8 | Reporter: | Honggang LI <honli> |
| Component: | openmpi | Assignee: | Honggang LI <honli> |
| Status: | CLOSED ERRATA | QA Contact: | Brian Chae <bchae> |
| Severity: | unspecified | Docs Contact: | |
| Priority: | unspecified | ||
| Version: | 8.4 | CC: | bchae, rdma-dev-team, tmichael |
| Target Milestone: | rc | Keywords: | Triaged |
| Target Release: | 8.4 | Flags: | pm-rhel: mirror+ |
| Hardware: | Unspecified | ||
| OS: | Unspecified | ||
| Whiteboard: | |||
| Fixed In Version: | openmpi-4.0.5-2.el8 | Doc Type: | If docs needed, set a value |
| Doc Text: | Story Points: | --- | |
| Clone Of: | Environment: | ||
| Last Closed: | 2021-05-18 14:44:44 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | Category: | --- | |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | |||
| Bug Depends On: | 1851724 | ||
| Bug Blocks: | |||
|
Description
Honggang LI
2020-10-28 01:10:41 UTC
Tested successfully with both the RDMA tier2 openmpi and ucx test suites on multiple hosts with:
1. MLX5 IB0
2. MLX5 ROCE
DISTRO=RHEL-8.4.0-20201128.n.0
+ [20-12-07 14:52:08] cat /etc/redhat-release
Red Hat Enterprise Linux release 8.4 Beta (Ootpa)
+ [20-12-07 14:52:08] uname -a
Linux rdma-dev-22.lab.bos.redhat.com 4.18.0-254.el8.x86_64 #1 SMP Thu Nov 26 08:47:50 EST 2020 x86_64 x86_64 x86_64 GNU/Linux
+ [20-12-07 14:52:08] cat /proc/cmdline
BOOT_IMAGE=(hd0,msdos1)/vmlinuz-4.18.0-254.el8.x86_64 root=/dev/mapper/rhel_rdma--dev--22-root ro intel_idle.max_cstate=0 processor.max_cstate=0 intel_iommu=on iommu=on console=tty0 rd_NO_PLYMOUTH crashkernel=auto resume=/dev/mapper/rhel_rdma--dev--22-swap rd.lvm.lv=rhel_rdma-dev-22/root rd.lvm.lv=rhel_rdma-dev-22/swap console=ttyS1,115200n81
+ [20-12-07 14:52:08] rpm -q rdma-core linux-firmware
rdma-core-32.0-1.el8.x86_64
Installed:
mpitests-openmpi-5.6.3-1.el8.x86_64 openmpi-4.0.5-2.el8.x86_64
Installed:
ucx-cma-1.9.0-1.el8.x86_64 ucx-ib-1.9.0-1.el8.x86_64
ucx-rdmacm-1.9.0-1.el8.x86_64
Test results:
Test results for ucx/ucx/ on rdma-dev-22:
4.18.0-254.el8.x86_64, rdma-core-32.0-1.el8, mlx5, ib0, & mlx5_1
Result | Status | Test
---------+--------+------------------------------------
PASS | 0 | install ucx
PASS | 0 | install ucx-cma ucx-ib ucx-rdmacm
PASS | 0 | ucx version info
PASS | 0 | ucx build info
PASS | 0 | ucx system info
PASS | 0 | ucx device info
PASS | 0 | ucx transport info - cma
PASS | 0 | ucx transport info - dc_mlx5
PASS | 0 | ucx transport info - posix
PASS | 0 | ucx transport info - rc_mlx5
PASS | 0 | ucx transport info - rc_verbs
PASS | 0 | ucx transport info - self
PASS | 0 | ucx transport info - sysv
PASS | 0 | ucx transport info - tcp
PASS | 0 | ucx transport info - ud_mlx5
PASS | 0 | ucx transport info - ud_verbs
PASS | 0 | ucx configuration info
PASS | 0 | ucp context info for a
PASS | 0 | ucp worker info for a
PASS | 0 | ucp endpoint config for a
PASS | 0 | ucp context info for r
PASS | 0 | ucp worker info for r
PASS | 0 | ucp endpoint config for r
PASS | 0 | ucp context info for t
PASS | 0 | ucp worker info for t
PASS | 0 | ucp endpoint config for t
PASS | 0 | ucp context info for w
PASS | 0 | ucp worker info for w
PASS | 0 | ucp endpoint config for w
PASS | 0 | ucp context info for ae
PASS | 0 | ucp worker info for ae
PASS | 0 | ucp endpoint config for ae
PASS | 0 | ucp context info for re
PASS | 0 | ucp worker info for re
PASS | 0 | ucp endpoint config for re
PASS | 0 | ucp context info for te
PASS | 0 | ucp worker info for te
PASS | 0 | ucp endpoint config for te
PASS | 0 | ucp context info for we
PASS | 0 | ucp worker info for we
PASS | 0 | ucp endpoint config for we
PASS | 0 | ucx type and struct info
PASS | 0 | ucx_perftest am_lat
PASS | 0 | ucx_perftest put_lat
PASS | 0 | ucx_perftest add_lat
PASS | 0 | ucx_perftest fadd
PASS | 0 | ucx_perftest cswap
PASS | 0 | ucx_perftest am_bw
PASS | 0 | ucx_perftest put_bw
PASS | 0 | ucx_perftest add_mr
PASS | 0 | ucx_perftest tag_lat
PASS | 0 | ucx_perftest tag_bw
PASS | 0 | ucx_perftest ucp_put_lat
PASS | 0 | ucx_perftest ucp_put_bw
PASS | 0 | ucx_perftest ucp_get
PASS | 0 | openmpi setup
PASS | 0 | openmpi built with ucx
PASS | 0 | openmpi ucx osu_bw
Checking for failures and known issues:
no test failures
---------------------------------
Test results for ucx/ucx/ on rdma-dev-22:
4.18.0-249.el8.x86_64, rdma-core-32.0-1.el8, mlx5, ib1, & mlx5_2
Result | Status | Test
---------+--------+------------------------------------
PASS | 0 | install ucx
PASS | 0 | install ucx-cma ucx-ib ucx-rdmacm
PASS | 0 | ucx version info
PASS | 0 | ucx build info
PASS | 0 | ucx system info
PASS | 0 | ucx device info
PASS | 0 | ucx transport info - cma
PASS | 0 | ucx transport info - dc_mlx5
PASS | 0 | ucx transport info - posix
PASS | 0 | ucx transport info - rc_mlx5
PASS | 0 | ucx transport info - rc_verbs
PASS | 0 | ucx transport info - self
PASS | 0 | ucx transport info - sysv
PASS | 0 | ucx transport info - tcp
PASS | 0 | ucx transport info - ud_mlx5
PASS | 0 | ucx transport info - ud_verbs
PASS | 0 | ucx configuration info
PASS | 0 | ucp context info for a
PASS | 0 | ucp worker info for a
PASS | 0 | ucp endpoint config for a
PASS | 0 | ucp context info for r
PASS | 0 | ucp worker info for r
PASS | 0 | ucp endpoint config for r
PASS | 0 | ucp context info for t
PASS | 0 | ucp worker info for t
PASS | 0 | ucp endpoint config for t
PASS | 0 | ucp context info for w
PASS | 0 | ucp worker info for w
PASS | 0 | ucp endpoint config for w
PASS | 0 | ucp context info for ae
PASS | 0 | ucp worker info for ae
PASS | 0 | ucp endpoint config for ae
PASS | 0 | ucp context info for re
PASS | 0 | ucp worker info for re
PASS | 0 | ucp endpoint config for re
PASS | 0 | ucp context info for te
PASS | 0 | ucp worker info for te
PASS | 0 | ucp endpoint config for te
PASS | 0 | ucp context info for we
PASS | 0 | ucp worker info for we
PASS | 0 | ucp endpoint config for we
PASS | 0 | ucx type and struct info
PASS | 0 | ucx_perftest am_lat
PASS | 0 | ucx_perftest put_lat
PASS | 0 | ucx_perftest add_lat
PASS | 0 | ucx_perftest fadd
PASS | 0 | ucx_perftest cswap
PASS | 0 | ucx_perftest am_bw
PASS | 0 | ucx_perftest put_bw
PASS | 0 | ucx_perftest add_mr
PASS | 0 | ucx_perftest tag_lat
PASS | 0 | ucx_perftest tag_bw
PASS | 0 | ucx_perftest ucp_put_lat
PASS | 0 | ucx_perftest ucp_put_bw
PASS | 0 | ucx_perftest ucp_get
PASS | 0 | openmpi setup
PASS | 0 | openmpi built with ucx
PASS | 0 | openmpi ucx osu_bw
Checking for failures and known issues:
no test failures
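The two summaries above list one row per test, with a result, an exit status, and a test name, followed by a "no test failures" check. As a rough illustration of how such a summary could be checked mechanically, here is a minimal Python sketch. It is not the harness that produced these logs. The helper names `parse_results` and `failures` are hypothetical, and so are the result labels other than `PASS`, since the logs above show only passing rows.

```python
import re

# Hypothetical row pattern for the "Result | Status | Test" lines above.
# Only PASS appears in the logs; FAIL, SKIP and WARN are assumed labels.
ROW = re.compile(r"^\s*(PASS|FAIL|SKIP|WARN)\s*\|\s*(\d+)\s*\|\s*(.+?)\s*$")


def parse_results(text):
    """Return a list of (result, status, test_name) tuples from a summary."""
    rows = []
    for line in text.splitlines():
        match = ROW.match(line)
        if match:  # header and separator lines do not match and are skipped
            result, status, name = match.groups()
            rows.append((result, int(status), name))
    return rows


def failures(rows):
    """Return rows whose result is not PASS or whose status is non-zero."""
    return [row for row in rows if row[0] != "PASS" or row[1] != 0]


# A short excerpt copied from the summary above.
sample = """\
Result | Status | Test
---------+--------+------------------------------------
PASS | 0 | install ucx
PASS | 0 | ucx_perftest am_lat
PASS | 0 | openmpi ucx osu_bw
"""

rows = parse_results(sample)
print(len(rows), "tests,", len(failures(rows)), "failures")
```

Treating a non-zero status as a failure even when the result column reads `PASS` is a deliberately conservative choice in this sketch. The actual harness may apply different rules, for example for known issues.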
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (RDMA stack bug fix and enhancement update), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHBA-2021:1594 |