Bug 2015402
| Summary: | [RHEL-9.0.0/RDMA] update ucx | ||
|---|---|---|---|
| Product: | Red Hat Enterprise Linux 9 | Reporter: | Honggang LI <honli> |
| Component: | ucx | Assignee: | Jonathan Toppins <jtoppins> |
| Status: | CLOSED ERRATA | QA Contact: | Afom T. Michael <tmichael> |
| Severity: | unspecified | Docs Contact: | |
| Priority: | unspecified | ||
| Version: | 9.0 | CC: | bchae, rdma-dev-team, zguo |
| Target Milestone: | rc | Keywords: | Triaged |
| Target Release: | 9.0 | Flags: | pm-rhel:
mirror+
|
| Hardware: | Unspecified | ||
| OS: | Unspecified | ||
| Whiteboard: | |||
| Fixed In Version: | Doc Type: | If docs needed, set a value | |
| Doc Text: | Story Points: | --- | |
| Clone Of: | Environment: | ||
| Last Closed: | 2022-05-17 15:53:57 UTC | Type: | Bug |
| Regression: | --- | Mount Type: | --- |
| Documentation: | --- | CRM: | |
| Verified Versions: | Category: | --- | |
| oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | |
| Cloudforms Team: | --- | Target Upstream Version: | |
| Embargoed: | |||
|
Description
Honggang LI
2021-10-19 06:16:55 UTC
The lastest RHEL9.0 build, RHEL-9.0.0-20220103.2, was tested and verified for UCX on MLX5 IB and ROCE devices.
There were failures on ucx_perftests and will file new bugzilla for them.
1. build and packages
DISTRO=RHEL-9.0.0-20220103.2
+ [22-01-04 08:04:53] cat /etc/redhat-release
Red Hat Enterprise Linux release 9.0 Beta (Plow)
+ [22-01-04 08:04:53] uname -a
Linux rdma-dev-20.rdma.lab.eng.rdu2.redhat.com 5.14.0-39.el9.x86_64 #1 SMP PREEMPT Fri Dec 24 00:07:58 EST 2021 x86_64 x86_64 x86_64 GNU/Linux
+ [22-01-04 08:04:53] cat /proc/cmdline
BOOT_IMAGE=(hd0,msdos1)/vmlinuz-5.14.0-39.el9.x86_64 root=/dev/mapper/rhel_rdma--dev--20-root ro intel_idle.max_cstate=0 processor.max_cstate=0 intel_iommu=on iommu=on console=tty0 rd_NO_PLYMOUTH crashkernel=1G-4G:192M,4G-64G:256M,64G-:512M resume=/dev/mapper/rhel_rdma--dev--20-swap rd.lvm.lv=rhel_rdma-dev-20/root rd.lvm.lv=rhel_rdma-dev-20/swap console=ttyS1,115200n81
+ [22-01-04 08:04:53] rpm -q rdma-core linux-firmware
rdma-core-37.1-1.el9.x86_64
linux-firmware-20211027-123.el9.noarch
Installed:
ucx-cma-1.11.2-2.el9.x86_64 ucx-ib-1.11.2-2.el9.x86_64
ucx-rdmacm-1.11.2-2.el9.x86_64
Package ucx-1.11.2-2.el9.x86_64 is already installed.
Test results for ucx/ucx/ on rdma-dev-20:
5.14.0-39.el9.x86_64, rdma-core-37.1-1.el9, mlx5, ib0, & mlx5_2
Result | Status | Test
---------+--------+------------------------------------
PASS | 0 | install ucx
PASS | 0 | install ucx-cma ucx-ib ucx-rdmacm
PASS | 0 | ucx version info
PASS | 0 | ucx build info
PASS | 0 | ucx system info
PASS | 0 | ucx device info
PASS | 0 | ucx transport info - cma
PASS | 0 | ucx transport info - dc_mlx5
PASS | 0 | ucx transport info - posix
PASS | 0 | ucx transport info - rc_mlx5
PASS | 0 | ucx transport info - rc_verbs
PASS | 0 | ucx transport info - self
PASS | 0 | ucx transport info - sysv
PASS | 0 | ucx transport info - tcp
PASS | 0 | ucx transport info - ud_mlx5
PASS | 0 | ucx transport info - ud_verbs
PASS | 0 | ucx configuration info
PASS | 0 | ucp context info for a
PASS | 0 | ucp worker info for a
PASS | 0 | ucp endpoint config for a
PASS | 0 | ucp context info for r
PASS | 0 | ucp worker info for r
PASS | 0 | ucp endpoint config for r
PASS | 0 | ucp context info for t
PASS | 0 | ucp worker info for t
PASS | 0 | ucp endpoint config for t
PASS | 0 | ucp context info for m
PASS | 0 | ucp worker info for m
PASS | 0 | ucp endpoint config for m
PASS | 0 | ucp context info for ae
PASS | 0 | ucp worker info for ae
PASS | 0 | ucp endpoint config for ae
PASS | 0 | ucp context info for re
PASS | 0 | ucp worker info for re
PASS | 0 | ucp endpoint config for re
PASS | 0 | ucp context info for te
PASS | 0 | ucp worker info for te
PASS | 0 | ucp endpoint config for te
PASS | 0 | ucp context info for me
PASS | 0 | ucp worker info for me
PASS | 0 | ucp endpoint config for me
PASS | 0 | ucp context info for aw
PASS | 0 | ucp worker info for aw
PASS | 0 | ucp endpoint config for aw
PASS | 0 | ucp context info for rw
PASS | 0 | ucp worker info for rw
PASS | 0 | ucp endpoint config for rw
PASS | 0 | ucp context info for tw
PASS | 0 | ucp worker info for tw
PASS | 0 | ucp endpoint config for tw
PASS | 0 | ucp context info for mw
PASS | 0 | ucp worker info for mw
PASS | 0 | ucp endpoint config for mw
PASS | 0 | ucx type and struct info
FAIL | 255 | ucx_perftest am_lat
FAIL | 255 | ucx_perftest put_lat
FAIL | 255 | ucx_perftest add_lat
FAIL | 255 | ucx_perftest fadd
FAIL | 255 | ucx_perftest cswap
FAIL | 255 | ucx_perftest am_bw
FAIL | 255 | ucx_perftest put_bw
FAIL | 255 | ucx_perftest add_mr
PASS | 0 | ucx_perftest tag_lat
PASS | 0 | ucx_perftest tag_bw
PASS | 0 | ucx_perftest ucp_put_lat
PASS | 0 | ucx_perftest ucp_put_bw
PASS | 0 | ucx_perftest ucp_get
PASS | 0 | openmpi setup
PASS | 0 | openmpi built with ucx
PASS | 0 | openmpi ucx osu_bw
Since the problem described in this bug report should be resolved in a recent advisory, it has been closed with a resolution of ERRATA. For information on the advisory (new packages: RDMA stack), and where to find the updated files, follow the link below. If the solution does not work for you, open a new bug report. https://access.redhat.com/errata/RHEA-2022:3950 |