Bug 1931394

Summary: [RHEL8.4] pyverbs-tests failed with 3 errors and 3 failures when tested on HFI OPA devices
Product: Red Hat Enterprise Linux 8 Reporter: Brian Chae <bchae>
Component: rdma-coreAssignee: Nobody <nobody>
Status: CLOSED WONTFIX QA Contact: Brian Chae <bchae>
Severity: unspecified Docs Contact:
Priority: unspecified    
Version: 8.4CC: bchae, hwkernel-mgr, rdma-dev-team
Target Milestone: rcKeywords: Triaged
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2022-08-22 07:28:28 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 1903942    

Description Brian Chae 2021-02-22 11:03:32 UTC
Description of problem:

When run over HFI OPA devices, like over hosts, rdma-qe-14 and rdma-qe-15, pyverbs-tests failed with 3 errors and 3 failures.

Version-Release number of selected component (if applicable):

Clients: rdma-qe-14
Servers: rdma-qe-15

DISTRO=RHEL-8.4.0-20210216.n.0

Red Hat Enterprise Linux release 8.4 Beta (Ootpa)

Linux rdma-qe-15.lab.bos.redhat.com 4.18.0-287.el8.x86_64 #1 SMP Thu Feb 11 03:56:04 EST 2021 x86_64 x86_64 x86_64 GNU/Linux
BOOT_IMAGE=(hd0,msdos1)/vmlinuz-4.18.0-287.el8.x86_64 root=UUID=1c0c36bb-b794-465b-b3e5-0528e7edf191 ro intel_idle.max_cstate=0 processor.max_cstate=0 console=tty0 rd_NO_PLYMOUTH intel_iommu=on crashkernel=auto resume=UUID=0dd624ff-d84b-49e3-b239-f891043ac206 console=ttyS1,115200

rdma-core-32.0-4.el8.x86_64

linux-firmware-20201218-102.git05789708.el8.noarch
1.27.0

02:00.0 Ethernet controller: Broadcom Inc. and subsidiaries NetXtreme BCM5720 2-port Gigabit Ethernet PCIe
02:00.1 Ethernet controller: Broadcom Inc. and subsidiaries NetXtreme BCM5720 2-port Gigabit Ethernet PCIe
03:00.0 Ethernet controller: Broadcom Inc. and subsidiaries NetXtreme BCM5720 2-port Gigabit Ethernet PCIe
03:00.1 Ethernet controller: Broadcom Inc. and subsidiaries NetXtreme BCM5720 2-port Gigabit Ethernet PCIe
04:00.0 Fabric controller: Intel Corporation Omni-Path HFI Silicon 100 Series [discrete] (rev 10)

How reproducible:

100%

Steps to Reproduce:

1. With the above RHEL8.4 build, install the following packages on both server and client hosts

    python3-pyverbs-32.0-4.el8.x86_64  

2. execute the pyverbs tests

    ./run_tests.py -v --dev $HCA_ID
        
         <HCA_ID: mlx5_bond_0>

Actual results:



======================================================================
ERROR: test_rdmacm_async_ex_multicast_traffic (tests.test_rdmacm.CMTestCase)
----------------------------------------------------------------------
Traceback (most recent call last):
  File "./utils.py", line 718, in inner
    return func(instance)
  File "./test_rdmacm.py", line 162, in test_rdmacm_async_ex_multicast_traffic
    port_space=ce.RDMA_PS_UDP, extended=True)
  File "./test_rdmacm.py", line 66, in two_nodes_rdmacm_traffic
    raise PyverbsError(res)
pyverbs.pyverbs_error.PyverbsError: Caught exception in passive side process: pid 23486
Exception message: Expected this event: 12, got this event: RDMA_CM_EVENT_MULTICAST_ERROR

======================================================================
ERROR: test_rdmacm_async_multicast_traffic (tests.test_rdmacm.CMTestCase)
----------------------------------------------------------------------
Traceback (most recent call last):
  File "./utils.py", line 718, in inner
    return func(instance)
  File "./test_rdmacm.py", line 156, in test_rdmacm_async_multicast_traffic
    port_space=ce.RDMA_PS_UDP)
  File "./test_rdmacm.py", line 66, in two_nodes_rdmacm_traffic
    raise PyverbsError(res)
pyverbs.pyverbs_error.PyverbsError: Caught exception in active side process: pid 23499
Exception message: Expected this event: 12, got this event: RDMA_CM_EVENT_MULTICAST_ERROR

======================================================================
ERROR: test_rdmacm_async_udp_traffic (tests.test_rdmacm.CMTestCase)
----------------------------------------------------------------------
Traceback (most recent call last):
  File "./test_rdmacm.py", line 170, in test_rdmacm_async_udp_traffic
    port_space=ce.RDMA_PS_UDP)
  File "./test_rdmacm.py", line 66, in two_nodes_rdmacm_traffic
    raise PyverbsError(res)
pyverbs.pyverbs_error.PyverbsError: Caught exception in active side process: pid 23547
Exception message: Expected this event: 2, got this event: RDMA_CM_EVENT_ROUTE_ERROR

======================================================================
FAIL: test_query_rc_qp (tests.test_qp.QPTest)
----------------------------------------------------------------------
Traceback (most recent call last):
  File "./test_qp.py", line 223, in test_query_rc_qp
    self.query_qp_common_test(e.IBV_QPT_RC)
  File "./test_qp.py", line 216, in query_qp_common_test
    self.verify_qp_attrs(caps, e.IBV_QPS_RESET, qp_init_attr, qp_attr)
  File "./test_qp.py", line 186, in verify_qp_attrs
    self.assertLessEqual(orig_cap.max_inline_data, init_attr.cap.max_inline_data)
AssertionError: 5 not less than or equal to 0

======================================================================
FAIL: test_query_uc_qp (tests.test_qp.QPTest)
----------------------------------------------------------------------
Traceback (most recent call last):
  File "./test_qp.py", line 230, in test_query_uc_qp
    self.query_qp_common_test(e.IBV_QPT_UC)
  File "./test_qp.py", line 209, in query_qp_common_test
    self.verify_qp_attrs(caps, e.IBV_QPS_RESET, qp_init_attr, qp_attr)
  File "./test_qp.py", line 186, in verify_qp_attrs
    self.assertLessEqual(orig_cap.max_inline_data, init_attr.cap.max_inline_data)
AssertionError: 13 not less than or equal to 0

======================================================================
FAIL: test_query_ud_qp (tests.test_qp.QPTest)
----------------------------------------------------------------------
Traceback (most recent call last):
  File "./test_qp.py", line 237, in test_query_ud_qp
    self.query_qp_common_test(e.IBV_QPT_UD)
  File "./test_qp.py", line 209, in query_qp_common_test
    self.verify_qp_attrs(caps, e.IBV_QPS_RESET, qp_init_attr, qp_attr)
  File "./test_qp.py", line 186, in verify_qp_attrs
    self.assertLessEqual(orig_cap.max_inline_data, init_attr.cap.max_inline_data)
AssertionError: 5 not less than or equal to 0

----------------------------------------------------------------------
Ran 129 tests in 16.783s

FAILED (failures=3, errors=3, skipped=75)
---
- TEST RESULT FOR tests
-   Test:   Run pyverbs tests
-   Result: FAIL
-   Return: 1
---
Expected results:


Additional info:

Comment 4 RHEL Program Management 2022-08-22 07:28:28 UTC
After evaluating this issue, there are no plans to address it further or fix it in an upcoming release.  Therefore, it is being closed.  If plans change such that this issue will be fixed in an upcoming release, then the bug can be reopened.