mpi4py fails to build with Python 3.8.0b1. ====================================================================== FAIL: testCompareAndSwap (test_rma.TestRMASelf) ---------------------------------------------------------------------- Traceback (most recent call last): File "test/test_rma.py", line 228, in testCompareAndSwap self.assertEqual(rbuf[1], -1) AssertionError: 0 != -1 ====================================================================== FAIL: testFetchAndOp (test_rma.TestRMASelf) ---------------------------------------------------------------------- Traceback (most recent call last): File "test/test_rma.py", line 190, in testFetchAndOp self.assertEqual(rbuf[1], -1) AssertionError: 47 != -1 ====================================================================== FAIL: testCompareAndSwap (test_rma.TestRMAWorld) ---------------------------------------------------------------------- Traceback (most recent call last): File "test/test_rma.py", line 228, in testCompareAndSwap self.assertEqual(rbuf[1], -1) AssertionError: 0 != -1 ====================================================================== FAIL: testFetchAndOp (test_rma.TestRMAWorld) ---------------------------------------------------------------------- Traceback (most recent call last): File "test/test_rma.py", line 190, in testFetchAndOp self.assertEqual(rbuf[1], -1) AssertionError: -69 != -1 ---------------------------------------------------------------------- Ran 1102 tests in 7.053s FAILED (failures=4, skipped=46) -------------------------------------------------------------------------- Primary job terminated normally, but 1 process returned a non-zero exit code. Per user-direction, the job has been aborted. -------------------------------------------------------------------------- -------------------------------------------------------------------------- mpiexec detected that one or more processes exited with non-zero status, thus causing the job to be terminated. The first process to do so was: Process name: [[14054,1],0] Exit code: 1 -------------------------------------------------------------------------- [1562606212.394756] [22b764337d874def8754761c8bb283ea:4889 :0] sys.c:618 UCX ERROR shmget(size=2097152 flags=0xfb0) for mm_recv_desc failed: Operation not permitted, please check shared memory limits by 'ipcs -l' [1562606212.540543] [22b764337d874def8754761c8bb283ea:4889 :0] sys.c:618 UCX ERROR shmget(size=2097152 flags=0xfb0) for mm_recv_desc failed: Operation not permitted, please check shared memory limits by 'ipcs -l' [1562606213.398255] [22b764337d874def8754761c8bb283ea:4889 :0] sys.c:618 UCX ERROR shmget(size=2097152 flags=0xb80) for ucp_am_bufs failed: Operation not permitted, please check shared memory limits by 'ipcs -l' This might actually be a copr problem, not sure. Let me know if you cannot reproduce it outside of mock. For the build logs, see: https://copr-be.cloud.fedoraproject.org/results/@python/python3.8/fedora-rawhide-x86_64/00964785-mpi4py/ For all our attempts to build mpi4py with Python 3.8, see: https://copr.fedorainfracloud.org/coprs/g/python/python3.8/package/mpi4py/ Testing and mass rebuild of packages is happening in copr. You can follow these instructions to test locally in mock if your package builds with Python 3.8: https://copr.fedorainfracloud.org/coprs/g/python/python3.8/ Let us know here if you have any questions.
Zbyszek, would you be able to help here?
It fails the same in normal rawhide on amd64. No idea. I'll update mpich to the lastest version, maybe that'll help.
[1564354515.215559] [08dfc006c2a24ed0bf7d9276d6077ef3:4889 :0] sys.c:618 UCX ERROR shmget(size=2097152 flags=0xfb0) for mm_recv_desc failed: Operation not permitted, please check shared memory limits by 'ipcs -l' [1564354515.363010] [08dfc006c2a24ed0bf7d9276d6077ef3:4889 :0] sys.c:618 UCX ERROR shmget(size=2097152 flags=0xfb0) for mm_recv_desc failed: Operation not permitted, please check shared memory limits by 'ipcs -l' [1564354516.244026] [08dfc006c2a24ed0bf7d9276d6077ef3:4889 :0] sys.c:618 UCX ERROR shmget(size=2097152 flags=0xb80) for ucp_am_bufs failed: Operation not permitted, please check shared memory limits by 'ipcs -l' This might be the cause. But I get the same failure on my machine, and it seems the limits are very high: $ ipcs -l ------ Messages Limits -------- max queues system wide = 32000 max size of message (bytes) = 8192 default max size of queue (bytes) = 16384 ------ Shared Memory Limits -------- max number of segments = 4096 max seg size (kbytes) = 18014398509465599 max total shared memory (kbytes) = 18014398509481980 min seg size (bytes) = 1 ------ Semaphore Limits -------- max number of arrays = 32000 max semaphores per array = 32000 max semaphores system wide = 1024000000 max ops per semop call = 500 semaphore max value = 32767
python3-mpich-3.1.1-1.fc31.x86_64 makes no difference ;(
https://bitbucket.org/mpi4py/mpi4py/issues/138/test-failure-in-fedora-rawhide
I made the build pass by ignoring the test failures. I don't think we gain much by keeping the package in FTBFS state. Maybe upstream will know how to fix this.