Bug 1839571

Summary: building MPI packages can fail due to oversubscription
Product: [Fedora] Fedora Reporter: david08741
Component: openmpiAssignee: Doug Ledford <dledford>
Status: CLOSED RAWHIDE QA Contact: Fedora Extras Quality Assurance <extras-qa>
Severity: unspecified Docs Contact:
Priority: unspecified    
Version: rawhideCC: dakingun, dledford, hladky.jiri, orion, pkfed
Target Milestone: ---   
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: openmpi-4.0.4-0.2.rc1.fc33 Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2020-05-24 22:56:53 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description david08741 2020-05-24 19:11:27 UTC
Description of problem:
(re)building mpi packages can fail with openmpi, as openmpi does not run by default if oversubscribed. This can be avoided by exporting the env variable 
export OMPI_MCA_rmaps_base_oversubscribe=yes

Typically that is required in the check section, but sometimes it works fine on koji, but fails on copr, as copr has fewer cores available, the recent sundials update is an example.

Rather then ensuring that every package exports this in the %check section, openmpi could set this flag in %_openmpi_load

Version-Release number of selected component (if applicable):
current rawhide

How reproducible:
if check oversubscribes, always

Additional info:
Normally this is good to not oversubscribe, but we only want to check the package works, and oversubscription is generally accepted for testing.

devel thread that inspired this:
https://lists.fedoraproject.org/archives/list/devel@lists.fedoraproject.org/thread/M4EZN5P6SYNLSL5NQZM6EG32TQVEUCJN/

Comment 1 Orion Poplawski 2020-05-24 22:56:53 UTC
I like it.  Let's give it a try.  Thanks.