This repository has been archived by the owner on Mar 20, 2023. It is now read-only.
Default docker run `--shm-size=64MB` inadequate for some Intel MPI jobs
#8
When using the combined `shm:dapl` Intel MPI fabrics, the `/dev/shm` device is exposed through to the Docker container from the host. It is then used for intra-node MPI communications. Unfortunately, the default size of `/dev/shm` is restricted to 64MB and this is inadequate. The result is MPI applications that crash at random points.

Fix:
In `jobs.json`, set the `additional_docker_run_options` to be `--shm-size=256m` (or as appropriate).

https://github.com/Azure/batch-shipyard/blob/master/config_templates/jobs.json
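
For anyone who wants to apply this to an existing configuration, here is a minimal sketch in Python. It assumes that `additional_docker_run_options` is a list of strings on each task, and that tasks sit under a `job_specifications` list; check that layout against the template linked above before relying on it. The image name and command in the example are hypothetical placeholders.

```python
import copy
import json

SHM_OPTION_PREFIX = "--shm-size"


def set_shm_size(jobs_config, size="256m"):
    """Return a copy of jobs_config with --shm-size=<size> set on every task.

    Assumed layout (verify against the jobs.json template):
    {"job_specifications": [{"tasks": [{"additional_docker_run_options": [...]}]}]}
    """
    patched = copy.deepcopy(jobs_config)
    for job in patched.get("job_specifications", []):
        for task in job.get("tasks", []):
            options = task.get("additional_docker_run_options", [])
            # Drop any earlier --shm-size entry so the option is not duplicated.
            options = [o for o in options if not o.startswith(SHM_OPTION_PREFIX)]
            options.append(f"{SHM_OPTION_PREFIX}={size}")
            task["additional_docker_run_options"] = options
    return patched


# Hypothetical example configuration, for illustration only.
example = {
    "job_specifications": [
        {
            "id": "mpi-job",
            "tasks": [
                {"image": "hypothetical/mpi-image", "command": "mpirun ./app"}
            ],
        }
    ]
}

if __name__ == "__main__":
    print(json.dumps(set_shm_size(example), indent=2))
```

The function returns a patched copy rather than editing in place, so the original configuration can be kept for comparison. The right value for `size` depends on the application; 256m is only the starting point suggested above.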
This is more of a suggestion/warning than a bug report.
Joint work with @chrisrichardson.