
MPI-IO Support

The Message Passing Interface (MPI) Standard, maintained by the MPI Forum, includes a chapter on MPI-IO.

ROMIO is a well-known implementation of MPI-IO and is included in many MPI implementations. DAOS provides its own MPI-IO ROMIO ADIO driver. This driver has been merged into the upstream MPICH repository; see the adio/ad_daos section of the MPICH git repository for details.

Supported MPI Version

MPICH

The DAOS ROMIO ADIO driver has been accepted into MPICH. It is included in mpich-3.4.1 (released Jan 2021) and in mpich-3.4.2 (released May 2021).

Note

Starting with DAOS 1.2, the --svc parameter (number of service replicas) is no longer needed, and the DAOS API has been changed accordingly. Patches have been contributed to MPICH that detect the DAOS API version to gracefully handle this change. MPICH 3.4.2 includes those changes, and works out of the box with DAOS 2.0. MPICH 3.4.1 does not include those changes. Please check the latest commits here for information on how to apply those changes to MPICH 3.4.1.

To build MPICH, including ROMIO with the DAOS ADIO driver:

export MPI_LIB=""

git clone https://github.com/pmodels/mpich

cd mpich

./autogen.sh

./configure --prefix=dir --enable-fortran=all --enable-romio \
 --enable-cxx --enable-g=all --enable-debuginfo --with-device=ch3:nemesis \
 --with-file-system=ufs+daos --with-daos=/usr

make -j8 && make install

This assumes that DAOS is installed into the /usr tree, which is the case for the DAOS RPM installation. Other configure options, such as the network communication device or Fortran support, can be added, modified, or removed as needed. For those, please consult the MPICH user guide.

In the environment where you build client applications or libraries that use MPI, set PATH and LD_LIBRARY_PATH to point to the installed MPICH.
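As a minimal sketch of that step, the following assumes MPICH was installed under a prefix held in MPICH_PREFIX. That variable name and its default value are hypothetical; use whatever directory was passed to --prefix at configure time.

```shell
# MPICH_PREFIX is a hypothetical variable for illustration; it should match
# the directory given to ./configure --prefix=... when MPICH was built.
MPICH_PREFIX="${MPICH_PREFIX:-$HOME/mpich-install}"

# Put the MPICH compiler wrappers and launcher first in the search path.
export PATH="$MPICH_PREFIX/bin:$PATH"

# Prepend the MPICH libraries, keeping any existing search path entries.
export LD_LIBRARY_PATH="$MPICH_PREFIX/lib${LD_LIBRARY_PATH:+:$LD_LIBRARY_PATH}"

# Warn (without failing) if the mpicc wrapper is still not visible.
command -v mpicc >/dev/null 2>&1 || echo "warning: mpicc not found in $MPICH_PREFIX/bin" >&2
```

The `${LD_LIBRARY_PATH:+:...}` expansion avoids leaving a trailing colon when the variable was previously empty.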

Intel MPI

The Intel MPI Library includes DAOS support since the 2019.8 release.

Note that Intel MPI uses libfabric and includes it as part of the Intel MPI installation:

  • 2019.8 and 2019.9 include libfabric-1.10.1-impi
  • 2021.1, 2021.2 and 2021.3 include libfabric-1.12.1-impi

Care must be taken to ensure that the version of libfabric that is used is at a level that includes the patches that are critical for DAOS. DAOS 1.0.1 includes libfabric-1.9.0, DAOS 1.2 includes libfabric-1.12, DAOS 2.0 includes libfabric-1.14, and DAOS 2.2 includes libfabric-1.15.

To use DAOS with Intel MPI, the libfabric that is supplied by DAOS (and that is installed into /usr/lib64 by default) must be used. Intel MPI provides a mechanism to indicate that the Intel MPI version of libfabric should not be used, by setting this variable before loading the Intel MPI environment:

export I_MPI_OFI_LIBRARY_INTERNAL=0

This is normally sufficient to ensure that the libfabric provided by DAOS is used. Depending on how the environment is set up, it may be necessary to add the system library search path back as the first path in the library search path:

export LD_LIBRARY_PATH="/usr/lib64/:$LD_LIBRARY_PATH"

There are other environment variables that need to be set on the client side to ensure proper functionality with the DAOS MPIIO driver, including:

export FI_UNIVERSE_SIZE=16383
export FI_OFI_RXM_USE_SRX=1
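The ordering of these settings matters, so the sketch below gathers them into one sequence. How the Intel MPI environment is loaded is site-specific (a module, an environment script, and so on), so that step is left as a comment rather than guessed at.

```shell
# 1. Tell Intel MPI not to use its bundled libfabric. This must be set
#    before the Intel MPI environment is loaded.
export I_MPI_OFI_LIBRARY_INTERNAL=0

# 2. Load the Intel MPI environment here (site-specific step, intentionally
#    left out of this sketch).

# 3. If needed, put the system library directory (where DAOS installs its
#    libfabric by default) back at the front of the search path.
export LD_LIBRARY_PATH="/usr/lib64/${LD_LIBRARY_PATH:+:$LD_LIBRARY_PATH}"

# 4. Client-side settings for the DAOS MPI-IO driver.
export FI_UNIVERSE_SIZE=16383
export FI_OFI_RXM_USE_SRX=1
```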

Open MPI

Open MPI 4.0.5 does not yet provide DAOS support. Since one of its MPI-IO implementations is based on ROMIO, it will likely pick up DAOS support in an upcoming release.

MVAPICH2

MVAPICH2 2.3.4 does not yet provide DAOS support. Since its MPI-IO implementation is based on ROMIO, it will likely pick up DAOS support in an upcoming release.

Testing MPI-IO with DAOS

Build any client (HDF5, ior, MPI test suites) normally with the mpicc command and the MPICH library installed above.

Using the UNS

DAOS UNS (Unified Namespace) encodes pool and container information into a path on the file system, so the container can be accessed through that path instead of being addressed explicitly by pool and container UUIDs or labels.

Create a container with a path on dfuse or Lustre, or any file system that supports extended attributes:

daos cont create mypool --label mycont --path=/mnt/dfuse/ --type POSIX

Then, using that path, one can start creating files with the DAOS MPIIO driver by prepending daos: to the filename/path. For example:

daos:/mnt/dfuse/file
daos:/mnt/dfuse/dir1/file
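When driving tests from scripts, it can help to build these names in one place. The helper below is a hypothetical convenience function, not part of DAOS or MPICH; it only adds the daos: prefix to a path when the prefix is missing.

```shell
# daos_mpiio_name is a hypothetical helper for illustration only.
# It prints the given path with the "daos:" prefix expected by the
# DAOS MPIIO driver, leaving already-prefixed names unchanged.
daos_mpiio_name() {
    case "$1" in
        daos:*) printf '%s\n' "$1" ;;
        *)      printf 'daos:%s\n' "$1" ;;
    esac
}

# Example: build the file name for a file under the UNS path.
testfile="$(daos_mpiio_name /mnt/dfuse/dir1/file)"
echo "$testfile"
```

The resulting string is what would be handed to an MPI-IO application as its file name, for instance as the output file of an ior run that uses its MPIIO backend.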

Using a Prefix Environment Variable

Another way to use the DAOS MPIIO driver is using an environment variable to set the prefix itself for the file:

export DAOS_UNS_PREFIX="path"

That prefix path can be:

1. The UNS prefix, if one exists (similar to the UNS mode above): /mnt/dfuse
2. A direct path using the pool and container label (or UUID): daos://pool/container/

One can then specify the path to the file relative to the root of the container set in the prefix. For example, if the file to be accessed is under /dir1 in the container, one would pass daos:/dir1/file to MPI_File_open().

Using Pool and Container Environment Variables

This mode is meant only for quick testing of the MPIIO DAOS driver. It bypasses the UNS and sets up direct access through pool and container environment variables. On the client side, the following environment variables need to be set:

export DAOS_POOL={uuid/label}
export DAOS_CONT={uuid/label}
export DAOS_BYPASS_DUNS=1

The user still needs to prepend the daos: prefix to the file name passed to MPI_File_open().
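Because a missing variable in this mode is easy to overlook, a launch script can check the environment first. The sketch below uses a hypothetical helper, check_daos_bypass_env, and reuses the mypool and mycont labels from the container creation example as placeholder values.

```shell
# Placeholder values; replace with the real pool and container UUIDs or labels.
export DAOS_POOL=mypool
export DAOS_CONT=mycont
export DAOS_BYPASS_DUNS=1

# check_daos_bypass_env is a hypothetical helper for illustration only.
# It returns non-zero and names the first variable that is unset or empty.
check_daos_bypass_env() {
    for _name in DAOS_POOL DAOS_CONT DAOS_BYPASS_DUNS; do
        eval "_value=\${$_name:-}"
        if [ -z "$_value" ]; then
            echo "error: $_name is not set" >&2
            return 1
        fi
    done
    return 0
}

# Only continue to the MPI launch step if the environment is complete.
if check_daos_bypass_env; then
    echo "DAOS bypass environment is set"
fi
```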

Known limitations

Limitations of the current implementation include:

  • No support for MPI file atomicity, preallocate, or shared file pointers.