
MPI

The following MPIs can be used with the ABCI system.

To use one of these libraries, you must configure your environment in advance with the module command. Running the module command on an interactive node sets the environment variables needed for compilation; running it on a compute node sets the environment variables for both compilation and execution.

[username@login1 ~]$ module load hpcx/2.20
[username@login1 ~]$ module load intel-mpi/2021.13

The following is a list of the MPI versions installed on the ABCI system.

NVIDIA HPC-X

Module Version | Open MPI Version | Compute Node (H)
2.20           | 4.1.7a1          | Yes

Using HPC-X

This section describes how to use the NVIDIA HPC-X module.

ABCI provides the following types of HPC-X modules. Please load the module that matches your application.

Module Name | Description
hpcx        | Standard
hpcx-mt     | Multi-threading support
hpcx-debug  | For debugging
hpcx-prof   | For profiling
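
A variant module is loaded in the same way as the standard module. For example (the version number below is assumed to match the hpcx module listed above):

[username@login1 ~]$ module load hpcx-mt/2.20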

When executing the mpirun or mpiexec command in a job, also specify a host file with the -hostfile option. The path to the host file is provided by the $PBS_NODEFILE environment variable.

[username@login1 ~]$ qsub -I -P groupname -q rt_HF -l select=2:mpiprocs=192 -l walltime=01:00:00
[username@hnode001 ~]$ module load hpcx/2.20
[username@hnode001 ~]$ mpirun -np 2 -map-by ppr:1:node -hostfile $PBS_NODEFILE ./hello_c
Hello, world, I am 0 of 2, (Open MPI v4.1.7a1, package: Open MPI root@hnode001 Distribution, ident: 4.1.7a1, repo rev: v4.1.5-115-g41ba5192d2, Unreleased developer copy, 141)
Hello, world, I am 1 of 2, (Open MPI v4.1.7a1, package: Open MPI root@hnode001 Distribution, ident: 4.1.7a1, repo rev: v4.1.5-115-g41ba5192d2, Unreleased developer copy, 141)
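
The same launch pattern can be used from a batch job. The following is a minimal sketch, assuming the qsub resource options shown above; the location of the module initialization script is an assumption.

#!/bin/bash
#PBS -P groupname
#PBS -q rt_HF
#PBS -l select=2:mpiprocs=192
#PBS -l walltime=01:00:00

cd ${PBS_O_WORKDIR}

# Assumed path of the module initialization script on compute nodes
source /etc/profile.d/modules.sh
module load hpcx/2.20

# 2 nodes x 192 processes per node
mpirun -np 384 -map-by ppr:192:node -hostfile $PBS_NODEFILE ./hello_c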

NVIDIA HPC-X provides the NCCL-SHARP plug-in. The NCCL version supported by the plug-in depends on the HPC-X version. See the table below for the compatibility between HPC-X and NCCL versions.

HPC-X Version | NCCL Version
2.20          | 2.23

For more information about NVIDIA HPC-X, please refer to the official documentation.

Intel MPI

Module Version | Compute Node (H)
2021.13        | Yes
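
The following is a minimal sketch of an interactive job with the intel-mpi module. The -n, -ppn, and -f options are standard Intel MPI Hydra launcher options; using them with $PBS_NODEFILE here is an assumption made by analogy with the HPC-X example above.

[username@login1 ~]$ qsub -I -P groupname -q rt_HF -l select=2:mpiprocs=192 -l walltime=01:00:00
[username@hnode001 ~]$ module load intel-mpi/2021.13
[username@hnode001 ~]$ mpiexec.hydra -n 2 -ppn 1 -f $PBS_NODEFILE ./hello_c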

How to change the number of InfiniBand NDR lanes

There are eight InfiniBand NDR HCAs on each compute node (H). On ABCI, the following configuration is used by default with the hpcx and intel-mpi modules.

  • The number of lanes is four for the Rendezvous protocol (large message sizes)
  • The number of lanes is one for the Eager protocol (small message sizes)

The number of lanes can be changed by setting UCX_MAX_RNDV_RAILS/UCX_MAX_EAGER_RAILS environment variables.

Info

The range of the above environment variables is 1-8.

The following is an example of use in an interactive job. In this example, the number of lanes is doubled for both protocols.

[username@login1 ~]$ qsub -I -P group -q rt_HF -l select=2:mpiprocs=8 -l walltime=1:0:0
[username@hnode001 ~]$ module load hpcx/2.20
[username@hnode001 ~]$ export UCX_MAX_RNDV_RAILS=8
[username@hnode001 ~]$ export UCX_MAX_EAGER_RAILS=2
[username@hnode001 ~]$ mpirun ./a.out
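
When launching with the intel-mpi module instead, the same UCX variables can be passed explicitly with the Hydra launcher's -genv option. This is a sketch that assumes the UCX defaults described above also apply to the intel-mpi module.

[username@hnode001 ~]$ module load intel-mpi/2021.13
[username@hnode001 ~]$ mpiexec.hydra -genv UCX_MAX_RNDV_RAILS 8 -genv UCX_MAX_EAGER_RAILS 2 ./a.out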

Additionally, a wrapper script like the following can be used to assign a unique NDR HCA to each process.

  • wrap.sh (hpcx)
#!/bin/sh

# Number of InfiniBand NDR HCAs per compute node (H)
NNDRS=8

# Assign one HCA to this process, round-robin by MPI rank
for i in $(seq 1 $NNDRS)
do
    if [ $((OMPI_COMM_WORLD_RANK%NNDRS)) -eq $((i-1)) ];then
        export UCX_NET_DEVICES=mlx5_ibn$i:1
    fi
done

# Replace the wrapper with the actual program
exec "$@"

The wrapper is invoked as follows:

mpirun -np $NP ./wrap.sh ./a.out
  • wrap.sh (intel-mpi)
#!/bin/sh

# Number of InfiniBand NDR HCAs per compute node (H)
NNDRS=8

# Assign one HCA to this process, round-robin by MPI rank
for i in $(seq 1 $NNDRS)
do
    if [ $((PMI_RANK%NNDRS)) -eq $((i-1)) ];then
        export UCX_NET_DEVICES=mlx5_ibn$i:1
    fi
done

# Replace the wrapper with the actual program
exec "$@"

The wrapper is invoked as follows:

mpiexec.hydra -np $NP ./wrap.sh ./a.out

Warn

Please specify the number of MPI processes with the mpiprocs (ppn) option of the qsub command.

Info

mlx5_ibn$i is the name of an NDR HCA.
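
To check which HCA device names are present on a compute node, the ibv_devices command from the libibverbs utilities lists the available devices (assuming the utility is installed on the compute nodes).

[username@hnode001 ~]$ ibv_devices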