Skip to content

Known Issues

date content status
2019/10/04 MPI_Allreduce provided by MVAPICH2-GDR 2.3.2 raises floating point exceptions in the following combinations of nodes, GPUs and message sizes when reduction between GPU memories is conducted.
Nodes: 28, GPU/Node: 4, Message size: 256KB
Nodes: 30, GPU/Node: 4, Message size: 256KB
Nodes: 33, GPU/Node: 4, Message size: 256KB
Nodes: 34, GPU/Node: 4, Message size: 256KB
Will be solved in the next version
2019/04/10 The following qsub option requires to specify argument due to job scheduler update (8.5.4 -> 8.6.3).
resource type ( -l rt_F etc)
$ qsub -g GROUP -l rt_F=1
$ qsub -g GROUP -l rt_G.small=1
close
2019/04/10 The following qsub option requires to specify argument due to job scheduler update (8.5.4 -> 8.6.3).
use BEEOND ( -l USE_BEEOND)
$ qsub -g GROUP -l rt_F=2 -l USE_BEEOND=1
close
2019/04/05 Due to job scheduler update (8.5.4 -> 8.6.3), a comupte node can execute only up to 2 jobs each resource type "rt_G.small" and "rt_C.small" (normally up to 4 jobs ).This situation also occures with Reservation service, so to be careful when you submit job with "rt_G.small" or "rt_C.small".
$ qsub -ar ARID -l rt_G.small=1 -g GROUP run.sh (x 3 times)
$ qstat
job-ID prior name user state
--------
478583 0.25586 sample.sh username r
478584 0.25586 sample.sh username r
478586 0.25586 sample.sh username qw
2019/10/04
close