mirror of
https://github.com/deepinsight/insightface.git
synced 2026-05-14 12:17:55 +00:00
925 B
925 B
Train
Requirements
python==3.6
cuda==10.1
cudnn==765
mxnet-cu101==1.6.0.post0
pip install easydict mxboard opencv-python tqdm
nccl
openmpi==4.0.0
horovod==0.19.2
Failures due to SSH issues
The host where horovodrun is executed must be able to SSH to all other hosts without any prompts.
Run with horovodrun
Typically one GPU will be allocated per process, so if a server has 8 GPUs, you will run 8 processes. In horovodrun, the number of processes is specified with the -np flag.
To run on a machine with 8 GPUs:
horovodrun -np 8 -H localhost:8 bash config.sh
To run on two machine with 16 GPUs:
horovodrun -np 16 -H ip1:8,ip2:8 bash config.sh
Run with mpi
bash run.sh