Update README.md

2025-12-30 08:02:27 +00:00 · 2018-01-24 09:33:34 +08:00
parent d814f2e967
commit e0da1a9440
1 changed files with 173 additions and 65 deletions
--- a/README.md
+++ b/README.md
@@ -1,72 +1,180 @@
-# InsightFace
-Face Recognition Project
+
+# *InsightFace* : Implementation for paper 'ArcFace: Additive Angular Margin Loss for Deep Face Recognition'
+
+  Paper by Jiankang Deng, Jia Guo, and Stefanos Zafeiriou
+
+### License
+
+   InsightFace is released under the MIT License.
+
+### Contents
+0. [Introduction](#introduction)
+0. [Citation](#citation)
+0. [Requirements](#requirements)
+0. [Installation](#installation)
+0. [Usage](#usage)
+0. [Models](#models)
+0. [Results](#results)
+0. [Contribution](#contribution)
+0. [Contact](#contact)
+
+
+   ### Introduction
+
+   The repository contains the entire pipeline for deep face recognition with **`ArcFace`** and other popular methods, including Softmax, Triplet Loss, SphereFace, and AMSoftmax/CosineFace.
+
+   **ArcFace** is a recently proposed face recognition method, first described in an [arXiv technical report](https://arxiv.org/abs/1801.07698). With ArcFace and this repository, a single model can reach 99.80%+ on LFW and 98%+ on MegaFace.
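
To make the "additive angular margin" concrete, here is a minimal sketch of how the target-class logit changes under ArcFace compared with AMSoftmax/CosineFace. The function names are hypothetical and the scale `s=64.0` is an assumption (this README does not state it); the margin defaults mirror the `--margin-m` values used in the training examples. It ignores the edge case where `theta + m` exceeds pi, which a real implementation would need to handle.

```python
import math


def arcface_logit(cos_theta, m=0.5, s=64.0):
    """Target-class logit with an additive angular margin: s * cos(theta + m).

    s=64.0 is an assumed scale, not a value taken from this README.
    """
    # Clamp to the valid acos domain to guard against rounding error.
    theta = math.acos(max(-1.0, min(1.0, cos_theta)))
    return s * math.cos(theta + m)


def cosface_logit(cos_theta, m=0.35, s=64.0):
    """Target-class logit with an additive cosine margin: s * (cos(theta) - m)."""
    return s * (cos_theta - m)


if __name__ == "__main__":
    # Compare the penalised logits with the plain scaled cosine.
    for c in (0.9, 0.5, 0.1):
        print(c, 64.0 * c, arcface_logit(c), cosface_logit(c))
```

Both margins lower the logit of the correct class, so the network has to pull same-identity embeddings closer together to reach the same loss.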
+
+   We provide a refined MS1M dataset for training here, already packed in MXNet binary format. It lets researchers and industrial engineers develop a deep face recognizer in just two steps: 1. Download the binary dataset; 2. Run the training script.
+
+   InsightFace supports several popular network backbones, each selected with a single parameter. The list so far:
+
+- ResNet
+
+- MobileNet
+
+- InceptionResNetV2
+
+- DPN
+
+- DenseNet
+
+ In our paper, we found overlapping identities between the FaceScrub dataset and the MegaFace distractors, which greatly affect identification performance. Removing these overlaps can sometimes yield an improvement of more than 10 percent. The list of overlaps will be made public soon in this repository.
+
+
+   ArcFace achieves state-of-the-art identification performance in the MegaFace Challenge, at 98%+.
+
+
+   ### Citation
+
+   If you find **InsightFace/ArcFace** useful in your research, please consider citing our paper.
+
+   	
+
+  
+
+
+   ### Requirements
+      1. The only requirement is `MXNet` with GPU support (Python 2.7).
+
+   ### Installation
+   1. Install MXNet with pip:
+
+       ```
+       pip install mxnet-cu80
+       ```
+
+     
+
+   2. Clone the InsightFace repository. We'll refer to the directory into which you cloned InsightFace as **`INSIGHTFACE_ROOT`**.
+
+       ```Shell
+       git clone --recursive https://github.com/deepinsight/insightface.git
+       ```
+      
+
+
+   ### Usage
+
+   *After successfully completing the [installation](#installation)*, you are ready to run all the following experiments.
+
+   #### Part 1: Dataset Downloading.
+   **Note:** In this part, we assume you are in the directory **`$INSIGHTFACE_ROOT/`**
+   1. Download the training set (`MS1M`) from [here] and place it in **`datasets/`**. Each training dataset includes the following 7 files:
+
+      ```Shell
+      	- train.idx
+      	- train.rec
+      	- property
+      	- lfw.bin
+      	- cfp_ff.bin
+      	- cfp_fp.bin
+      	- agedb_30.bin
+      ```
+       The first three files are the dataset itself, while the last four are binary verification sets.
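
Before starting a long training run, it can help to confirm that all seven files are in place. The following is a minimal sketch, not part of the repository: the helper name `missing_dataset_files` is hypothetical, and the file names are taken from the list above.

```python
from pathlib import Path

# File names as listed in this README: three dataset files, four verification sets.
EXPECTED_FILES = (
    "train.idx",
    "train.rec",
    "property",
    "lfw.bin",
    "cfp_ff.bin",
    "cfp_fp.bin",
    "agedb_30.bin",
)


def missing_dataset_files(dataset_dir):
    """Return the expected file names that are not present in dataset_dir."""
    root = Path(dataset_dir)
    return [name for name in EXPECTED_FILES if not (root / name).is_file()]


if __name__ == "__main__":
    # Directory name as used in the training commands of this README.
    missing = missing_dataset_files("datasets/faces_ms1mr_112x112")
    print("missing files:", missing if missing else "none")
```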
+
+   #### Part 2: Train
+   **Note:** In this part, we assume you are in the directory **`$INSIGHTFACE_ROOT/src/`**. Before starting any training run, make sure the MXNet environment variables below are set so that training performs well.
+
+```
+export MXNET_CPU_WORKER_NTHREADS=24
+export MXNET_ENGINE_TYPE=ThreadedEnginePerDevice
+```
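
If training is launched from a Python wrapper rather than a shell, the same two variables can be passed to the child process. This is a minimal sketch under that assumption; the helper name `mxnet_env` is hypothetical, and the values are the ones shown above.

```python
import os


def mxnet_env(cpu_workers=24, engine="ThreadedEnginePerDevice", base=None):
    """Return a copy of the environment with the two MXNet variables set."""
    env = dict(os.environ if base is None else base)
    env["MXNET_CPU_WORKER_NTHREADS"] = str(cpu_workers)
    env["MXNET_ENGINE_TYPE"] = engine
    return env


if __name__ == "__main__":
    env = mxnet_env()
    print(env["MXNET_CPU_WORKER_NTHREADS"], env["MXNET_ENGINE_TYPE"])
```

The returned dictionary can be handed to `subprocess.run(..., env=mxnet_env())`, so the variables are already set when the training script starts.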
+
+ Some examples follow. All of our experiments were run on Tesla P40 GPUs.
+
+   1. Train ArcFace with LResNet100E-IR.
+
+      ```Shell
+      CUDA_VISIBLE_DEVICES='0,1,2,3' python -u train_softmax.py --data-dir ../datasets/faces_ms1mr_112x112 --network r100 --loss-type 4 --prefix ../model-r100-arcface --margin-m 0.5
+      ```
+      It will output verification results of *LFW*, *CFP-FF*, *CFP-FP* and *AgeDB-30* every 2000 batches. You can check all command line options in **train\_softmax.py**.
+
+      This model can achieve **LFW 99.80+ and MegaFace 98.0%+**
+
+   2. Train AMSoftmax/CosineFace with LResNet50E-IR.
+
+      ```Shell
+      CUDA_VISIBLE_DEVICES='0,1,2,3' python -u train_softmax.py --data-dir ../datasets/faces_ms1mr_112x112 --network r50 --loss-type 2 --prefix ../model-r50-amsoftmax --margin-m 0.35
+      ```
+
+   3. Train Softmax with LMobileNetE.
+
+      ```Shell
+      CUDA_VISIBLE_DEVICES='0,1,2,3' python -u train_softmax.py --data-dir ../datasets/faces_ms1mr_112x112 --network m1 --loss-type 0 --prefix ../model-m1-softmax
+      ```
+
+4. Re-train the above Softmax model with Triplet loss.
+   ```Shell
+   CUDA_VISIBLE_DEVICES='0,1,2,3' python -u train_softmax.py --data-dir ../datasets/faces_ms1mr_112x112 --network m1 --loss-type 12 --pretrained ../model-m1-softmax,50 --prefix ../model-m1-triplet --lr 0.005 --mom 0.0 --per-batch-size 150
+   ```
+
+5. Train Softmax with LDPN107E.
+
+      ```Shell
+      CUDA_VISIBLE_DEVICES='0,1,2,3,4,5,6,7' python -u train_softmax.py --data-dir ../datasets/faces_vgg_112x112 --network p107 --loss-type 0 --prefix ../model-p107-softmax --per-batch-size 64
+      ```
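
The five commands above differ mainly in `--network`, `--loss-type`, `--prefix`, and `--margin-m`. As a hedged illustration, the sketch below assembles such a command from named options. The helper `build_train_command` and the `LOSS_TYPES` names are hypothetical; the numeric loss types and flags are only those that appear in the examples above.

```python
# Loss-type numbers as used in the example commands of this README.
LOSS_TYPES = {"softmax": 0, "amsoftmax": 2, "arcface": 4, "triplet": 12}


def build_train_command(network, loss, prefix,
                        data_dir="../datasets/faces_ms1mr_112x112",
                        margin_m=None, extra=()):
    """Build the argument list for train_softmax.py from named options."""
    cmd = [
        "python", "-u", "train_softmax.py",
        "--data-dir", data_dir,
        "--network", network,
        "--loss-type", str(LOSS_TYPES[loss]),
        "--prefix", prefix,
    ]
    if margin_m is not None:
        cmd += ["--margin-m", str(margin_m)]
    # Extra flags (for example --lr or --per-batch-size) are passed through unchanged.
    cmd += list(extra)
    return cmd


if __name__ == "__main__":
    print(" ".join(build_train_command(
        "r100", "arcface", "../model-r100-arcface", margin_m=0.5)))
```

The resulting list can be passed to `subprocess.run` from `$INSIGHTFACE_ROOT/src/`, with `CUDA_VISIBLE_DEVICES` set in the environment as in the shell examples.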
+
+   #### Part 3: MegaFace Test
+
+   **Note:** In this part, we assume you are in the directory **`$INSIGHTFACE_ROOT/src/megaface/`**
+
+   1. Align all face images of the FaceScrub dataset and the MegaFace distractors. Please check the alignment scripts under **`$INSIGHTFACE_ROOT/src/align/`**. (We may release the aligned data later, but this is not yet decided.)
+
+   2. Next, generate feature files for both facescrub and megaface images.
+
+      ```Shell
+      python -u gen_megaface.py
+      ```
+
+   3. Remove the MegaFace noise, which generates new feature files.
+
+      ```Shell
+      python -u remove_noises.py
+      ```
+   4. Run the MegaFace development kit to produce the final result.
+
+   ### Models
+      1. We plan to make some models public.
+
+   ### Results
+   
+   We simply report the performance of **LResNet100E-IR** network trained on **MS1M** dataset with **ArcFace** loss.
+
+| Method  | LFW(%) | CFP-FF(%) | CFP-FP(%) | AgeDB-30(%) | MegaFace1M(%) |
+| ------- | ------ | --------- | --------- | ----------- | ------------- |
+| ArcFace | 99.80+ | 99.85+    | 94.0+     | 97.90+      | **98.0+**     |



-### How to use
+   ### Contribution
+   - PRs and third-party contributions of any kind are welcome.

-1. Download pre-aligned training dataset from our data repo which is a large binary file in MXnet .rec format(maybe ready soon), or align your dataset by yourself and then pack them to prevent random small files accessing. Check those scripts under src/common and src/align.
-2. Run src/train_softmax.py to train your model and set proper parameters. For example, loss-type=0 means pure softmax while loss-type=1 means SphereLoss. It will output LFW accuracy every 2000 batches and save the model if necessary.
-
-### Notes
-
-Default image size is 112x96 if not specified, all face images are aligned.
-
-In ResNet setting, \_v1 means original residual units.  \_v2 means pre-activation units.  \_v3 means BCBACB residual units.  LResNet means we use conv33+stride11 in its first convoluition layer instead of common conv77+stride22 to preserve high image resolution.   For ResNet50, we do not use bottleneck layers. For ResNet101 or ResNeXt101, we use.  
-
-In last several layers, some different options can be tried to determine how embedding layer looks like and it may affect the performance. The whole network architecture can be thought as {ConvLayers(->GlobalPool)->EmbeddingLayer->Softmax}. Embedding size is set to 512 expect for optionA, as embedding size in optionA is determined by the filter size of last convolution group.
-
- Option\*X: Same with Option\* but use dropout after GP.  OptionAX is the default setting for inceptions.
- OptionA: Use global pooling layer(GP). This is the default setting for all networks except inceptions.
- OptionB: Use one FC layer after GP.
- OptionC: Use FC->BN after GP.
- OptionD: Use FC->BN->PRelu after GP.
- OptionE: Use BN->Dropout->FC->BN after last conv layer.
- OptionF: Use BN->PRelu->Dropout->FC->BN after last conv layer.
-
-
-
-
-### Experiments
-
-
-
-
- **Softmax only on VGG2@112x112**
-
-|    Network/Dataset     |       LFW        |      ------      |      ------      |
-| :--------------------: | :--------------: | :--------------: | :--------------: |
-|      ResNet50D_v1      | 0.99350+-0.00293 |                  |                  |
-|    SE-ResNet50A\_v1    | 0.99367+-0.00233 |                  |                  |
-|    SE-ResNet50B_v1     | 0.99200+-0.00407 |                  |                  |
-|    SE-ResNet50C_v1     | 0.99317+-0.00404 |                  |                  |
-|    SE-ResNet50D_v1     | 0.99383+-0.00259 |                  |                  |
-|    SE-ResNet50E\_v1    | 0.99267+-0.00343 |                  |                  |
-|    SE-ResNet50F\_v1    | 0.99367+-0.00194 |                  |                  |
-|    SE-LResNet50C_v1    | 0.99567+-0.00238 |                  |                  |
-|    SE-LResNet50D_v1    | 0.99600+-0.00281 |                  |                  |
-|    SE-LResNet50E_v1    | 0.99650+-0.00174 |        -         |        -         |
-|    SE-LResNet50A_v3    | 0.99583+-0.00327 |                  |                  |
-|    SE-LResNet50D_v3    | 0.99617+-0.00358 |        -         |        -         |
-|    SE-LResNet50E_v3    | 0.99767+-0.00200 |        -         |        -         |
-|     LResNet50E_v3      | 0.99750+-0.00250 |                  |                  |
-|    SE-LResNet50F_v3    |                  |                  |                  |
-|   SE-LResNet50BX_v3    | 0.99350+-0.00263 |                  |                  |
-|    SE-ResNet101D_v3    | 0.99517+-0.00252 |                  |                  |
-|    SE-ResNet101E_v3    | 0.99467+-0.00221 |                  |                  |
-|    SE-ResNet152E_v3    | 0.99500+-0.00307 |                  |                  |
-|   Inception-ResNetBX   | 0.99417+-0.00375 |        -         |        -         |
-| SE-Inception-ResNet-v2 |        -         |        -         |        -         |
-|       MobileNetD       | 0.99150+-0.00425 |        -         |        -         |
-|      LMobileNetD       | 0.99383+-0.00409 |        -         |        -         |
-|      LMobileNetE       | 0.99633+-0.00314 |        -         |        -         |
-|      LMobileNetF       | 0.99617+-0.00211 |        -         |        -         |
-|    LResNeXt101E_v3     |                  |                  |                  |
-
-
-How weight decay affects:
-
-SE-LResNet50E-v3/vggface2/softmax:
+   ### Contact

+     [Jia Guo](guojia[at]gmail.com) and [Jiankang Deng](https://ibug.doc.ic.ac.uk/people/jdeng)

+     Questions can also be left as issues in the repository. We will be happy to answer them.
+   ```

+   ```