ZF/uniface

mirror of https://github.com/yakhyo/uniface.git synced 2025-12-30 09:02:25 +00:00

Files

Yakhyokhuja Valikhujaev 0c93598007 feat: Enhace emotion inference speed on ARM and add FaceAnalyzer, Face classes for ease of use. (#25 )

* feat: Update linting and type annotations, return types in detect

* feat: add face analyzer and face classes

* chore: Update the format and clean up some docstrings

* docs: Update usage documentation

* feat: Change AgeGender model output to 0, 1 instead of string (Female, Male)

* test: Update testing code

* feat: Add Apple silicon backend for torchscript inference

* feat: Add face analyzer example and add run emotion for testing

2025-11-30 20:32:07 +09:00

12 KiB

Raw Permalink Blame History

UniFace Model Zoo

Complete guide to all available models, their performance characteristics, and selection criteria.

Face Detection Models

RetinaFace Family

RetinaFace models are trained on the WIDER FACE dataset and provide excellent accuracy-speed tradeoffs.

Model Name	Params	Size	Easy	Medium	Hard	Use Case
`MNET_025`	0.4M	1.7MB	88.48%	87.02%	80.61%	Mobile/Edge devices
`MNET_050`	1.0M	2.6MB	89.42%	87.97%	82.40%	Mobile/Edge devices
`MNET_V1`	3.5M	3.8MB	90.59%	89.14%	84.13%	Balanced mobile
`MNET_V2` ⭐	3.2M	3.5MB	91.70%	91.03%	86.60%	Recommended default
`RESNET18`	11.7M	27MB	92.50%	91.02%	86.63%	Server/High accuracy
`RESNET34`	24.8M	56MB	94.16%	93.12%	88.90%	Maximum accuracy

Accuracy: WIDER FACE validation set (Easy/Medium/Hard subsets) - from RetinaFace paper Speed: Benchmark on your own hardware using scripts/run_detection.py --iterations 100

Usage

from uniface import RetinaFace
from uniface.constants import RetinaFaceWeights

# Default (recommended)
detector = RetinaFace()  # Uses MNET_V2

# Specific model
detector = RetinaFace(
    model_name=RetinaFaceWeights.MNET_025,  # Fastest
    conf_thresh=0.5,
    nms_thresh=0.4,
    input_size=(640, 640)
)

SCRFD Family

SCRFD (Sample and Computation Redistribution for Efficient Face Detection) models offer state-of-the-art speed-accuracy tradeoffs.

Model Name	Params	Size	Easy	Medium	Hard	Use Case
`SCRFD_500M`	0.6M	2.5MB	90.57%	88.12%	68.51%	Real-time applications
`SCRFD_10G` ⭐	4.2M	17MB	95.16%	93.87%	83.05%	High accuracy + speed

Accuracy: WIDER FACE validation set - from SCRFD paper Speed: Benchmark on your own hardware using scripts/run_detection.py --iterations 100

Usage

from uniface import SCRFD
from uniface.constants import SCRFDWeights

# Fast real-time detection
detector = SCRFD(
    model_name=SCRFDWeights.SCRFD_500M_KPS,
    conf_thresh=0.5,
    input_size=(640, 640)
)

# High accuracy
detector = SCRFD(
    model_name=SCRFDWeights.SCRFD_10G_KPS,
    conf_thresh=0.5
)

Face Recognition Models

ArcFace

State-of-the-art face recognition using additive angular margin loss.

Model Name	Backbone	Params	Size	Use Case
`MNET` ⭐	MobileNet	2.0M	8MB	Balanced (recommended)
`RESNET`	ResNet50	43.6M	166MB	Maximum accuracy

Dataset: Trained on MS1M-V2 (5.8M images, 85K identities) Accuracy: Benchmark on your own dataset or use standard face verification benchmarks

Usage

from uniface import ArcFace
from uniface.constants import ArcFaceWeights

# Default (MobileNet backbone)
recognizer = ArcFace()

# High accuracy (ResNet50 backbone)
recognizer = ArcFace(model_name=ArcFaceWeights.RESNET)

# Extract embedding
embedding = recognizer.get_normalized_embedding(image, landmarks)
# Returns: (1, 512) normalized embedding vector

MobileFace

Lightweight face recognition optimized for mobile devices.

Model Name	Backbone	Params	Size	LFW	CALFW	CPLFW	AgeDB-30	Use Case
`MNET_025`	MobileNetV1 0.25	0.36M	1MB	98.76%	92.02%	82.37%	90.02%	Ultra-lightweight
`MNET_V2` ⭐	MobileNetV2	2.29M	4MB	99.55%	94.87%	86.89%	95.16%	Mobile/Edge
`MNET_V3_SMALL`	MobileNetV3-S	1.25M	3MB	99.30%	93.77%	85.29%	92.79%	Mobile optimized
`MNET_V3_LARGE`	MobileNetV3-L	3.52M	10MB	99.53%	94.56%	86.79%	95.13%	Balanced mobile

Dataset: Trained on MS1M-V2 (5.8M images, 85K identities) Accuracy: Evaluated on LFW, CALFW, CPLFW, and AgeDB-30 benchmarks Note: These models are lightweight alternatives to ArcFace for resource-constrained environments

Usage

from uniface import MobileFace
from uniface.constants import MobileFaceWeights

# Lightweight
recognizer = MobileFace(model_name=MobileFaceWeights.MNET_V2)

SphereFace

Face recognition using angular softmax loss.

Model Name	Backbone	Params	Size	LFW	CALFW	CPLFW	AgeDB-30	Use Case
`SPHERE20`	Sphere20	24.5M	50MB	99.67%	95.61%	88.75%	96.58%	Research/Comparison
`SPHERE36`	Sphere36	34.6M	92MB	99.72%	95.64%	89.92%	96.83%	Research/Comparison

Dataset: Trained on MS1M-V2 (5.8M images, 85K identities) Accuracy: Evaluated on LFW, CALFW, CPLFW, and AgeDB-30 benchmarks Note: SphereFace uses angular softmax loss, an earlier approach before ArcFace. These models provide good accuracy with moderate resource requirements.

Usage

from uniface import SphereFace
from uniface.constants import SphereFaceWeights

recognizer = SphereFace(model_name=SphereFaceWeights.SPHERE20)

Facial Landmark Models

106-Point Landmark Detection

High-precision facial landmark localization.

Model Name	Points	Params	Size	Use Case
`2D106`	106	3.7M	14MB	Face alignment, analysis

Note: Provides 106 facial keypoints for detailed face analysis and alignment

Usage

from uniface import Landmark106

landmarker = Landmark106()
landmarks = landmarker.get_landmarks(image, bbox)
# Returns: (106, 2) array of (x, y) coordinates

Landmark Groups:

Face contour: 0-32 (33 points)
Eyebrows: 33-50 (18 points)
Nose: 51-62 (12 points)
Eyes: 63-86 (24 points)
Mouth: 87-105 (19 points)

Attribute Analysis Models

Age & Gender Detection

Model Name	Attributes	Params	Size	Use Case
`DEFAULT`	Age, Gender	2.1M	8MB	General purpose

Dataset: Trained on CelebA Note: Accuracy varies by demographic and image quality. Test on your specific use case.

Usage

from uniface import AgeGender

predictor = AgeGender()
gender_id, age = predictor.predict(image, bbox)
# Returns: (gender_id, age_in_years)
# gender_id: 0 for Female, 1 for Male

Emotion Detection

Model Name	Classes	Params	Size	Use Case
`AFFECNET7`	7	0.5M	2MB	7-class emotion
`AFFECNET8`	8	0.5M	2MB	8-class emotion

Classes (7): Neutral, Happy, Sad, Surprise, Fear, Disgust, Anger Classes (8): Above + Contempt

Dataset: Trained on AffectNet Note: Emotion detection accuracy depends heavily on facial expression clarity and cultural context

Usage

from uniface import Emotion
from uniface.constants import DDAMFNWeights

predictor = Emotion(model_name=DDAMFNWeights.AFFECNET7)
emotion, confidence = predictor.predict(image, landmarks)

Model Selection Guide

By Use Case

Mobile/Edge Devices

Detection: RetinaFace(MNET_025) or SCRFD(SCRFD_500M)
Recognition: MobileFace(MNET_V2)
Priority: Speed, small model size

Real-Time Applications (Webcam, Video)

Detection: RetinaFace(MNET_V2) or SCRFD(SCRFD_500M)
Recognition: ArcFace(MNET)
Priority: Speed-accuracy balance

High-Accuracy Applications (Security, Verification)

Detection: SCRFD(SCRFD_10G) or RetinaFace(RESNET34)
Recognition: ArcFace(RESNET)
Priority: Maximum accuracy

Server/Cloud Deployment

Detection: SCRFD(SCRFD_10G)
Recognition: ArcFace(RESNET)
Priority: Accuracy, batch processing

By Hardware

Apple Silicon (M1/M2/M3/M4)

Recommended: All models work well with ARM64 optimizations (automatically included)

pip install uniface

Recommended models:

Fast: SCRFD(SCRFD_500M) - Lightweight, real-time capable
Balanced: RetinaFace(MNET_V2) - Good accuracy/speed tradeoff
Accurate: SCRFD(SCRFD_10G) - High accuracy

Benchmark on your M4: python scripts/run_detection.py --iterations 100

NVIDIA GPU (CUDA)

Recommended: Larger models for maximum throughput

pip install uniface[gpu]

Recommended models:

Fast: SCRFD(SCRFD_500M) - Maximum throughput
Balanced: SCRFD(SCRFD_10G) - Best overall
Accurate: RetinaFace(RESNET34) - Highest accuracy

CPU Only

Recommended: Lightweight models

Recommended models:

Fast: RetinaFace(MNET_025) - Smallest, fastest
Balanced: RetinaFace(MNET_V2) - Recommended default
Accurate: SCRFD(SCRFD_10G) - Best accuracy on CPU

Note: FPS values vary significantly based on image size, number of faces, and hardware. Always benchmark on your specific setup.

Benchmark Details

How to Benchmark

Run benchmarks on your own hardware:

# Detection speed
python scripts/run_detection.py --image assets/test.jpg --iterations 100

# Compare models
python scripts/run_detection.py --image assets/test.jpg --method retinaface --iterations 100
python scripts/run_detection.py --image assets/test.jpg --method scrfd --iterations 100

Accuracy Metrics Explained

WIDER FACE: Standard face detection benchmark with three difficulty levels
- Easy: Large faces (>50px), clear backgrounds
- Medium: Medium-sized faces (30-50px), moderate occlusion
- Hard: Small faces (<30px), heavy occlusion, blur
Accuracy values are from the original papers - see references below
Model Size: ONNX model file size (affects download time and memory)
Params: Number of model parameters (affects inference speed)

Important Notes

Speed varies by:
- Image resolution
- Number of faces in image
- Hardware (CPU/GPU/CoreML)
- Batch size
- Operating system
Accuracy varies by:
- Image quality
- Lighting conditions
- Face pose and occlusion
- Demographic factors
Always benchmark on your specific use case before choosing a model

Model Updates

Models are automatically downloaded and cached on first use. Cache location: ~/.uniface/models/

Manual Model Management

from uniface.model_store import verify_model_weights
from uniface.constants import RetinaFaceWeights

# Download specific model
model_path = verify_model_weights(
    RetinaFaceWeights.MNET_V2,
    root='./custom_cache'
)

# Models are verified with SHA-256 checksums

Download All Models

# Using the provided script
python scripts/download_model.py

# Download specific model
python scripts/download_model.py --model MNET_V2

References

Model Training & Architectures

RetinaFace Training: yakhyo/retinaface-pytorch - PyTorch implementation and training code
Face Recognition Training: yakhyo/face-recognition - ArcFace, MobileFace, SphereFace training code
InsightFace: deepinsight/insightface - Model architectures and pretrained weights

Papers

RetinaFace: Single-Shot Multi-Level Face Localisation in the Wild
SCRFD: Sample and Computation Redistribution for Efficient Face Detection
ArcFace: Additive Angular Margin Loss for Deep Face Recognition
SphereFace: Deep Hypersphere Embedding for Face Recognition

12 KiB Raw Permalink Blame History

UniFace Model Zoo

Face Detection Models

RetinaFace Family

Usage

SCRFD Family

Usage

Face Recognition Models

ArcFace

Usage

MobileFace

Usage

SphereFace

Usage

Facial Landmark Models

106-Point Landmark Detection

Usage

Attribute Analysis Models

Age & Gender Detection

Usage

Emotion Detection

Usage

Model Selection Guide

By Use Case

Mobile/Edge Devices

Real-Time Applications (Webcam, Video)

High-Accuracy Applications (Security, Verification)

Server/Cloud Deployment

By Hardware

Apple Silicon (M1/M2/M3/M4)

NVIDIA GPU (CUDA)

CPU Only

Benchmark Details

How to Benchmark

Accuracy Metrics Explained

Important Notes

Model Updates

Manual Model Management

Download All Models

References

Model Training & Architectures

Papers

12 KiB

Raw Permalink Blame History