Deep Face Recognition in PyTorch

Last update: Sep 11, 2022

By Alexey Gruzdev and Vladislav Sovrasov

Introduction

A repository for different experimental Face Recognition models such as CosFace, ArcFace, SphereFace, SV-Softmax, etc.

Contents

Installation
Preparation
Train/Eval
Models
Face Recognition Demo

Installation

Create and activate a Python virtual environment:

bash init_venv.sh
. venv/bin/activate

Preparation

To train a Face Recognition model, download the VGGFace2 data. We will refer to the folder that contains it as $VGGFace2_ROOT.
To evaluate a Face Recognition model, download the LFW data and the LFW landmarks. Place everything in one folder, which we will refer to as $LFW_ROOT.
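Before training, it can help to confirm that both folders have the layout the commands in this README expect. The following is a minimal sketch, not part of the repository: the relative paths are taken from the train and eval commands in the Train/Eval section, while the helper name `missing_paths` and the idea of reading the roots from environment variables are assumptions made for illustration.

```python
import os

# Relative paths taken from the train/eval commands in this README.
EXPECTED = {
    "VGGFace2_ROOT": ["train", "meta/train_list.txt", "bb_landmark"],
    "LFW_ROOT": ["lfw", "pairs.txt", "lfw_landmark.txt"],
}


def missing_paths(root, relative_paths):
    """Return the entries of relative_paths that do not exist under root."""
    return [p for p in relative_paths
            if not os.path.exists(os.path.join(root, p))]


if __name__ == "__main__":
    for variable, relative_paths in EXPECTED.items():
        root = os.environ.get(variable)  # hypothetical: roots exported as env vars
        if root is None:
            print(f"{variable} is not set")
            continue
        for path in missing_paths(root, relative_paths):
            print(f"{variable}: missing {path}")
```

If the script prints nothing, every path used by the commands in this README was found.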

Train/Eval

Go to the $FR_ROOT folder:

cd $FR_ROOT/

To start training an FR model:

python train.py --train_data_root $VGGFace2_ROOT/train/ --train_list $VGGFace2_ROOT/meta/train_list.txt \
    --train_landmarks $VGGFace2_ROOT/bb_landmark/ --val_data_root $LFW_ROOT/lfw/ --val_list $LFW_ROOT/pairs.txt \
    --val_landmarks $LFW_ROOT/lfw_landmark.txt --train_batch_size 200 --snap_prefix mobilenet_256 --lr 0.35 \
    --embed_size 256 --model mobilenet --device 1

To evaluate an FR snapshot (say, a MobileNet with an embedding size of 256, trained for 300k iterations):

python evaluate_lfw.py --val_data_root $LFW_ROOT/lfw/ --val_list $LFW_ROOT/pairs.txt \
    --val_landmarks $LFW_ROOT/lfw_landmark.txt --snap /path/to/snapshot/mobilenet_256_300000.pt --model mobilenet --embed_size 256

Configuration files

Instead of passing all the required parameters on the command line, you can have the training script read them from a YAML configuration file. Each line of the file should contain a valid description of one parameter in YAML format. Example:

#optimizer parameters
lr: 0.4
train_batch_size: 256
#loss options
margin_type: cos
s: 30
m: 0.35
#model parameters
model: mobilenet
embed_size: 256
#misc
snap_prefix: MobileFaceNet
devices: [0, 1]
#datasets
train_dataset: vgg
train_data_root: $VGGFace2_ROOT/train/
#... and so on
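The loss options in the example above (margin_type: cos, s, m) correspond to a cosine-margin softmax of the kind used by CosFace: the margin m is subtracted from the cosine of the target class only, and all cosines are multiplied by the scale s before the softmax. The sketch below illustrates that idea in plain Python for a single sample. The function names and the cosine values are made up, and the repository's own loss may differ in detail.

```python
import math


def cos_margin_logits(cosines, target, s=30.0, m=0.35):
    """Subtract margin m from the target-class cosine, then scale all by s."""
    return [s * (c - m) if j == target else s * c
            for j, c in enumerate(cosines)]


def cross_entropy(logits, target):
    """Numerically stable softmax cross-entropy for one sample."""
    peak = max(logits)
    log_sum = peak + math.log(sum(math.exp(z - peak) for z in logits))
    return log_sum - logits[target]


# Made-up cosine similarities between one embedding and three class centers.
cosines = [0.8, 0.3, -0.1]
plain_loss = cross_entropy([30.0 * c for c in cosines], target=0)
margin_loss = cross_entropy(cos_margin_logits(cosines, target=0), target=0)
print(plain_loss < margin_loss)  # the margin makes the same sample harder
```

Because the margin lowers only the target logit, the loss keeps pushing the embedding toward its class center even when it is already classified correctly.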

The path to the config file can be passed to the training script on the command line, prefixed with @. Any arguments passed before the config file are overwritten by the values it defines.

python train.py -m 0.35 @./my_config.yml #here m is overwritten if my_config.yml sets it
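One way to get this behaviour is the fromfile_prefix_chars feature of Python's argparse, which expands an @file argument in place into the arguments read from that file. The sketch below is an assumption about how such a mechanism can work, not the repository's actual code: the class name, the defaults, and the small set of options are hypothetical.

```python
import argparse


class ConfigArgumentParser(argparse.ArgumentParser):
    """Hypothetical parser that reads 'key: value' lines from an @file."""

    def convert_arg_line_to_args(self, arg_line):
        # Drop comments and skip blank lines.
        line = arg_line.split("#", 1)[0].strip()
        if not line:
            return []
        key, _, value = line.partition(":")
        # Flatten simple lists such as "[0, 1]" into separate tokens.
        tokens = value.strip().strip("[]").replace(",", " ").split()
        return ["--" + key.strip()] + tokens


def build_parser():
    # Defaults here are placeholders chosen for the example.
    parser = ConfigArgumentParser(fromfile_prefix_chars="@")
    parser.add_argument("-m", "--m", type=float, default=0.1)
    parser.add_argument("--lr", type=float, default=0.1)
    parser.add_argument("--model", default="mobilenet")
    parser.add_argument("--devices", type=int, nargs="+", default=[0])
    return parser
```

With this parser, build_parser().parse_args(["-m", "0.35", "@./my_config.yml"]) takes m from the file if the file sets it, because the file's arguments are spliced in after -m 0.35 and the last occurrence of an option wins. Placing the @file first reverses the priority.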

Models

You can also download a pretrained model from the file share: https://download.01.org/openvinotoolkit/open_model_zoo/training_toolbox_pytorch/models/fr/Mobilenet_se_focal_121000.pt

To evaluate it:

cd $FR_ROOT
python evaluate_lfw.py --val_data_root $LFW_ROOT/lfw/ --val_list $LFW_ROOT/pairs.txt --val_landmarks $LFW_ROOT/lfw_landmark.txt \
    --snap /path/to/snapshot/Mobilenet_se_focal_121000.pt --model mobilenet --embed_size 256

You should get the following output:

I1114 09:33:37.846870 10544 evaluate_lfw.py:242] Accuracy/Val_same_accuracy mean: 0.9923
I1114 09:33:37.847019 10544 evaluate_lfw.py:243] Accuracy/Val_diff_accuracy mean: 0.9970
I1114 09:33:37.847069 10544 evaluate_lfw.py:244] Accuracy/Val_accuracy mean: 0.9947
I1114 09:33:37.847179 10544 evaluate_lfw.py:245] Accuracy/Val_accuracy std dev: 0.0035
I1114 09:33:37.847229 10544 evaluate_lfw.py:246] AUC: 0.9995
I1114 09:33:37.847305 10544 evaluate_lfw.py:247] Estimated threshold: 0.7241
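The same, diff, and overall accuracies in this log are pair-verification metrics: each LFW pair gets a score, and the pair is called "same" or "different" by comparing that score with a threshold. The sketch below shows the idea on made-up scores. It assumes that a higher score means more similar (the script may use a distance instead), and the threshold search is a simple illustration, not necessarily how evaluate_lfw.py estimates its threshold.

```python
def verification_accuracy(scores, is_same, threshold):
    """Return (same accuracy, diff accuracy, overall accuracy) at a threshold."""
    same_hits = [s >= threshold for s, same in zip(scores, is_same) if same]
    diff_hits = [s < threshold for s, same in zip(scores, is_same) if not same]
    same_acc = sum(same_hits) / len(same_hits)
    diff_acc = sum(diff_hits) / len(diff_hits)
    overall = (sum(same_hits) + sum(diff_hits)) / len(scores)
    return same_acc, diff_acc, overall


def estimate_threshold(scores, is_same):
    """Pick the observed score that gives the best overall accuracy."""
    return max(sorted(set(scores)),
               key=lambda t: verification_accuracy(scores, is_same, t)[2])


# Made-up similarity scores for three "same" and three "different" pairs.
scores = [0.9, 0.8, 0.6, 0.4, 0.3, 0.75]
is_same = [True, True, True, False, False, False]
threshold = estimate_threshold(scores, is_same)
same_acc, diff_acc, overall = verification_accuracy(scores, is_same, threshold)
```

The "mean" and "std dev" in the log suggest that these values are aggregated over several evaluation splits, whereas the sketch scores a single set of pairs.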

Demo

To set up the demo, see the Face Recognition demo with OpenVINO Toolkit.
