End-to-end face detection, cropping, norm estimation, and landmark detection in a single onnx model

Last update: Dec 30, 2022

Related tags

Deep Learning onnx-facial-lmk-detector

Overview

onnx-facial-lmk-detector

End-to-end face detection, cropping, norm estimation, and landmark detection in a single onnx model, model.onnx.

Demo

You can try this model at the following link. Thanks for hysts.

https://huggingface.co/spaces/hysts/atksh-onnx-facial-lmk-detector

Code

See src.

Example

import onnxruntime as ort
import cv2

sess = ort.InferenceSession("model.onnx")
img = cv2.imread("input.jpg")

scores, bboxes, keypoints, aligned_imgs, landmarks, affine_matrices = sess.run(None, {"input": img})
# float32 int64 int64 uint8 int64 float32
# (N,) (N, 4) (N, 5, 2) (N, 224, 224, 3) (N, 106, 2) (N, 2, 3)

This model requires onnxruntime>=1.11.

How does it work?

This is simply a merged model of the following underlying models with some pre- and post-processing.

Underlying models

	model	reference
face detection	SCRFD_10G_KPS	https://github.com/deepinsight/insightface/tree/master/detection/scrfd#pretrained-models
landmark detection	2d106det	https://github.com/deepinsight/insightface/blob/master/alignment/coordinate_reg/README.md#pretrained-models

Pre- and Post-Processing

Implemented the following processing by PyTorch and exported to ONNX.

Input transform:
- Resize and pad to (1920, 1920)
- BGR to RGB conversion
- Transpose (H, W, C) to (C, H, W)
(Face Detection)
Post-processing of face detection
- Predicted bounding boxes and Confidence Score Processing
- NMS (ONNX Operator)
Norm estimation and face cropping
- Estimate the norm and apply an affine transformation to each face.
- Crop the faces and resize them to (192, 192).
(Landmark Detection)
Perform post-processing for landmark detection.
- Process the predicted landmarks and apply the inverse affine transform to each face.

Note

Please check with the model provider regarding the license for your use.

This model includes the work that is distributed in the Apache License 2.0.

End-to-end face detection, cropping, norm estimation, and landmark detection in a single onnx model

Related tags

Overview

onnx-facial-lmk-detector

Demo

Code

Example

How does it work?

Underlying models

Pre- and Post-Processing

Note

Owner

atksh

Deepfake Scanner by Deepware.

SeqAttack: a framework for adversarial attacks on token classification models

All of the figures and notebooks for my deep learning book, for free!

An AutoML Library made with Optuna and PyTorch Lightning

Tensorflow python implementation of "Learning High Fidelity Depths of Dressed Humans by Watching Social Media Dance Videos"

A GUI for Face Recognition, based upon Docker, Tkinter, GPU and a camera device.

Tensors and Dynamic neural networks in Python with strong GPU acceleration

PAWS 🐾 Predicting View-Assignments with Support Samples

A Light in the Dark: Deep Learning Practices for Industrial Computer Vision

Utility tools for the "Divide and Remaster" dataset, introduced as part of the Cocktail Fork problem paper

WORD: Revisiting Organs Segmentation in the Whole Abdominal Region

Pretrained Cost Model for Distributed Constraint Optimization Problems

Pipeline for employing a Lightweight deep learning models for LOW-power systems

How to Leverage Multimodal EHR Data for Better Medical Predictions?

Implementation for our AAAI2021 paper (Entity Structure Within and Throughout: Modeling Mention Dependencies for Document-Level Relation Extraction).

Joint parameterization and fitting of stroke clusters

Trading environnement for RL agents, backtesting and training.

[CVPR'22] COAP: Learning Compositional Occupancy of People

A collection of easy-to-use, ready-to-use, interesting deep neural network models

An LSTM based GAN for Human motion synthesis