Deep Learning Head Pose Estimation using PyTorch.

Last update: Dec 26, 2022

Overview

Hopenet

Hopenet is an accurate and easy to use head pose estimation network. Models have been trained on the 300W-LP dataset and have been tested on real data with good qualitative performance.

For details about the method and quantitative results please check the CVPR Workshop paper.

new GoT trailer example video

new Conan-Cruise-Car example video

To use please install PyTorch and OpenCV (for video) - I believe that's all you need apart from usual libraries such as numpy. You need a GPU to run Hopenet (for now).

To test on a video using dlib face detections (center of head will be jumpy):

python code/test_on_video_dlib.py --snapshot PATH_OF_SNAPSHOT --face_model PATH_OF_DLIB_MODEL --video PATH_OF_VIDEO --output_string STRING_TO_APPEND_TO_OUTPUT --n_frames N_OF_FRAMES_TO_PROCESS --fps FPS_OF_SOURCE_VIDEO

To test on a video using your own face detections (we recommend using dockerface, center of head will be smoother):

python code/test_on_video_dockerface.py --snapshot PATH_OF_SNAPSHOT --video PATH_OF_VIDEO --bboxes FACE_BOUNDING_BOX_ANNOTATIONS --output_string STRING_TO_APPEND_TO_OUTPUT --n_frames N_OF_FRAMES_TO_PROCESS --fps FPS_OF_SOURCE_VIDEO

Face bounding box annotations should be in Dockerface format (n_frame x_min y_min x_max y_max confidence).

Pre-trained models:

300W-LP, alpha 1

300W-LP, alpha 2

300W-LP, alpha 1, robust to image quality

For more information on what alpha stands for please read the paper. First two models are for validating paper results, if used on real data we suggest using the last model as it is more robust to image quality and blur and gives good results on video.

Please open an issue if you have an problem.

Some very cool implementations of this work on other platforms by some cool people:

Gluon

MXNet

TensorFlow with Keras

A really cool lightweight version of HopeNet:

Deep Head Pose Light

If you find Hopenet useful in your research please cite:

@InProceedings{Ruiz_2018_CVPR_Workshops,
author = {Ruiz, Nataniel and Chong, Eunji and Rehg, James M.},
title = {Fine-Grained Head Pose Estimation Without Keypoints},
booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Workshops},
month = {June},
year = {2018}
}

Nataniel Ruiz, Eunji Chong, James M. Rehg

Georgia Institute of Technology

Deep Learning Head Pose Estimation using PyTorch.

Related tags

Overview

Hopenet

Owner

Nataniel Ruiz

CompilerGym is a library of easy to use and performant reinforcement learning environments for compiler tasks

StrongSORT: Make DeepSORT Great Again

NuPIC Studio is an all-in-one tool that allows users create a HTM neural network from scratch

PyTorch implementation for Stochastic Fine-grained Labeling of Multi-state Sign Glosses for Continuous Sign Language Recognition.

Out of Distribution Detection on Natural Adversarial Examples

basic tutorial on pytorch

C3D is a modified version of BVLC caffe to support 3D ConvNets.

CAUSE: Causality from AttribUtions on Sequence of Events

Pytoydl: A toy deep learning framework built upon numpy.

converts nominal survey data into a numerical value based on a dictionary lookup.

FewBit — a library for memory efficient training of large neural networks

Python wrapper of LSODA (solving ODEs) which can be called from within numba functions.

DeepHawkeye is a library to detect unusual patterns in images using features from pretrained neural networks

Official repository of the paper "GPR1200: A Benchmark for General-PurposeContent-Based Image Retrieval"

Workshop Materials Delivered on 28/02/2022

Monocular Depth Estimation Using Laplacian Pyramid-Based Depth Residuals

Continuous Diffusion Graph Neural Network

Code for "Unsupervised Layered Image Decomposition into Object Prototypes" paper

The code for our NeurIPS 2021 paper "Kernelized Heterogeneous Risk Minimization".

'Aligned mixture of latent dynamical systems' (amLDS) for stimulus decoding probabilistic manifold alignment across animals. P. Herrero-Vidal et al. NeurIPS 2021 code.

Deep Learning Head Pose Estimation using PyTorch.

Related tags

Overview

Hopenet

Owner

Nataniel Ruiz

CompilerGym is a library of easy to use and performant reinforcement learning environments for compiler tasks

StrongSORT: Make DeepSORT Great Again

NuPIC Studio is an all­-in-­one tool that allows users create a HTM neural network from scratch

PyTorch implementation for Stochastic Fine-grained Labeling of Multi-state Sign Glosses for Continuous Sign Language Recognition.

Out of Distribution Detection on Natural Adversarial Examples

basic tutorial on pytorch

C3D is a modified version of BVLC caffe to support 3D ConvNets.

CAUSE: Causality from AttribUtions on Sequence of Events

Pytoydl: A toy deep learning framework built upon numpy.

converts nominal survey data into a numerical value based on a dictionary lookup.

FewBit — a library for memory efficient training of large neural networks

Python wrapper of LSODA (solving ODEs) which can be called from within numba functions.

DeepHawkeye is a library to detect unusual patterns in images using features from pretrained neural networks

Official repository of the paper "GPR1200: A Benchmark for General-PurposeContent-Based Image Retrieval"

Workshop Materials Delivered on 28/02/2022

Monocular Depth Estimation Using Laplacian Pyramid-Based Depth Residuals

Continuous Diffusion Graph Neural Network

Code for "Unsupervised Layered Image Decomposition into Object Prototypes" paper

The code for our NeurIPS 2021 paper "Kernelized Heterogeneous Risk Minimization".

'Aligned mixture of latent dynamical systems' (amLDS) for stimulus decoding probabilistic manifold alignment across animals. P. Herrero-Vidal et al. NeurIPS 2021 code.

NuPIC Studio is an all-in-one tool that allows users create a HTM neural network from scratch