Optimal Camera Position for a Practical Application of Gaze Estimation on Edge Devices

Last update: Oct 10, 2022

Overview

Optimal Camera Position for a Practical Application of Gaze Estimation on Edge Devices,
Linh Van Ma, Tin Trung Tran, Moongu Jeon, ICAIIC 2022 (The 4th International Conference on Artificial Intelligence in Information and Communication, February 21 (Mon.) ~ 24 (Thur.), 2022, Guam, USA & Virtual Conference)

Gaze Estimation, Jetson Board Tx2, Realsense d435i Camera, Demo Video

How to run?

To finetune this deep learning model, you first need to collect your own dataset. During collection, look at the center of each rectangle in turn (36 rectangles in total).

python3 collect_dataset.py
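The collection step asks you to look at the center of each of 36 rectangles. As a rough illustration of what those targets could look like, here is a minimal sketch that computes the centers for a 6 x 6 grid of equal rectangles covering the screen. The 6 x 6 layout, the screen resolution, and the function name grid_centers are assumptions made for this example; collect_dataset.py may arrange its rectangles differently.

```python
from typing import List, Tuple


def grid_centers(width: int, height: int,
                 rows: int = 6, cols: int = 6) -> List[Tuple[float, float]]:
    """Return pixel centers of a rows x cols grid of equal rectangles.

    Centers are listed row by row, left to right.
    The 6 x 6 default is an assumption; the README only states 36 rectangles.
    """
    cell_w = width / cols   # width of one rectangle in pixels
    cell_h = height / rows  # height of one rectangle in pixels
    return [
        ((c + 0.5) * cell_w, (r + 0.5) * cell_h)
        for r in range(rows)
        for c in range(cols)
    ]


if __name__ == "__main__":
    centers = grid_centers(1920, 1080)  # assumed screen resolution
    print(len(centers))  # 36
    print(centers[0])    # (160.0, 90.0)
```

Each returned point is a gaze target in screen pixels; a collection script would typically show one target at a time and record camera frames while you fixate on it.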

Once you have finished collecting your dataset, change the subject folder in run_finetune.py to point to it. Then you can start finetuning the model.

python3 run_finetune.py

If this is the first time you run this code on your device, remember to rebuild the TensorRT components. Change your working directory to ext\tensorrt_mtcnn and run the build script.

chmod +x ./build.sh
./build.sh

You can now test gaze estimation live. First connect a RealSense camera to the Jetson TX2, then run the following script.

python3 run_camera.py

To test with a recorded video, specify your video location in run_camera_test.py, then run the following script.

python3 run_camera_test.py
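Before editing the video location in run_camera_test.py, it can help to confirm that the path actually points to a video file. The helper below is a minimal sketch using only the standard library; the name check_video_path and the list of accepted extensions are hypothetical and are not part of this repository.

```python
from pathlib import Path

# Extensions accepted by this sketch (an assumption, not a list taken from the repository).
VIDEO_EXTENSIONS = {".mp4", ".avi", ".mov", ".mkv"}


def check_video_path(path_str: str) -> Path:
    """Validate a video location and return it as an absolute path.

    Raises ValueError for an unexpected extension and
    FileNotFoundError if the file does not exist.
    """
    path = Path(path_str).expanduser()
    if path.suffix.lower() not in VIDEO_EXTENSIONS:
        raise ValueError(f"Unexpected video extension: {path.suffix!r}")
    if not path.is_file():
        raise FileNotFoundError(f"Video not found: {path}")
    return path.resolve()
```

Calling check_video_path on your recording and pasting the returned absolute path into run_camera_test.py avoids failures caused by a relative path that depends on the current working directory.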

Dependencies

FAZE: Few-Shot Adaptive Gaze Estimation: https://github.com/NVlabs/few_shot_gaze
eos: https://github.com/patrikhuber/eos
HRNets: https://github.com/HRNet/HRNet-Facial-Landmark-Detection
mtcnn-pytorch: https://github.com/TropComplique/mtcnn-pytorch
Realtime-facial-landmark-detection: https://github.com/pathak-ashutosh/Realtime-facial-landmark-detection
MTCNN TensorRT(Demo #2: MTCNN): https://github.com/jkjung-avt/tensorrt_demos#mtcnn

5.1 TensorRT MTCNN Face Detector

5.2 Optimizing TensorRT MTCNN

Acknowledgement

A large part of the code is borrowed from FAZE: Few-Shot Adaptive Gaze Estimation and MTCNN TensorRT (Demo #2: MTCNN). Thanks to the authors for their wonderful work.
