Python scripts for performing 3D human pose estimation using the Mobile Human Pose model in ONNX.

Last update: Dec 31, 2022

Overview

ONNX-Mobile-Human-Pose-3D

Python scripts for performing 3D human pose estimation using the Mobile Human Pose model.

Original image for inference: (https://static2.diariovasco.com/www/pre2017/multimedia/noticias/201412/01/media/DF0N5391.jpg)

❗ ⚠️ Known issues

The models works well when the person is looking forward and without occlusions, it will start to fail as soon as the person is occluded.
The model is fast, but the 3D representation is slow due to matplotlib, this will be fixed. The 3d representation can be ommitted for faster inference by setting draw_3dpose to False

Requirements

OpenCV, imread-from-url, scipy, onnx and onnxruntime. Also, pafy and youtube-dl are required for youtube video inference.

Installation

pip install -r requirements.txt
pip install pafy youtube-dl

ONNX model

The original models were converted to different formats (including .onnx) by PINTO0309, download the models from his repository and save them into the models folder.

YOLOv5s: You will also need an object detector to first detect the people in the image. Download the model from the model zoo and save the .onnx version into the models folder.

Original model

The original model was taken from the original repository.

Examples

Image inference:

python imagePoseEstimation.py

Video inference:

python videoPoseEstimation.py

Webcam inference:

python webcamPoseEstimation.py

Inference video Example

References:

Mobile human pose model: https://github.com/SangbumChoi/MobileHumanPose
PINTO0309's model zoo: https://github.com/PINTO0309/PINTO_model_zoo
PINTO0309's model conversion tool: https://github.com/PINTO0309/openvino2tensorflow
3DMPPE_POSENET_RELEASE repository: https://github.com/mks0601/3DMPPE_POSENET_RELEASE
Original YOLOv5 repository: https://github.com/ultralytics/yolov5
Original paper: https://openaccess.thecvf.com/content/CVPR2021W/MAI/html/Choi_MobileHumanPose_Toward_Real-Time_3D_Human_Pose_Estimation_in_Mobile_Devices_CVPRW_2021_paper.html

Python scripts for performing 3D human pose estimation using the Mobile Human Pose model in ONNX.

Related tags

Overview

ONNX-Mobile-Human-Pose-3D

❗ ⚠️ Known issues

Requirements

Installation

ONNX model

Original model

Examples

Inference video Example

References:

Owner

Ibai Gorordo

HCQ: Hybrid Contrastive Quantization for Efficient Cross-View Video Retrieval

A library for answering questions using data you cannot see

This repository focus on Image Captioning & Video Captioning & Seq-to-Seq Learning & NLP

Learning RAW-to-sRGB Mappings with Inaccurately Aligned Supervision (ICCV 2021)

GeneDisco is a benchmark suite for evaluating active learning algorithms for experimental design in drug discovery.

DexterRedTool - Dexter's Red Team Tool that creates cronjob/task scheduler to consistently creates users

make ASCII Art by Deep Learning

A Comprehensive Analysis of Weakly-Supervised Semantic Segmentation in Different Image Domains (IJCV submission)

Reimplementation of Learning Mesh-based Simulation With Graph Networks

3D Multi-Person Pose Estimation by Integrating Top-Down and Bottom-Up Networks

Use unsupervised and supervised learning to predict stocks

Official Code Release for Container : Context Aggregation Network

Brax is a differentiable physics engine that simulates environments made up of rigid bodies, joints, and actuators

PyTorch implementation of some learning rate schedulers for deep learning researcher.

A general framework for inferring CNNs efficiently. Reduce the inference latency of MobileNet-V3 by 1.3x on an iPhone XS Max without sacrificing accuracy.

An implementation of RetinaNet in PyTorch.

Some code of the implements of Geological Modeling Using 3D Pixel-Adaptive and Deformable Convolutional Neural Network

Repo for "Physion: Evaluating Physical Prediction from Vision in Humans and Machines" submission to NeurIPS 2021 (Datasets & Benchmarks track)

A keras-based real-time model for medical image segmentation (CFPNet-M)

Near-Optimal Sparse Allreduce for Distributed Deep Learning (published in PPoPP'22)