Ultra-lightweight human body posture key point CNN model. ModelSize:2.3MB HUAWEI P40 NCNN benchmark: 6ms/img,

Last update: Dec 27, 2022

Overview

Ultralight-SimplePose

Support NCNN mobile terminal deployment
Based on MXNET(>=1.5.1) GLUON(>=0.7.0) framework
Top-down strategy: The input image is the person ROI detected by the object detector
Lightweight mobile terminal human body posture key point model(COCO 17 person_keypoints)
Detector:https://github.com/dog-qiuqiu/MobileNetv2-YOLOV3

Model

Mobile inference frameworks benchmark (4*ARM_CPU)

Network	Resolution	Inference time (NCNN/Kirin 990)	FLOPS	Weight size	HeatmapAccuracy
Ultralight-Nano-SimplePose	W:192 H:256	~5.4ms	0.224BFlops	2.3MB	74.3%

COCO2017 val keypoints metrics evaluate

 Average Precision  (AP) @[ IoU=0.50:0.95 | area=   all | maxDets= 20 ] = 0.518
 Average Precision  (AP) @[ IoU=0.50      | area=   all | maxDets= 20 ] = 0.816
 Average Precision  (AP) @[ IoU=0.75      | area=   all | maxDets= 20 ] = 0.558
 Average Precision  (AP) @[ IoU=0.50:0.95 | area=medium | maxDets= 20 ] = 0.498
 Average Precision  (AP) @[ IoU=0.50:0.95 | area= large | maxDets= 20 ] = 0.549
 Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets= 20 ] = 0.563
 Average Recall     (AR) @[ IoU=0.50      | area=   all | maxDets= 20 ] = 0.837
 Average Recall     (AR) @[ IoU=0.75      | area=   all | maxDets= 20 ] = 0.607
 Average Recall     (AR) @[ IoU=0.50:0.95 | area=medium | maxDets= 20 ] = 0.535
 Average Recall     (AR) @[ IoU=0.50:0.95 | area= large | maxDets= 20 ] = 0.604

Install

pip install mxnet-cu101 gluoncv
pip install opencv-python cython pycocotools

Install mxnet according to your own cuda version

Demo

Test picture

python img_demo.py

Test camera stream

python cam_demo

How To Train

Download the coco2017 dataset

http://images.cocodataset.org/zips/train2017.zip
http://images.cocodataset.org/annotations/annotations_trainval2017.zip
http://images.cocodataset.org/zips/val2017.zip
Unzip the downloaded dataset zip file to the coco directory
交流qq群:1062122604

Train

python train_simple_pose.py

Ncnn Deploy

Dependent library: Opencv Ncnn
Read the camera video stream test by default, if you test the picture, please modify the code

Install ncnn

$ git clone https://github.com/Tencent/ncnn.git
$ cd <ncnn-root-dir>
$ mkdir -p build
$ cd build
$ make -j4
$ make install

Run ncnn sample

$ cp -rf ncnn/build/install/include ./Ultralight-SimplePose/ncnnsample/
$ cp -rf ncnn/build/install/lib ./Ultralight-SimplePose/ncnnsample/
$ g++ -o ncnnpose ncnnpose.cpp -I include/ncnn/ lib/libncnn.a `pkg-config --libs --cflags opencv` -fopenmp
$ ./ncnnpose

Ultra-lightweight human body posture key point CNN model. ModelSize:2.3MB HUAWEI P40 NCNN benchmark: 6ms/img,

Related tags

Overview

Ultralight-SimplePose

Model

Mobile inference frameworks benchmark (4*ARM_CPU)

COCO2017 val keypoints metrics evaluate

Install

Demo

Test picture

Test camera stream

How To Train

Download the coco2017 dataset

Train

Ncnn Deploy

Install ncnn

Run ncnn sample

Ncnn Picture test results

Android sample

Thanks

Owner

An LSTM based GAN for Human motion synthesis

Codes to pre-train T5 (Text-to-Text Transfer Transformer) models pre-trained on Japanese web texts

This is an official implementation for "AS-MLP: An Axial Shifted MLP Architecture for Vision".

HAT: Hierarchical Aggregation Transformers for Person Re-identification

source code of “Visual Saliency Transformer” (ICCV2021)

Tracking code for the winner of track 1 in the MMP-Tracking Challenge at ICCV 2021 Workshop.

Learning Synthetic Environments and Reward Networks for Reinforcement Learning

Code repo for "Transformer on a Diet" paper

Code for "Unsupervised State Representation Learning in Atari"

A modern pure-Python library for reading PDF files

Experiments for Fake News explainability project

SimBERT升级版（SimBERTv2）！

Adaptive Dropblock Enhanced GenerativeAdversarial Networks for Hyperspectral Image Classification

Official code for: A Probabilistic Hard Attention Model For Sequentially Observed Scenes

Federated Deep Reinforcement Learning for the Distributed Control of NextG Wireless Networks.

[NeurIPS 2021] Garment4D: Garment Reconstruction from Point Cloud Sequences

YOLOX-CondInst - Implement CondInst which is a instances segmentation method on YOLOX

Agile SVG maker for python

VL-LTR: Learning Class-wise Visual-Linguistic Representation for Long-Tailed Visual Recognition

The Incredible PyTorch: a curated list of tutorials, papers, projects, communities and more relating to PyTorch.