Demo for Real-time RGBD-based Extended Body Pose Estimation paper

Last update: Dec 26, 2022

Related tags

Deep Learning rgbd-kinect-pose

Overview

Real-time RGBD-based Extended Body Pose Estimation

This repository is a real-time demo for our paper that was published at WACV 2021 conference

The output of our module is in SMPL-X parametric body mesh model:

RNN estimates body pose from joints detected by Azure Kinect Body Tracking API
For face (expression and jaw) and hand pose we crop from rgb image:
- for hand model we use minimal-hand
- our face NN takes media-pipe keypoints as input

Combined system runs at 30 fps on a 2080ti GPU and 8 core @ 4GHz CPU.

How to use

Build

Prereqs: your nvidia driver should support cuda 10.2, Windows or Mac are not supported.
Clone repo:
- git clone https://github.com/rmbashirov/rgbd-kinect-pose.git
- cd rgbd-kinect-pose
- git submodule update --force --init --remote
Docker setup:
- Install docker engine
- Install nvidia-docker
- Set nvidia your default runtime for docker
- Make docker run without sudo: create docker group and add current user to it:
```
sudo groupadd docker
sudo usermod -aG docker $USER
```
- reboot
Build docker image: run 2 cmds
Attach your Azure Kinect camera
Check your Azure Kinect camera is working inside Docker container:
- Enter Docker container: ./run_local.sh from docker dir
- Then run python -m pyk4a.viewer --vis_color --no_bt --no_depth inside docker container

Download data

Download our data archive smplx_kinect_demo_data.tar.gz
Unzip: mkdir /your/unpacked/dir, tar -zxf smplx_kinect_demo_data.tar.gz -C /your/unpacked/dir
Download models for hand, see link in "Download models from here" line in our fork, put to /your/unpacked/dir/minimal_hand/model
To download SMPL-X parametric body model go to this project website, register, go to the downloads section, download SMPL-X v1.1 model, put to /your/unpacked/dir/pykinect/body_models/smplx
/your/unpacked/dir should look like this
Set data_dirpath and output_dirpath variables in config file:
- data_dirpath is a path to /your/unpacked/dir
- output_dirpath is used to check timings or to store result images
- ensure these paths are visible inside docker container, set VOLUMES variable here

Run

Run demo: in src dir run ./run_server.sh, the latter will enter docker container and will use config file where shape of the person is loaded from an external file: in our work we did not focus on person's shape estimation

What else

Apart from our main body pose estimation contribution you can find this repository useful for:

minimal_pytorch_rasterizer python package: CUDA non-differentiable mesh rasterization library for pytorch tensors with python bindings
pyk4a python package: real-time streaming from Azure Kinect camera, this package also works in our provided docker environment
multiprocessing_pipeline python package: set-up pipeline graph of python blocks running in parallel, see usage in server.py

Citation

If you find the project helpful, please consider citing us:

@inproceedings{bashirov2021real,
  title={Real-Time RGBD-Based Extended Body Pose Estimation},
  author={Bashirov, Renat and Ianina, Anastasia and Iskakov, Karim and Kononenko, Yevgeniy and Strizhkova, Valeriya and Lempitsky, Victor and Vakhitov, Alexander},
  booktitle={Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision},
  pages={2807--2816},
  year={2021}
}

Non-commercial use only

Demo for Real-time RGBD-based Extended Body Pose Estimation paper

Related tags

Overview

Real-time RGBD-based Extended Body Pose Estimation

How to use

Build

Download data

Run

What else

Citation

Owner

Renat Bashirov

The Multi-Mission Maximum Likelihood framework (3ML)

The official PyTorch implementation for the paper "sMGC: A Complex-Valued Graph Convolutional Network via Magnetic Laplacian for Directed Graphs".

This project aims at building a real-time wide band channel sounder using USRPs

TensorFlow implementation of AlexNet and its training and testing on ImageNet ILSVRC 2012 dataset

Generative Autoregressive, Normalized Flows, VAEs, Score-based models (GANVAS)

Rainbow DQN implementation that outperforms the paper's results on 40% of games using 20x less data 🌈

Fully Convolutional Refined Auto Encoding Generative Adversarial Networks for 3D Multi Object Scenes

Turn based roguelike in python

1st place solution to the Satellite Image Change Detection Challenge hosted by SenseTime

PyTorch implementation of image classification models for CIFAR-10/CIFAR-100/MNIST/FashionMNIST/Kuzushiji-MNIST/ImageNet

YOLO5Face: Why Reinventing a Face Detector (https://arxiv.org/abs/2105.12931)

Self-Supervised Learning with Kernel Dependence Maximization

Implementation of Uformer, Attention-based Unet, in Pytorch

TreeSubstitutionCipher - Encryption system based on trees and substitution

Pytorch implementation of Value Iteration Networks (NIPS 2016 best paper)

Unified file system operation experience for different backend

Graph-based community clustering approach to extract protein domains from a predicted aligned error matrix

MARS: Learning Modality-Agnostic Representation for Scalable Cross-media Retrieva

Tensorflow implementation of soft-attention mechanism for video caption generation.

Large scale embeddings on a single machine.