[CVPR 2022] PoseTriplet: Co-evolving 3D Human Pose Estimation, Imitation, and Hallucination under Self-supervision (Oral)

Last update: Dec 28, 2022


Overview

PoseTriplet: Co-evolving 3D Human Pose Estimation, Imitation, and Hallucination under Self-supervision

Kehong Gong*, Bingbing Li*, Jianfeng Zhang*, Tao Wang*, Jing Huang, Bi Mi, Jiashi Feng, Xinchao Wang

CVPR 2022 (Oral Presentation, arXiv)

Framework

PoseTriplet contains three components: an estimator, an imitator, and a hallucinator.

The three components form a dual loop during training, complementing and strengthening one another.
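The loop structure can be sketched roughly as below. This is only an illustrative outline, not the repository's training code: the function `co_evolve` and all of its callable parameters (`estimate`, `imitate`, `hallucinate`, `project_to_2d`, `retrain`) are hypothetical names, and the ordering of the steps is an assumption based on the description of the three components.

```python
from typing import Any, Callable, List


def co_evolve(
    poses_2d: List[Any],
    estimator: Any,
    estimate: Callable[[Any, List[Any]], List[Any]],
    imitate: Callable[[List[Any]], List[Any]],
    hallucinate: Callable[[List[Any]], List[Any]],
    project_to_2d: Callable[[List[Any]], List[Any]],
    retrain: Callable[[Any, List[Any], List[Any]], Any],
    rounds: int = 3,
) -> Any:
    """Run a fixed number of co-evolving rounds and return the final estimator.

    Every callable is a placeholder supplied by the caller (hypothetical API).
    """
    for _ in range(rounds):
        # Estimator: lift the 2D pose sequences to 3D.
        poses_3d = estimate(estimator, poses_2d)
        # Imitator: refine the estimated motion.
        refined_3d = imitate(poses_3d)
        # Hallucinator: generate additional motion from the refined motion.
        novel_3d = hallucinate(refined_3d)
        # Pair the 3D data with its 2D projection and update the estimator.
        train_3d = refined_3d + novel_3d
        estimator = retrain(estimator, project_to_2d(train_3d), train_3d)
    return estimator
```

In this sketch each round feeds the estimator's own output back as training data after the imitator and hallucinator have processed it, which is the sense in which the components strengthen one another. The actual scripts split these stages across separate training steps; see script-summary for the real procedure.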

Improvement through co-evolving

Below is the imitated motion from different training rounds. The estimator and imitator improve over the rounds, so the imitated motion becomes more accurate and realistic from round 1 to round 3.

Video demo

04806-supp.mp4

Comparison

Here we compare our results with two recent works, Yu et al. and Hu et al.

Installation

Please refer to README_env.md for the Python environment setup.

Data Preparation

Please refer to estimator/README.md for the preparation of the dataset files.

Training

Please refer to script-summary for the training process. We also provide a checkpoint folder here with better performance, which supports the claim that this framework has the potential to reach the same performance as fully-supervised approaches.
Note: the checkpoint for the RL policy is not included due to the size limitation; please follow the training code to train the policy.

Inference

We provide inference code here. Please follow the instructions and download the pretrained model to run inference on videos.

Talk

Here is a slidestalk (slides in English, talk in Chinese).

Citation

If you find this code useful for your research, please consider citing the following paper:

@inproceedings{gong2022posetriplet,
  title      = {PoseTriplet: Co-evolving 3D Human Pose Estimation, Imitation, and Hallucination under Self-supervision},
  author     = {Gong, Kehong and Li, Bingbing and Zhang, Jianfeng and Wang, Tao and Huang, Jing and Mi, Michael Bi and Feng, Jiashi and Wang, Xinchao},
  booktitle  = {CVPR},
  year       = {2022}
}
