Synthesizing Long-Term 3D Human Motion and Interaction in 3D in CVPR2021

Last update: Dec 13, 2022

Overview

Long-term-Motion-in-3D-Scenes

This is an implementation of the CVPR'21 paper "Synthesizing Long-Term 3D Human Motion and Interaction in 3D".

Please check our paper and the project webpage for more details.

Citation

If you use our code or paper, please consider citing:

@article{wang2020synthesizing,
  title={Synthesizing Long-Term 3D Human Motion and Interaction in 3D Scenes},
  author={Wang, Jiashun and Xu, Huazhe and Xu, Jingwei and Liu, Sifei and Wang, Xiaolong},
  journal={arXiv preprint arXiv:2012.05522},
  year={2020}
}

Dependencies

Requirements:

python3.6
pytorch==1.1.0
trimesh
open3d
Chamfer Pytorch
Human Body Prior
SMPL-X

Datasets

We use PROX and PROXE datasets as our training data. After downloading them, please put them in './data/'. We provide generate_routepose_data.ipynb and generate_sub_data.ipynb for data generation. Note in PROX, the human meshes and the scene meshes are not in the same area in the world coordinates. Different from PROX and PROXE, we apply the inverse of the camera extrinsics to the scene mesh. Since the scene is the input and we need it to be aligned with the human bodies. This is done in the data generation code. Thus for contact calculating, you do not need to apply transformation to them. While for collision calculating, you still need to apply the transformation to the human bodies similar to PROXE to make it be aligned with SDF. Please be careful with this during training or testing, especially if you want to test on other scenes such as Matterport3D. Please put body_segments data in './data/' as well.

Demo

We provide demo.ipynb to help you play with our method. Before running, please put a downsampled MPH16.ply mesh and the SDF data of this scene in './demo_data/'. You can download them from PROX and PROXE. Still, please be careful with the camera extrinsics when you want to test other scenes, make sure the human body is in the scene. This code will also show you how to optimize the whole motion.

Models

We use SMPL-X to represent human bodies. Please download the SMPL-X models and put them in './models/' and it may look like './models/smplx/SMPLX_NEUTRAL.npz'. Please download vposer model and put it in './' ('./vposer_v1_0/').

We also provide our pretrained model here

Training

After you generate the data. You can train the networks directly,

python train_subgoal.py

python train_route.py

Please train the posenet after you finished training routenet with your own pretrained routenet model,

python train_pose.py

Acknowledgement

This work was supported, in part, by grants from DARPA LwLL, NSF 1730158 CI-New: Cognitive Hardware and Software Ecosystem Community Infrastructure (CHASE-CI), NSF ACI-1541349 CC*DNI Pacific Research Platform, and gifts from Qualcomm and TuSimple. Part of our code is based on PROXE and it may help you with the dependencies and dataset parts as well. Many thanks!

License

Apache-2.0 License

Synthesizing Long-Term 3D Human Motion and Interaction in 3D in CVPR2021

Related tags

Overview

Long-term-Motion-in-3D-Scenes

Citation

Dependencies

Datasets

Demo

Models

Training

Acknowledgement

License

Owner

Jiashun Wang

Code for Referring Image Segmentation via Cross-Modal Progressive Comprehension, CVPR2020.

Fully convolutional deep neural network to remove transparent overlays from images

Code for paper: Group-CAM: Group Score-Weighted Visual Explanations for Deep Convolutional Networks

Pytorch implementation of FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks

Code for paper entitled "Improving Novelty Detection using the Reconstructions of Nearest Neighbours"

Open-AI's DALL-E for large scale training in mesh-tensorflow.

unet for image segmentation

The 2nd place solution of 2021 google landmark retrieval on kaggle.

TDmatch is a Python library developed to perform matching tasks in three categories:

An Unbiased Learning To Rank Algorithms (ULTRA) toolbox

Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering

Generative Exploration and Exploitation - This is an improved version of GENE.

【CVPR 2021, Variational Inference Framework, PyTorch】 From Rain Generation to Rain Removal

Official implementation of the paper WAV2CLIP: LEARNING ROBUST AUDIO REPRESENTATIONS FROM CLIP

Code for Multinomial Diffusion

Python Classes: Medical Insurance Project using Object Oriented Programming Concepts

RATCHET is a Medical Transformer for Chest X-ray Diagnosis and Reporting

ICNet and PSPNet-50 in Tensorflow for real-time semantic segmentation

Official implementation of the paper Label-Efficient Semantic Segmentation with Diffusion Models

rastrainer is a QGIS plugin to training remote sensing semantic segmentation model based on PaddlePaddle.