[3DV 2020] PeeledHuman: Robust Shape Representation for Textured 3D Human Body Reconstruction

Last update: Oct 12, 2022

Related tags

Deep Learning PeeledHuman

Overview

PeeledHuman: Robust Shape Representation for Textured 3D Human Body Reconstruction

International Conference on 3D Vision, 2020

Sai Sagar Jinka¹, Rohan Chacko¹, Avinash Sharma¹, P. J. Narayanan¹

¹Center for Visual Information Technology, IIIT Hyderabad

[Project page] [Paper] [ArXiv]

Abstract
We introduce PeeledHuman - a novel shape representation of the human body that is robust to self-occlusions. PeeledHuman encodes the human body as a set of Peeled Depth and RGB maps in 2D, obtained by performing raytracing on the 3D body model and extending each ray beyond its first intersection. This formulation allows us to handle self-occlusions efficiently compared to other representations. Given a monocular RGB image, we learn these Peeled maps in an end-to-end generative adversarial fashion using our novel framework - PeelGAN. We train PeelGAN using a 3D Chamfer loss and other 2D losses to generate multiple depth values per-pixel and a corresponding RGB field per-vertex in a dual-branch setup. In our simple non-parametric solution, the generated Peeled Depth maps are back-projected to 3D space to obtain a complete textured 3D shape. The corresponding RGB maps provide vertex-level texture details. We compare our method with current parametric and non-parametric methods in 3D reconstruction and find that we achieve state-of-theart-results. We demonstrate the effectiveness of our representation on publicly available BUFF and MonoPerfCap datasets as well as loose clothing data collected by our calibrated multi-Kinect setup.

Testing

Install environment

$ conda env create -f environment.yml

Download the checkpoint from here and store it in ./checkpoints/test/. The provided checkpoint was trained on the MonoPerfCap dataset.

Run the inference script

python test.py                            \
  --test_folder_path <path/to/images/dir> \
  --results_dir <path/to/results/dir>     \
  --name test                             \
  --direction AtoB                        \
  --model pix2pix                         \
  --netG resnet_18blocks                  \
  --output_nc 4                           \
  --load_size 512                         \
  --eval

The script looks for the checkpoint file in checkpoints/<checkpoint/name>

Citation

@inproceedings {jinka2020peeledhuman,
  author = {S. Jinka and R. Chacko and A. Sharma and P. Narayanan},
  booktitle = {2020 International Conference on 3D Vision (3DV)},
  title = {PeeledHuman: Robust Shape Representation for Textured 3D Human Body Reconstruction},
  year = {2020},
  pages = {879-888},
  doi = {10.1109/3DV50981.2020.00098},
  publisher = {IEEE Computer Society},
  }

Acknowledgements

Our network derives from the pix2pix work and hence builds on the official PyTorch implementation of pix2pix. This README template was borrowed from Aakash Kt. Please open an issue in case of any bugs/queries.

[3DV 2020] PeeledHuman: Robust Shape Representation for Textured 3D Human Body Reconstruction

Related tags

Overview

PeeledHuman: Robust Shape Representation for Textured 3D Human Body Reconstruction

Testing

Acknowledgements

Owner

Rohan Chacko

TeST: Temporal-Stable Thresholding for Semi-supervised Learning

[CVPR 2021] Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers

Implementation of Nalbach et al. 2017 paper.

PyTorch implementation of the implicit Q-learning algorithm (IQL)

Official implementation of ACTION-Net: Multipath Excitation for Action Recognition (CVPR'21).

Domain Generalization with MixStyle, ICLR'21.

TensorFlow for Raspberry Pi

A Pytorch Implementation for Compact Bilinear Pooling.

A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation

A treasure chest for visual recognition powered by PaddlePaddle

Code for BMVC2021 "MOS: A Low Latency and Lightweight Framework for Face Detection, Landmark Localization, and Head Pose Estimation"

Tools for manipulating UVs in the Blender viewport.

Compartmental epidemic model to assess undocumented infections: applications to SARS-CoV-2 epidemics in Brazil - Datasets and Codes

PyTorch implementation for MINE: Continuous-Depth MPI with Neural Radiance Fields

Location-Sensitive Visual Recognition with Cross-IOU Loss

python library for invisible image watermark (blind image watermark)

Exploit ILP to learn symmetry breaking constraints of ASP programs.

Application of the L2HMC algorithm to simulations in lattice QCD.

Poisson Surface Reconstruction for LiDAR Odometry and Mapping

Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".