[CVPR2021] De-rendering the World's Revolutionary Artefacts

Last update: Nov 06, 2022

Overview

De-rendering the World's Revolutionary Artefacts

Project Page | Video | Paper

In CVPR 2021

Shangzhe Wu^1,4, Ameesh Makadia⁴, Jiajun Wu², Noah Snavely⁴, Richard Tucker⁴, Angjoo Kanazawa^3,4

¹ University of Oxford, ² Stanford University, ³ University of California, Berkeley, ⁴ Google Research

teaser.mp4

We propose a model that de-renders a single image of a vase into shape, material and environment illumination, trained using only a single image collection, without explicit 3D, multi-view or multi-light supervision.

Setup (with conda)

1. Install dependencies:

conda env create -f environment.yml

OR manually:

conda install -c conda-forge matplotlib opencv scikit-image pyyaml tensorboard

2. Install PyTorch:

conda install pytorch==1.4.0 torchvision==0.5.0 cudatoolkit=10.1 -c pytorch

Note: The code is tested with PyTorch 1.4.0 and CUDA 10.1. A GPU version is required, as the neural_renderer package only has a GPU implementation.

3. Install neural_renderer:

This package is required for training and testing, and optional for the demo. It requires a GPU device and GPU-enabled PyTorch.

pip install neural_renderer_pytorch==1.1.3

Note: If this fails or runtime error occurs, try compiling it from source. If you don't have a gcc>=5, you could one available on conda: conda install gxx_linux-64=7.3.

git clone https://github.com/daniilidis-group/neural_renderer.git
cd neural_renderer
python setup.py install

Datasets

1. Metropolitan Museum Vases

This vase dataset is collected from Metropolitan Museum of Art Collection through their open-access API under the CC0 License. It contains 1888 training images and 526 testing images of museum vases with segmentation masks obtained using PointRend and GrabCut.

Download the preprocessed dataset using the provided script:

cd data && sh download_met_vases.sh

2. Synthetic Vases

This synthetic vase dataset is generated with random vase-like shapes, poses (elevation), lighting (using spherical Gaussian) and shininess materials. The diffuse texture is generated using the texture maps provided in CC0 Textures under the CC0 License.

Download the dataset using the provided script:

cd data && sh download_syn_vases.sh

Pretrained Models

Download the pretrained models using the scripts provided in pretrained/, eg:

cd pretrained && sh download_pretrained_met_vase.sh

Training and Testing

Check the configuration files in configs/ and run experiments, eg:

python run.py --config configs/train_met_vase.yml --gpu 0 --num_workers 4

Evaluation on Synthetic Vases

Check and run:

python eval/eval_syn_vase.py

Render Animations

To render animations of rotating vases and rotating light, check and run this script:

python render_animation.py

Citation

@InProceedings{wu2021derender,
  author={Shangzhe Wu and Ameesh Makadia and Jiajun Wu and Noah Snavely and Richard Tucker and Angjoo Kanazawa},
  title={De-rendering the World's Revolutionary Artefacts},
  booktitle = {CVPR},
  year = {2021}
}

[CVPR2021] De-rendering the World's Revolutionary Artefacts

Related tags

Overview

De-rendering the World's Revolutionary Artefacts

Project Page | Video | Paper

Setup (with conda)

1. Install dependencies:

2. Install PyTorch:

3. Install neural_renderer:

Datasets

1. Metropolitan Museum Vases

2. Synthetic Vases

Pretrained Models

Training and Testing

Evaluation on Synthetic Vases

Render Animations

Citation

Owner

Blender Python - Node-based multi-line text and image flowchart

My solution for the 7th place / 245 in the Umoja Hack 2022 challenge

[ICML 2020] "When Does Self-Supervision Help Graph Convolutional Networks?" by Yuning You, Tianlong Chen, Zhangyang Wang, Yang Shen

Code for Learning Manifold Patch-Based Representations of Man-Made Shapes, in ICLR 2021.

《Single Image Reflection Removal Beyond Linearity》(CVPR 2019)

The 3rd place solution for competition

Applicator Kit for Modo allow you to apply Apple ARKit Face Tracking data from your iPhone or iPad to your characters in Modo.

Multi-Joint dynamics with Contact. A general purpose physics simulator.

The code for MM2021 paper "Multi-Level Counterfactual Contrast for Visual Commonsense Reasoning"

《LXMERT: Learning Cross-Modality Encoder Representations from Transformers》(EMNLP 2020)

A library for performing coverage guided fuzzing of neural networks

Implementation of Auto-Conditioned Recurrent Networks for Extended Complex Human Motion Synthesis

Multi-modal co-attention for drug-target interaction annotation and Its Application to SARS-CoV-2

Code for Private Recommender Systems: How Can Users Build Their Own Fair Recommender Systems without Log Data? (SDM 2022)

This reporistory contains the test-dev data of the paper "xGQA: Cross-lingual Visual Question Answering".

Code in conjunction with the publication 'Contrastive Representation Learning for Hand Shape Estimation'

TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.

PyTorch Implementation of PIXOR: Real-time 3D Object Detection from Point Clouds

In-Place Activated BatchNorm for Memory-Optimized Training of DNNs

Pytorch implementation of SenFormer: Efficient Self-Ensemble Framework for Semantic Segmentation