OcclusionFusion: realtime dynamic 3D reconstruction based on single-view RGB-D

Last update: Dec 15, 2022

Overview

OcclusionFusion (CVPR'2022)

Project Page | Paper | Video

Overview

This repository contains the code for the CVPR 2022 paper OcclusionFusion, where we introduce a novel method to calculate occlusion-aware 3D motion to guide dynamic 3D reconstruction.

In our technique, the motion of visible regions is first estimated and combined with temporal information to infer the motion of the occluded regions through an LSTM-involved graph neural network.

Currently, we provide a pretrained model and a demo. Code for data pre-processing, network training and evaluation will be available soon.

Setup

We use python 3.8.10, pytorch-1.8.0 and pytorch-geometric-1.7.2.

conda create -n occlusionfu python==3.8.10
conda activate occlusionfu
pip install -r requirements.txt
conda install pytorch==1.8.0 torchvision==0.9.0 torchaudio==0.8.0 cudatoolkit=10.2 -c pytorch
pip install torch-scatter==2.0.8 -f https://pytorch-geometric.com/whl/torch-1.8.0+cu102.html
pip install torch-sparse==0.6.12 -f https://pytorch-geometric.com/whl/torch-1.8.0+cu102.html
pip install torch-cluster==1.5.9 -f https://pytorch-geometric.com/whl/torch-1.8.0+cu102.html
pip install torch-spline-conv==1.2.1 -f https://pytorch-geometric.com/whl/torch-1.8.0+cu102.html
pip install torch-geometric==1.7.2

Running the demo

Run the demo with the pretrained model and prepared inputs:

python demo.py

Visualize the input and output:

python visualize.py

The defualt setting of visualize.py will render the network's input and output to a video as follow. You can also change the setting to view the network's input and output with Open3D viewer.

Citation

If you find our work useful in your research, please consider citing:

@inproceedings{lin2022occlusionfusion,
    title={OcclusionFusion: Occlusion-aware Motion Estimation for Real-time Dynamic 3D Reconstruction}, 
    author={Wenbin Lin, Chengwei Zheng, Jun-Hai Yong, Feng Xu}, 
    journal={Conference on Computer Vision and Pattern Recognition (CVPR)}, 
    year={2022}
}

OcclusionFusion: realtime dynamic 3D reconstruction based on single-view RGB-D

Related tags

Overview

OcclusionFusion (CVPR'2022)

Project Page | Paper | Video

Overview

Setup

Running the demo

Citation

Owner

Wenbin Lin

A repository for benchmarking neural vocoders by their quality and speed.

💊 A 3D Generative Model for Structure-Based Drug Design (NeurIPS 2021)

📚 A collection of all the Deep Learning Metrics that I came across which are not accuracy/loss.

Pytorch implementation of Learning Rate Dropout.

Python parser for DTED data.

Code to reproduce the results for Compositional Attention

A platform to display the carbon neutralization information for researchers, decision-makers, and other participants in the community.

Controlling a game using mediapipe hand tracking

Deploy a ML inference service on a budget in less than 10 lines of code.

Bayesian regularization for functional graphical models.

PyoMyo - Python Opensource Myo library

A cross-document event and entity coreference resolution system, trained and evaluated on the ECB+ corpus.

Categorical Depth Distribution Network for Monocular 3D Object Detection

PyTorch evaluation code for Delving Deep into the Generalization of Vision Transformers under Distribution Shifts.

2021 credit card consuming recommendation

Simulation of Self Driving Car

Code for One-shot Talking Face Generation from Single-speaker Audio-Visual Correlation Learning (AAAI 2022)

SASM - simple crossplatform IDE for NASM, MASM, GAS and FASM assembly languages

A repository for storing njxzc final exam review material

We propose a new method for effective shadow removal by regarding it as an exposure fusion problem.