[ICCV 2021 Oral] NerfingMVS: Guided Optimization of Neural Radiance Fields for Indoor Multi-view Stereo

Last update: Dec 24, 2022

Overview

NerfingMVS

Project Page | Paper | Video | Data

NerfingMVS: Guided Optimization of Neural Radiance Fields for Indoor Multi-view Stereo
Yi Wei, Shaohui Liu, Yongming Rao, Wang Zhao, Jiwen Lu, Jie Zhou
ICCV 2021 (Oral Presentation)

Installation

Clone the NerfingMVS repo.

git clone --recursive git@github.com:weiyithu/NerfingMVS.git

Install Python packages with Anaconda.

conda create -n NerfingMVS python=3.7
conda activate NerfingMVS
conda install pytorch==1.7.1 torchvision==0.8.2 torchaudio==0.7.2 -c pytorch
pip install -r requirements.txt

We use COLMAP to compute camera poses and sparse depths. However, the original COLMAP does not produce a fusion mask for each view, so we add masks to COLMAP and include the modified version as a submodule. Please follow https://colmap.github.io/install.html to install COLMAP in the ./colmap folder.

Usage

Download the 8 ScanNet scenes used in the paper here and put them under the ./data folder. We also provide the final results and checkpoints of each scene here.
Run NerfingMVS
```
sh run.sh $scene_name
```
The whole procedure takes about 3.5 hours on one NVIDIA GeForce RTX 2080 GPU, including COLMAP, depth prior training, NeRF training, filtering and evaluation. COLMAP can be accelerated with multiple GPUs. You will get per-view depth maps in ./logs/$scene_name/filter. Note that these depth maps have been aligned with the COLMAP poses. COLMAP results are saved in ./data/$scene_name, while everything else is kept in ./logs/$scene_name.
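
Since each scene takes a few hours, it may be convenient to queue several scenes in one go. The following is a minimal sketch, not part of the repo: it assumes every sub-folder of ./data is a scene and that run.sh is called from the repo root. The helper names `build_commands` and `run_all` are hypothetical.

```python
import subprocess
from pathlib import Path


def build_commands(data_root="data", script="run.sh"):
    """Return one `sh <script> <scene>` command per scene folder.

    Assumption: every directory directly under data_root is a scene.
    """
    root = Path(data_root)
    scenes = sorted(p.name for p in root.iterdir() if p.is_dir())
    return [["sh", script, scene] for scene in scenes]


def run_all(data_root="data", script="run.sh", dry_run=True):
    """Print each command and, unless dry_run is set, run it sequentially."""
    for cmd in build_commands(data_root, script):
        print(" ".join(cmd))
        if not dry_run:
            # check=True stops the queue at the first failing scene.
            subprocess.run(cmd, check=True)
```

Calling `run_all()` only prints the commands; `run_all(dry_run=False)` actually launches them one after another.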

Run on Your Own Data!

Place your data with the following structure:
```
NerfingMVS
|───data
|    |──────$scene_name
|    |   |   train.txt
|    |   |──────images
|    |   |    |    001.jpg
|    |   |    |    002.jpg
|    |   |    |    ...
|───configs
|    $scene_name.txt
|     ...
```
train.txt contains the names of all the images. Images can be named arbitrarily; '001.jpg' is just an example. You also need to create a config file in ./configs, modeled on the configs of the ScanNet scenes. Note that the factor parameter controls the resolution of the output depth maps. You should also adjust depth_N_iters, depth_H, and depth_W in options.py accordingly.
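
If the image folder is large, train.txt can be generated rather than written by hand. The sketch below is not part of the repo and rests on two assumptions: that train.txt lists one file name per line (including the extension), and that images use one of the suffixes in `IMAGE_SUFFIXES`. Compare the result against the train.txt of a downloaded ScanNet scene before relying on it. The function name `write_train_list` is hypothetical.

```python
from pathlib import Path

# Assumed set of image extensions; adjust to match your data.
IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png"}


def write_train_list(scene_dir):
    """Write <scene_dir>/train.txt from the files in <scene_dir>/images.

    Assumption: one image file name per line, sorted alphabetically.
    Returns the list of names that were written.
    """
    scene = Path(scene_dir)
    names = sorted(
        p.name
        for p in (scene / "images").iterdir()
        if p.is_file() and p.suffix.lower() in IMAGE_SUFFIXES
    )
    (scene / "train.txt").write_text("\n".join(names) + "\n")
    return names
```

For example, `write_train_list("data/my_scene")` would write data/my_scene/train.txt, where `my_scene` stands in for your $scene_name.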
Run NerfingMVS without evaluation
```
sh demo.sh $scene_name
```
Since our work currently relies on COLMAP, the results are dependent on the quality of the acquired poses and sparse reconstruction from COLMAP.
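
Because COLMAP is the first and slowest-to-fail stage, it can be worth checking the folder layout described above before launching demo.sh. This is a minimal sketch under assumptions, not part of the repo: it assumes train.txt holds whitespace-separated file names, and the function name `check_layout` is hypothetical.

```python
from pathlib import Path


def check_layout(repo_root, scene_name):
    """Return a list of problems with the scene layout (empty if none found).

    Checks for data/<scene>/images, data/<scene>/train.txt and
    configs/<scene>.txt, and that every name listed in train.txt
    exists in the images folder.
    """
    root = Path(repo_root)
    scene = root / "data" / scene_name
    images = scene / "images"
    problems = []

    if not images.is_dir():
        problems.append(f"missing folder: {images}")
    config = root / "configs" / f"{scene_name}.txt"
    if not config.is_file():
        problems.append(f"missing config: {config}")

    train = scene / "train.txt"
    if not train.is_file():
        problems.append(f"missing file: {train}")
    else:
        # Assumption: names are separated by whitespace or newlines.
        for name in train.read_text().split():
            if not (images / name).is_file():
                problems.append(f"listed in train.txt but not found: {name}")
    return problems
```

An empty return value only means the files are in place; it says nothing about whether COLMAP will recover good poses from them.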

Acknowledgement

Our code is based on the PyTorch implementation of NeRF: NeRF-pytorch. We also refer to Mannequin Challenge.

Citation

If you find our work useful in your research, please consider citing:

@inproceedings{wei2021nerfingmvs,
  author    = {Wei, Yi and Liu, Shaohui and Rao, Yongming and Zhao, Wang and Lu, Jiwen and Zhou, Jie},
  title     = {NerfingMVS: Guided Optimization of Neural Radiance Fields for Indoor Multi-view Stereo},
  booktitle = {ICCV},
  year = {2021}
}
