3DMV jointly combines RGB color and geometric information to perform 3D semantic segmentation of RGB-D scans.

Last update: Feb 06, 2022

Overview

3DMV

3DMV jointly combines RGB color and geometric information to perform 3D semantic segmentation of RGB-D scans. This work is based on our ECCV'18 paper, 3DMV: Joint 3D-Multi-View Prediction for 3D Semantic Scene Segmentation.

Code

Installation:

Training is implemented with PyTorch. This code was developed under PyTorch 0.2 and recently upgraded to PyTorch 0.4.

Training:

See python train.py --help for all train options. Example train call:

python train.py --gpu 0 --train_data_list [path to list of train files] --data_path_2d [path to 2d image data] --class_weight_file [path to txt file of train histogram] --num_nearest_images 5 --model2d_path [path to pretrained 2d model]

Trained models: models.zip

Testing

See python test.py --help for all test options. Example test call:

python test.py --gpu 0 --scene_list [path to list of test scenes] --model_path [path to trained model.pth] --data_path_2d [path to 2d image data] --data_path_3d [path to test scene data] --num_nearest_images 5 --model2d_orig_path [path to pretrained 2d model]

Data:

This data has been precomputed from the ScanNet (v2) dataset.

Train data for ScanNet v2: 3dmv_scannet_v2_train.zip (6.2G)

2D train images can be processed from the ScanNet dataset using the 2d data preparation script in prepare_data
Expected file structure for 2D data:

scene0000_00/
|--color/
   |--[framenum].jpg
       ⋮
|--depth/
   |--[framenum].png   (16-bit pngs)
       ⋮
|--pose/
   |--[framenum].txt   (4x4 rigid transform as txt file)
       ⋮
|--label/    (if applicable)
   |--[framenum].png   (8-bit pngs)
       ⋮
scene0000_01/
⋮

Test scenes for ScanNet v2: 3dmv_scannet_v2_test_scenes.zip (110M)

Citation:

If you find our work useful in your research, please consider citing:

@inproceedings{dai20183dmv,
 author = {Dai, Angela and Nie{\ss}ner, Matthias},
 booktitle = {Proceedings of the European Conference on Computer Vision ({ECCV})},
 title = {3DMV: Joint 3D-Multi-View Prediction for 3D Semantic Scene Segmentation},
 year = {2018}
}

Contact:

If you have any questions, please email Angela Dai at [email protected].

3DMV jointly combines RGB color and geometric information to perform 3D semantic segmentation of RGB-D scans.

Related tags

Overview

3DMV

Code

Installation:

Training:

Testing

Data:

Citation:

Contact:

Owner

Владислав Молодцов

Learning from Synthetic Shadows for Shadow Detection and Removal [Inoue+, IEEE TCSVT 2020].

DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generative Transformers

Constrained Logistic Regression - How to apply specific constraints to logistic regression's coefficients

A annotation of yolov5-5.0

A Python Package for Convex Regression and Frontier Estimation

MT-GAN-PyTorch - PyTorch Implementation of Learning to Transfer: Unsupervised Domain Translation via Meta-Learning

Pixel Consensus Voting for Panoptic Segmentation (CVPR 2020)

🎁 3,000,000+ Unsplash images made available for research and machine learning

Repository for the paper "From global to local MDI variable importances for random forests and when they are Shapley values"

Vision Transformer for 3D medical image registration (Pytorch).

Code release for "BoxeR: Box-Attention for 2D and 3D Transformers"

Physics-Informed Neural Networks (PINN) and Deep BSDE Solvers of Differential Equations for Scientific Machine Learning (SciML) accelerated simulation

Can we do Customers Segmentation using PHP and Unsupervized Machine Learning ? Yes we can ! 🤡

NeurIPS 2021 paper 'Representation Learning on Spatial Networks' code

CoTr: Efficiently Bridging CNN and Transformer for 3D Medical Image Segmentation

Styled Augmented Translation

The source code of CVPR17 'Generative Face Completion'.

Demonstration of the Model Training as a CI/CD System in Vertex AI

ParmeSan: Sanitizer-guided Greybox Fuzzing

Supporting code for short YouTube series Neural Networks Demystified.