Mix3D: Out-of-Context Data Augmentation for 3D Scenes (3DV 2021)

Last update: Dec 26, 2022

Overview

Alexey Nekrasov*, Jonas Schult*, Or Litany, Bastian Leibe, Francis Engelmann

Mix3D is a data augmentation technique for 3D segmentation methods that improves generalization.

[Project Webpage] [arXiv]

News

12. October 2021: Code released.
6. October 2021: Mix3D accepted for oral presentation at 3DV 2021. Paper on [arXiv].
30. July 2021: Mix3D ranks 1st on the ScanNet semantic labeling benchmark.

Running the code

This repository contains the code for the analysis experiments of Section 4.2 (Motivation and Analysis Experiments) of the paper. For the ScanNet benchmark and Table 1 (main paper), we use the original SpatioTemporalSegmentation-Scannet code. To add Mix3D to the original MinkowskiNet codebase, we provide the patch file SpatioTemporalSegmentation.patch. See the supplementary material for more details.

Code structure

├── mix3d
│   ├── __init__.py
│   ├── __main__.py     <- the main file
│   ├── conf            <- hydra configuration files
│   ├── datasets
│   │   ├── outdoor_semseg.py       <- outdoor dataset
│   │   ├── preprocessing       <- folder with preprocessing scripts
│   │   ├── semseg.py       <- indoor dataset
│   │   └── utils.py        <- code for mixing point clouds
│   ├── logger
│   ├── models      <- MinkowskiNet models
│   ├── trainer
│   │   ├── __init__.py
│   │   └── trainer.py      <- train loop
│   └── utils
├── data
│   ├── processed       <- folder for preprocessed datasets
│   └── raw     <- folder for raw datasets
├── scripts
│   ├── experiments
│   │   └── 1000_scene_merging.bash
│   ├── init.bash
│   ├── local_run.bash
│   ├── preprocess_matterport.bash
│   ├── preprocess_rio.bash
│   ├── preprocess_scannet.bash
│   └── preprocess_semantic_kitti.bash
├── docs
├── dvc.lock
├── dvc.yaml        <- dvc file to reproduce the data
├── poetry.lock
├── pyproject.toml      <- project dependencies
├── README.md
├── saved       <- folder that stores models and logs
└── SpatioTemporalSegmentation-ScanNet.patch        <- patch file for original repo

Dependencies

The main dependencies of the project are the following:

python: 3.7
cuda: 10.1

All other dependencies are managed with poetry and can be installed with a single command:

poetry install

Check scripts/init.bash for more details.

Data preprocessing

After the dependencies are installed, run the preprocessing scripts. They convert the scannet, matterport, rio, and semantic_kitti datasets to a single format. By default, the scripts expect to find the datasets in the data/raw/ folder. Check scripts/preprocess_*.bash for more details.

dvc repro scannet # matterport, rio, semantic_kitti

This command runs the preprocessing for scannet and saves the result using the dvc data versioning system. Replace scannet with matterport, rio, or semantic_kitti to preprocess the other datasets.

Training and testing

Train MinkowskiNet on the scannet dataset without Mix3D with a voxel size of 5cm:

poetry run train
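For intuition about what a voxel size of 5cm means, here is a minimal sketch of voxelization in plain Python. It is not the code used in this repository, which relies on MinkowskiNet models for sparse voxel grids. The function name voxelize and the "first point wins" rule for labels are assumptions made for illustration only.

```python
import math
from typing import Dict, Sequence, Tuple

Point = Tuple[float, float, float]
Voxel = Tuple[int, int, int]


def voxelize(
    points: Sequence[Point],
    labels: Sequence[int],
    voxel_size: float = 0.05,  # 5cm, with coordinates assumed to be in meters
) -> Dict[Voxel, int]:
    """Map each point to an integer voxel index and keep one label per voxel.

    Hypothetical helper: the first point that falls into a voxel decides
    the label of that voxel; later points in the same voxel are ignored.
    """
    if len(points) != len(labels):
        raise ValueError("points and labels must have the same length")
    voxels: Dict[Voxel, int] = {}
    for (x, y, z), label in zip(points, labels):
        key = (
            math.floor(x / voxel_size),
            math.floor(y / voxel_size),
            math.floor(z / voxel_size),
        )
        voxels.setdefault(key, label)
    return voxels


# Two points share the voxel at the origin, one lies further away,
# and one has a negative coordinate.
example = voxelize(
    [(0.01, 0.01, 0.01), (0.04, 0.02, 0.03), (0.12, 0.0, 0.26), (-0.01, 0.0, 0.0)],
    [7, 8, 9, 3],
)
```

With a larger voxel_size, more points collapse into the same voxel, which reduces memory use at the cost of geometric detail.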

Train MinkowskiNet on the scannet dataset with Mix3D with a voxel size of 5cm:

poetry run train data/collation_functions=voxelize_collate_merge
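As a rough illustration of what mixing two scenes involves, the sketch below takes the union of two labelled point clouds, assuming each scene is first centered at the origin so that the two overlap. It is not the code from mix3d/datasets/utils.py: the names center and mix_scenes are hypothetical, points are plain tuples rather than tensors, and the other augmentations applied during training are left out.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float, float]


def center(points: Sequence[Point]) -> List[Point]:
    """Shift a point cloud so that its centroid sits at the origin."""
    n = len(points)
    if n == 0:
        return []
    cx = sum(p[0] for p in points) / n
    cy = sum(p[1] for p in points) / n
    cz = sum(p[2] for p in points) / n
    return [(x - cx, y - cy, z - cz) for x, y, z in points]


def mix_scenes(
    points_a: Sequence[Point],
    labels_a: Sequence[int],
    points_b: Sequence[Point],
    labels_b: Sequence[int],
) -> Tuple[List[Point], List[int]]:
    """Hypothetical helper: union of two centered scenes and their labels."""
    if len(points_a) != len(labels_a) or len(points_b) != len(labels_b):
        raise ValueError("each scene needs exactly one label per point")
    mixed_points = center(points_a) + center(points_b)
    mixed_labels = list(labels_a) + list(labels_b)
    return mixed_points, mixed_labels


# Two tiny "scenes" that are far apart before mixing.
scene_a = [(0.0, 0.0, 0.0), (2.0, 0.0, 0.0)]
scene_b = [(10.0, 10.0, 0.0), (10.0, 12.0, 0.0)]
mixed_points, mixed_labels = mix_scenes(scene_a, [1, 1], scene_b, [2, 2])
print(mixed_labels)  # prints [1, 1, 2, 2]
```

The mixed scene keeps every point and label from both inputs, so a segmentation network trained on it sees objects outside their usual surroundings.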

BibTeX

@inproceedings{Nekrasov213DV,
  title     = {{Mix3D: Out-of-Context Data Augmentation for 3D Scenes}},
  author    = {Nekrasov, Alexey and Schult, Jonas and Litany, Or and Leibe, Bastian and Engelmann, Francis},
  booktitle = {{International Conference on 3D Vision (3DV)}},
  year      = {2021}
}
