A planar RGB-D SLAM system that utilizes Manhattan World structure to provide an optimal camera pose trajectory, a sparse reconstruction containing points, lines and planes, and a dense surfel-based reconstruction.

ManhattanSLAM

Authors: Raza Yunus, Yanyan Li and Federico Tombari

ManhattanSLAM is a real-time SLAM library for RGB-D cameras that computes the camera pose trajectory, a sparse 3D reconstruction (containing point, line and plane features) and a dense surfel-based 3D reconstruction. Further details can be found in the related publication. The code is based on ORB-SLAM2.

Related Publication:

Raza Yunus, Yanyan Li and Federico Tombari, ManhattanSLAM: Robust Planar Tracking and Mapping Leveraging Mixture of Manhattan Frames, in 2021 IEEE International Conference on Robotics and Automation (ICRA). PDF.

1. License

ManhattanSLAM is released under a GPLv3 license. For a list of all code/library dependencies (and associated licenses), please see Dependencies.md.

If you use ManhattanSLAM in an academic work, please cite:

@inproceedings{yunus2021manhattanslam,
    author = {Yunus, Raza and Li, Yanyan and Tombari, Federico},
    title = {ManhattanSLAM: Robust Planar Tracking and Mapping Leveraging Mixture of Manhattan Frames},
    year = {2021},
    booktitle = {2021 IEEE International Conference on Robotics and Automation (ICRA)},
}

2. Prerequisites

We have tested the library on Ubuntu 16.04, but it should be easy to compile on other platforms. A powerful computer (e.g. i7) will ensure real-time performance and provide more stable and accurate results. The following are the dependencies of ManhattanSLAM, along with the versions we tested (a sketch of one possible installation route follows the list):

  • OpenCV: 3.3.0
  • PCL: 1.7.2
  • Eigen3: 3.3
  • DBoW2: Included in Thirdparty folder
  • g2o: Included in Thirdparty folder
  • Pangolin
  • tinyply
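
The exact installation route depends on your platform. On Ubuntu, a minimal sketch might look like the following; the apt package names are assumptions, the packaged versions may differ from the ones listed above (in which case build the required versions from source), and Pangolin and tinyply are typically built and installed from source with CMake:

sudo apt-get install libopencv-dev libpcl-dev libeigen3-dev   # packaged versions may not match those tested
git clone https://github.com/stevenlovegrove/Pangolin.git     # build and install with CMake per its README
git clone https://github.com/ddiakopoulos/tinyply.git         # build and install with CMake per its README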

3. Building and testing

Clone the repository:

git clone https://github.com/razayunus/ManhattanSLAM

There is a script build.sh to build the Thirdparty libraries and ManhattanSLAM. Please make sure you have installed all required dependencies (see section 2). Execute:

cd ManhattanSLAM
chmod +x build.sh
./build.sh

This will create libManhattanSLAM.so in the lib folder and the executable manhattan_slam in the Example folder.
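
If build.sh fails partway, it can help to run the equivalent steps manually. The following is a rough sketch of what such a script typically does in ORB-SLAM2-derived projects; directory names and the vocabulary archive name are assumptions, so check build.sh itself for the authoritative steps:

# Build the bundled third-party libraries (assumed layout)
cd Thirdparty/DBoW2 && mkdir -p build && cd build && cmake .. -DCMAKE_BUILD_TYPE=Release && make -j4
cd ../../g2o && mkdir -p build && cd build && cmake .. -DCMAKE_BUILD_TYPE=Release && make -j4
cd ../../..

# Uncompress the ORB vocabulary (archive name assumed)
cd Vocabulary && tar -xf ORBvoc.txt.tar.gz && cd ..

# Build ManhattanSLAM itself
mkdir -p build && cd build && cmake .. -DCMAKE_BUILD_TYPE=Release && make -j4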

To test the system:

  1. Download a sequence from one of the supported datasets (TUM RGB-D, ICL-NUIM or TAMU RGB-D) and uncompress it.

  2. Associate RGB images and depth images using the python script associate.py. You can generate an associations file by executing:

python associate.py PATH_TO_SEQUENCE/rgb.txt PATH_TO_SEQUENCE/depth.txt > associations.txt
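
Each line of the resulting associations.txt pairs an RGB timestamp and image path with the nearest depth timestamp and image path, for example (values are illustrative):

1305031102.175304 rgb/1305031102.175304.png 1305031102.160407 depth/1305031102.160407.png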
  3. Execute the following command. Change Config.yaml to ICL.yaml for ICL-NUIM sequences, TAMU.yaml for TAMU RGB-D sequences, or TUM1.yaml, TUM2.yaml or TUM3.yaml for freiburg1, freiburg2 and freiburg3 sequences of TUM RGB-D, respectively. Change PATH_TO_SEQUENCE_FOLDER to the uncompressed sequence folder. Change ASSOCIATIONS_FILE to the path of the corresponding associations file.
./Example/manhattan_slam Vocabulary/ORBvoc.txt Example/Config.yaml PATH_TO_SEQUENCE_FOLDER ASSOCIATIONS_FILE
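
For example, for a freiburg3 sequence of TUM RGB-D (the sequence and associations file paths below are illustrative):

./Example/manhattan_slam Vocabulary/ORBvoc.txt Example/TUM3.yaml ~/data/rgbd_dataset_freiburg3_long_office_household ~/data/rgbd_dataset_freiburg3_long_office_household/associations.txt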