Pytorch Lightning Implementation of SC-Depth Methods.

Last update: Dec 30, 2022

Related tags

Overview

SC_Depth_pl:

This is a pytorch lightning implementation of SC-Depth (V1, V2) for self-supervised learning of monocular depth from video.

In the V1 (IJCV 2021 & NeurIPS 2019), we propose (i) geometry consistency loss for scale-consistent depth prediction over video and (ii) self-discovered mask for detecting and removing dynamic regions during training towards higher accuracy. We also validate the predicted depth in the Visual SLAM scenario.

In the V2 (TPMAI 2022), we propose auto-recitify network (ARN) to remove relative image rotation in hand-held camera captured videos, e.g., some indoor datasets. We show that the proposed ARN, which is self-supervised trained in an end-to-end fashion, greatly eases the training and significantly boosts the performance.

Install

conda create -n sc_depth_env python=3.6
conda activate sc_depth_env
conda install pytorch==1.6.0 torchvision==0.7.0 cudatoolkit=10.2 -c pytorch
pip install -r requirements.txt

Dataset

We preprocess all existing video datasets to the following general video format for training and testing:

Dataset
  -Training
    --Scene0000
      ---*.jpg (list of images)
      ---cam.txt (3x3 intrinsic)
      ---depth (a folder containing gt depths, optional for validation)
    --Scene0001
    ...
    train.txt (containing training scene names)
    val.txt (containing validation scene names)
  -Testing
    --color (containg testing images)
    --depth (containg ground truth depths)

You can convert it by yourself (on your own video data) or download our pre-processed standard datasets:

[kitti_raw] [nyu]

Training

We provide "scripts/run_train.sh", which shows how to train on kitti and nyu.

Testing

We provide "scripts/run_test.sh", which shows how test on kitti and nyu.

Inference

We provide "scripts/run_inference.sh", which shows how to save depths (.npy) and visualization results (.jpg).

Pretrained models

We provide pretrained models on kitti and nyu datasets. You need to uncompress it and put it into "ckpt" folder. If you run the "scripts/run_test.sh" with the pretrained model (fix the path before running), you should get the following results:

[kitti_scv1_model]:

Models	Abs Rel	Sq Rel	Log10	RMSE	RMSE(log)	Acc.1	Acc.2	Acc.3
resnet18	0.119	0.878	0.053	4.987	0.196	0.859	0.956	0.981

[nyu_scv2_model]:

Models	Abs Rel	Sq Rel	Log10	RMSE	RMSE(log)	Acc.1	Acc.2	Acc.3
resnet18	0.142	0.112	0.061	0.554	0.186	0.808	0.951	0.987

References

SC-DepthV1:

Unsupervised Scale-consistent Depth Learning from Video (IJCV 2021)
Jia-Wang Bian, Huangying Zhan, Naiyan Wang, Zhichao Li, Le Zhang, Chunhua Shen, Ming-Ming Cheng, Ian Reid [paper]

@article{bian2021ijcv, 
  title={Unsupervised Scale-consistent Depth Learning from Video}, 
  author={Bian, Jia-Wang and Zhan, Huangying and Wang, Naiyan and Li, Zhichao and Zhang, Le and Shen, Chunhua and Cheng, Ming-Ming and Reid, Ian}, 
  journal= {International Journal of Computer Vision (IJCV)}, 
  year={2021} 
}

which is an extension of previous conference version: Unsupervised Scale-consistent Depth and Ego-motion Learning from Monocular Video (NeurIPS 2019)
Jia-Wang Bian, Zhichao Li, Naiyan Wang, Huangying Zhan, Chunhua Shen, Ming-Ming Cheng, Ian Reid [paper]

@inproceedings{bian2019neurips,
  title={Unsupervised Scale-consistent Depth and Ego-motion Learning from Monocular Video},
  author={Bian, Jiawang and Li, Zhichao and Wang, Naiyan and Zhan, Huangying and Shen, Chunhua and Cheng, Ming-Ming and Reid, Ian},
  booktitle = {Advances in Neural Information Processing Systems (NeurIPS)},
  year={2019}
}

SC-DepthV2:

Auto-Rectify Network for Unsupervised Indoor Depth Estimation (TPAMI 2022)
Jia-Wang Bian, Huangying Zhan, Naiyan Wang, Tat-Jun Chin, Chunhua Shen, Ian Reid [paper]

@article{bian2021tpami, 
  title={Auto-Rectify Network for Unsupervised Indoor Depth Estimation}, 
  author={Bian, Jia-Wang and Zhan, Huangying and Wang, Naiyan and Chin, Tat-Jin and Shen, Chunhua and Reid, Ian}, 
  journal= {IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)}, 
  year={2021} 
}

Pytorch Lightning Implementation of SC-Depth Methods.

Related tags

Overview

SC_Depth_pl:

Install

Dataset

Training

Testing

Inference

Pretrained models

References

SC-DepthV1:

SC-DepthV2:

Owner

JiaWang Bian

Interactive Terraform visualization. State and configuration explorer.

Code release for ICCV 2021 paper "Anticipative Video Transformer"

Towards the D-Optimal Online Experiment Design for Recommender Selection (KDD 2021)

PyTorch implementations of deep reinforcement learning algorithms and environments

[TIP2020] Adaptive Graph Representation Learning for Video Person Re-identification

This is the codebase for Diffusion Models Beat GANS on Image Synthesis.

This is a GUI interface which can process forest fire detection, smoke detection and fire segmentation

ONNX-GLPDepth - Python scripts for performing monocular depth estimation using the GLPDepth model in ONNX

EMNLP 2021: Single-dataset Experts for Multi-dataset Question-Answering

A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.

LIMEcraft: Handcrafted superpixel selectionand inspection for Visual eXplanations

A Context-aware Visual Attention-based training pipeline for Object Detection from a Webpage screenshot!

Source code, datasets and trained models for the paper Learning Advanced Mathematical Computations from Examples (ICLR 2021), by François Charton, Amaury Hayat (ENPC-Rutgers) and Guillaume Lample

A Flow-based Generative Network for Speech Synthesis

Hl classification bc - A Network-Based High-Level Data Classification Algorithm Using Betweenness Centrality

Joint Gaussian Graphical Model Estimation: A Survey

Code and datasets for the paper "KnowPrompt: Knowledge-aware Prompt-tuning with Synergistic Optimization for Relation Extraction"

An implementation of the AdaOPS (Adaptive Online Packing-based Search), which is an online POMDP Solver used to solve problems defined with the POMDPs.jl generative interface.

《Geo Word Clouds》paper implementation