Repository for "Toward Practical Monocular Indoor Depth Estimation" (CVPR 2022)

Last update: Dec 13, 2022

Related tags

Deep Learning DistDepth

Overview

Toward Practical Monocular Indoor Depth Estimation

Cho-Ying Wu, Jialiang Wang, Michael Hall, Ulrich Neumann, Shuochen Su

[arXiv] [project site]

DistDepth

Our DistDepth is a highly robust monocular depth estimation approach for generic indoor scenes.

Trained with stereo sequences without their groundtruth depth
Structured and metric-accurate
Run in an interactive rate with Laptop GPU
Sim-to-real: trained on simulation and becomes transferrable to real scenes

Single Image Inference Demo

We test on Ubuntu 20.04 LTS with an laptop NVIDIA 2080 GPU (only GPU mode is supported).

Install packages

Use conda

conda create --name distdepth python=3.8 conda activate distdepth
Install pre-requisite common packages. Go to https://pytorch.org/get-started/locally/ and install pytorch that is compatible to your computer. We test on pytorch v1.9.0 and cudatoolkit-11.1. (The codes should work under other v1.0+ versions)

conda install pytorch==1.9.0 torchvision==0.10.0 torchaudio==0.9.0 cudatoolkit=11.3 -c pytorch -c conda-forge
Install other dependencies: opencv-python and matplotlib.

pip install opencv-python, matplotlib

Download pretrained models

Download pretrained models [here] (ResNet152, 246MB).
Move the downloaded item under this folder, and then unzip it. You should be able to see a new folder 'ckpts' that contains the pretrained models.
Run

python demo.py
Results will be stored under results/

Data

Download SimSIN [here]. For UniSIN and VA, please download at the [project site].

Depth-aware AR effects

Virtual object insertion:

Dragging objects along a trajectory:

Citation

@inproceedings{wu2022toward,
title={Toward Practical Monocular Indoor Depth Estimation},
author={Wu, Cho-Ying and Wang, Jialiang and Hall, Michael and Neumann, Ulrich and Su, Shuochen},
booktitle={CVPR},
year={2022}
}

License

DistDepth is CC-BY-NC licensed, as found in the LICENSE file.

Repository for "Toward Practical Monocular Indoor Depth Estimation" (CVPR 2022)

Related tags

Overview

Toward Practical Monocular Indoor Depth Estimation

DistDepth

Single Image Inference Demo

Data

Depth-aware AR effects

Citation

License

Owner

Meta Research

A web porting for NVlabs' StyleGAN2, to facilitate exploring all kinds characteristic of StyleGAN networks

The world's largest toxicity dataset.

This is code of book "Learn Deep Learning with PyTorch"

This is a vision-based 3d model manipulation and control UI

The official PyTorch implementation of recent paper - SAINT: Improved Neural Networks for Tabular Data via Row Attention and Contrastive Pre-Training

Unsupervised Feature Ranking via Attribute Networks.

Code for Estimating Multi-cause Treatment Effects via Single-cause Perturbation (NeurIPS 2021)

PyTorch implementation of MSBG hearing loss model and MBSTOI intelligibility metric

Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.

Out-of-Town Recommendation with Travel Intention Modeling (AAAI2021)

FinRL-Meta: A Universe for Data-Driven Financial Reinforcement Learning. 🔥

SHRIMP: Sparser Random Feature Models via Iterative Magnitude Pruning

A Pytorch loader for MVTecAD dataset.

Custom Implementation of Non-Deep Networks

A lossless neural compression framework built on top of JAX.

An algorithm study of the 6th iOS 10 set of Boost Camp Web Mobile

CDGAN: Cyclic Discriminative Generative Adversarial Networks for Image-to-Image Transformation

Compartmental epidemic model to assess undocumented infections: applications to SARS-CoV-2 epidemics in Brazil - Datasets and Codes

Pytorch implementation of ICASSP 2022 paper Attention Probe: Vision Transformer Distillation in the Wild

Two-stage CenterNet

Repository for "Toward Practical Monocular Indoor Depth Estimation" (CVPR 2022)

Related tags

Overview

Toward Practical Monocular Indoor Depth Estimation

DistDepth

Single Image Inference Demo

Data

Depth-aware AR effects

Citation

License

Owner

Meta Research

A web porting for NVlabs' StyleGAN2, to facilitate exploring all kinds characteristic of StyleGAN networks

The world's largest toxicity dataset.

This is code of book "Learn Deep Learning with PyTorch"

This is a vision-based 3d model manipulation and control UI

The official PyTorch implementation of recent paper - SAINT: Improved Neural Networks for Tabular Data via Row Attention and Contrastive Pre-Training

Unsupervised Feature Ranking via Attribute Networks.

Code for Estimating Multi-cause Treatment Effects via Single-cause Perturbation (NeurIPS 2021)

PyTorch implementation of MSBG hearing loss model and MBSTOI intelligibility metric

Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.

Out-of-Town Recommendation with Travel Intention Modeling (AAAI2021)

FinRL­-Meta: A Universe for Data­-Driven Financial Reinforcement Learning. 🔥

SHRIMP: Sparser Random Feature Models via Iterative Magnitude Pruning

A Pytorch loader for MVTecAD dataset.

Custom Implementation of Non-Deep Networks

A lossless neural compression framework built on top of JAX.

An algorithm study of the 6th iOS 10 set of Boost Camp Web Mobile

CDGAN: Cyclic Discriminative Generative Adversarial Networks for Image-to-Image Transformation

Compartmental epidemic model to assess undocumented infections: applications to SARS-CoV-2 epidemics in Brazil - Datasets and Codes

Pytorch implementation of ICASSP 2022 paper Attention Probe: Vision Transformer Distillation in the Wild

Two-stage CenterNet

FinRL-Meta: A Universe for Data-Driven Financial Reinforcement Learning. 🔥