Global-Local Path Networks for Monocular Depth Estimation with Vertical CutDepth [Paper]

Last update: Jan 01, 2023

Related tags

Deep Learning GLPDepth

Overview

Global-Local Path Networks for Monocular Depth Estimation with Vertical CutDepth [Paper]

Downloads

[Downloads] Trained ckpt files for NYU Depth V2 and KITTI
[Downloads] Predicted depth maps png files for NYU Depth V2 and KITTI Eigen split test set

Requirements

Tested on

python==3.7.7
torch==1.6.0
h5py==3.6.0
scipy==1.7.3
opencv-python==4.5.5
mmcv==1.4.3
timm=0.5.4
albumentations=1.1.0
tensorboardX==2.4.1

You can install above package with

$ pip install -r requirements.txt

Inference and Evaluate

Dataset

NYU Depth V2

$ cd ./datasets
$ wget http://horatio.cs.nyu.edu/mit/silberman/nyu_depth_v2/nyu_depth_v2_labeled.mat
$ python ../code/utils/extract_official_train_test_set_from_mat.py nyu_depth_v2_labeled.mat splits.mat ./nyu_depth_v2/official_splits/

KITTI

Download annotated depth maps data set (14GB) from [link] into ./datasets/kitti/data_depth_annotated

$ cd ./datasets/kitti/data_depth_annotated/
$ unzip data_depth_annotated.zip

With above two instrtuctions, you can perform eval_with_pngs.py/test.py for NYU Depth V2 and eval_with_pngs for KITTI.

To fully perform experiments, please follow [BTS] repository to obtain full dataset for NYU Depth V2 and KITTI datasets.

Your dataset directory should be

root
- nyu_depth_v2
  - bathroom_0001
  - bathroom_0002
  - ...
  - official_splits
- kitti
  - data_depth_annotated
  - raw_data
  - val_selection_cropped

Evaluation

Evaluate with png images

for NYU Depth V2

$ python ./code/eval_with_pngs.py --dataset nyudepthv2 --pred_path ./best_nyu_preds/ --gt_path ./datasets/nyu_depth_v2/ --max_depth_eval 10.0

for KITTI

$ python ./code/eval_with_pngs.py --dataset kitti --split eigen_benchmark --pred_path ./best_kitti_preds/ --gt_path ./datasets/kitti/ --max_depth_eval 80.0 --garg_crop

Evaluate with model (NYU Depth V2)

Result images will be saved in ./args.result_dir/args.exp_name (default: ./results/test)

To evaluate only

$ python ./code/test.py --dataset nyudepthv2 --data_path ./datasets/ --ckpt_dir 
       
         --do_evaluate  --max_depth 10.0 --max_depth_eval 10.0

To save pngs for eval_with_pngs

$ python ./code/test.py --dataset nyudepthv2 --data_path ./datasets/ --ckpt_dir 
       
         --save_eval_pngs  --max_depth 10.0 --max_depth_eval 10.0

To save visualized depth maps

$ python ./code/test.py --dataset nyudepthv2 --data_path ./datasets/ --ckpt_dir 
       
         --save_visualize  --max_depth 10.0 --max_depth_eval 10.0

In case of kitti, modify arguments to --dataset kitti --max_depth 80.0 --max_depth_eval 80.0 and add --kitti_crop [garg_crop or eigen_crop]

Inference

Inference with image directory

$ python ./code/test.py --dataset imagepath --data_path 
     
       --save_visualize

To-Do

Add inference
Add training codes
Add dockerHub link
Add colab

References

[1] From Big to Small: Multi-Scale Local Planar Guidance for Monocular Depth Estimation. [code]

[2] SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers. [code]

Global-Local Path Networks for Monocular Depth Estimation with Vertical CutDepth [Paper]

Related tags

Overview

Global-Local Path Networks for Monocular Depth Estimation with Vertical CutDepth [Paper]

Downloads

Requirements

Inference and Evaluate

Dataset

NYU Depth V2

KITTI

Evaluation

Inference

To-Do

References

Owner

Code for our ACL 2021 paper "One2Set: Generating Diverse Keyphrases as a Set"

Training DiffWave using variational method from Variational Diffusion Models.

SMD-Nets: Stereo Mixture Density Networks

Implementation of EMNLP 2017 Paper "Natural Language Does Not Emerge 'Naturally' in Multi-Agent Dialog" using PyTorch and ParlAI

Like a cowsay but without cows!

[NeurIPS 2021 Spotlight] Aligning Pretraining for Detection via Object-Level Contrastive Learning

Dynamic Graph Event Detection

Reference code for the paper CAMS: Color-Aware Multi-Style Transfer.

Generate images from texts. In Russian

Official repository for "On Generating Transferable Targeted Perturbations" (ICCV 2021)

TensorFlow Tutorials with YouTube Videos

Athena is the only tool that you will ever need to optimize your portfolio.

Training Confidence-Calibrated Classifier for Detecting Out-of-Distribution Samples / ICLR 2018

Code for "Learning the Best Pooling Strategy for Visual Semantic Embedding", CVPR 2021

CIFAR-10 Photo Classification

Generating Digital Painting Lighting Effects via RGB-space Geometry (SIGGRAPH2020/TOG2020)

LLVM-based compiler for LightGBM gradient-boosted trees. Speeds up prediction by ≥10x.

Code of our paper "Contrastive Object-level Pre-training with Spatial Noise Curriculum Learning"

SEJE Pytorch implementation

Implementation of ICCV21 paper: PnP-DETR: Towards Efficient Visual Analysis with Transformers