Global-Local Path Networks for Monocular Depth Estimation with Vertical CutDepth [Paper]

Last update: Jan 01, 2023

Related tags

Deep Learning GLPDepth

Overview

Global-Local Path Networks for Monocular Depth Estimation with Vertical CutDepth [Paper]

Downloads

[Downloads] Trained ckpt files for NYU Depth V2 and KITTI
[Downloads] Predicted depth maps png files for NYU Depth V2 and KITTI Eigen split test set

Requirements

Tested on

python==3.7.7
torch==1.6.0
h5py==3.6.0
scipy==1.7.3
opencv-python==4.5.5
mmcv==1.4.3
timm=0.5.4
albumentations=1.1.0
tensorboardX==2.4.1

You can install above package with

$ pip install -r requirements.txt

Inference and Evaluate

Dataset

NYU Depth V2

$ cd ./datasets
$ wget http://horatio.cs.nyu.edu/mit/silberman/nyu_depth_v2/nyu_depth_v2_labeled.mat
$ python ../code/utils/extract_official_train_test_set_from_mat.py nyu_depth_v2_labeled.mat splits.mat ./nyu_depth_v2/official_splits/

KITTI

Download annotated depth maps data set (14GB) from [link] into ./datasets/kitti/data_depth_annotated

$ cd ./datasets/kitti/data_depth_annotated/
$ unzip data_depth_annotated.zip

With above two instrtuctions, you can perform eval_with_pngs.py/test.py for NYU Depth V2 and eval_with_pngs for KITTI.

To fully perform experiments, please follow [BTS] repository to obtain full dataset for NYU Depth V2 and KITTI datasets.

Your dataset directory should be

root
- nyu_depth_v2
  - bathroom_0001
  - bathroom_0002
  - ...
  - official_splits
- kitti
  - data_depth_annotated
  - raw_data
  - val_selection_cropped

Evaluation

Evaluate with png images

for NYU Depth V2

$ python ./code/eval_with_pngs.py --dataset nyudepthv2 --pred_path ./best_nyu_preds/ --gt_path ./datasets/nyu_depth_v2/ --max_depth_eval 10.0

for KITTI

$ python ./code/eval_with_pngs.py --dataset kitti --split eigen_benchmark --pred_path ./best_kitti_preds/ --gt_path ./datasets/kitti/ --max_depth_eval 80.0 --garg_crop

Evaluate with model (NYU Depth V2)

Result images will be saved in ./args.result_dir/args.exp_name (default: ./results/test)

To evaluate only

$ python ./code/test.py --dataset nyudepthv2 --data_path ./datasets/ --ckpt_dir 
       
         --do_evaluate  --max_depth 10.0 --max_depth_eval 10.0

To save pngs for eval_with_pngs

$ python ./code/test.py --dataset nyudepthv2 --data_path ./datasets/ --ckpt_dir 
       
         --save_eval_pngs  --max_depth 10.0 --max_depth_eval 10.0

To save visualized depth maps

$ python ./code/test.py --dataset nyudepthv2 --data_path ./datasets/ --ckpt_dir 
       
         --save_visualize  --max_depth 10.0 --max_depth_eval 10.0

In case of kitti, modify arguments to --dataset kitti --max_depth 80.0 --max_depth_eval 80.0 and add --kitti_crop [garg_crop or eigen_crop]

Inference

Inference with image directory

$ python ./code/test.py --dataset imagepath --data_path 
     
       --save_visualize

To-Do

Add inference
Add training codes
Add dockerHub link
Add colab

References

[1] From Big to Small: Multi-Scale Local Planar Guidance for Monocular Depth Estimation. [code]

[2] SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers. [code]

Global-Local Path Networks for Monocular Depth Estimation with Vertical CutDepth [Paper]

Related tags

Overview

Global-Local Path Networks for Monocular Depth Estimation with Vertical CutDepth [Paper]

Downloads

Requirements

Inference and Evaluate

Dataset

NYU Depth V2

KITTI

Evaluation

Inference

To-Do

References

Owner

Spherical Confidence Learning for Face Recognition, accepted to CVPR2021.

Pixel Consensus Voting for Panoptic Segmentation (CVPR 2020)

Orthogonal Jacobian Regularization for Unsupervised Disentanglement in Image Generation (ICCV 2021)

Repository for training material for the 2022 SDSC HPC/CI User Training Course

Migration of Edge-based Distributed Federated Learning

Implementation of OpenAI paper with Simple Noise Scale on Fastai V2

AI assistant built in python.the features are it can display time,say weather,open-google,youtube,instagram.

CLIP2Video: Mastering Video-Text Retrieval via Image CLIP

Source code of D-HAN: Dynamic News Recommendation with Hierarchical Attention Network

Rule Based Classification Project

《Geo Word Clouds》paper implementation

PyTorch implementation of adversarial patch

TensorFlow implementation of "Learning from Simulated and Unsupervised Images through Adversarial Training"

Code and data to accompany the camera-ready version of "Cross-Attention is All You Need: Adapting Pretrained Transformers for Machine Translation" in EMNLP 2021

Wind Speed Prediction using LSTMs in PyTorch

Official implementation of MSR-GCN (ICCV 2021 paper)

Gapmm2: gapped alignment using minimap2 (align transcripts to genome)

This is the official github repository of the Met dataset

A library for graph deep learning research

Seach Losses of our paper 'Loss Function Discovery for Object Detection via Convergence-Simulation Driven Search', accepted by ICLR 2021.