PyTorch reimplementation of PSM-Net: "Pyramid Stereo Matching Network"

Last update: Nov 25, 2021

Overview

This is a PyTorch Lightning version of PSMNet, based on JiaRenChang/PSMNet.

Run python main.py to start training.

PSM-Net

PyTorch reimplementation of the paper "Pyramid Stereo Matching Network" (PSM-Net, CVPR 2018) by Jia-Ren Chang and Yong-Sheng Chen.

Official repository: JiaRenChang/PSMNet

Usage

1) Requirements

Python 3.5+
PyTorch 0.4
OpenCV-Python
Matplotlib
TensorboardX
Tensorboard

All dependencies are listed in requirements.txt. Run the following command to install them:

pip install -r requirements.txt
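After installing, a quick check that the packages are importable can save a failed training run. The sketch below is only an illustration: the import names (torch, cv2, tensorboardX, and so on) are the usual module names for the packages listed above, which is an assumption about what requirements.txt actually pins.

```python
import importlib.util

# Package name (as listed above) -> importable module name.
# The module names are assumptions based on how these packages are usually imported.
REQUIRED_MODULES = {
    "PyTorch": "torch",
    "OpenCV-Python": "cv2",
    "Matplotlib": "matplotlib",
    "TensorboardX": "tensorboardX",
    "Tensorboard": "tensorboard",
}


def check_dependencies(modules=REQUIRED_MODULES):
    """Return a dict mapping each package name to True if its module can be located."""
    return {
        package: importlib.util.find_spec(module) is not None
        for package, module in modules.items()
    }


if __name__ == "__main__":
    for package, found in check_dependencies().items():
        print("{:15s} {}".format(package, "ok" if found else "MISSING"))
```

The check only locates the modules; it does not import them or verify versions.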

2) Train

usage: train.py [-h] [--maxdisp MAXDISP] [--logdir LOGDIR] [--datadir DATADIR]
                [--cuda CUDA] [--batch-size BATCH_SIZE]
                [--validate-batch-size VALIDATE_BATCH_SIZE]
                [--log-per-step LOG_PER_STEP]
                [--save-per-epoch SAVE_PER_EPOCH] [--model-dir MODEL_DIR]
                [--lr LR] [--num-epochs NUM_EPOCHS]
                [--num-workers NUM_WORKERS]

PSMNet

optional arguments:
  -h, --help            show this help message and exit
  --maxdisp MAXDISP     max disparity
  --logdir LOGDIR       log directory
  --datadir DATADIR     data directory
  --cuda CUDA           gpu number
  --batch-size BATCH_SIZE
                        batch size
  --validate-batch-size VALIDATE_BATCH_SIZE
                        batch size
  --log-per-step LOG_PER_STEP
                        log per step
  --save-per-epoch SAVE_PER_EPOCH
                        save model per epoch
  --model-dir MODEL_DIR
                        directory where save model checkpoint
  --lr LR               learning rate
  --num-epochs NUM_EPOCHS
                        number of training epochs
  --num-workers NUM_WORKERS
                        num workers in loading data

For example:

python train.py --batch-size 16 \
                --logdir log/example \
                --num-epochs 500
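For readers who want to see how the flags above map to Python values, here is a minimal argparse sketch that exposes the same options. It is not the repository's train.py: the argument types are guesses from the help text, and no defaults are set because the usage text does not state them.

```python
import argparse


def build_train_parser():
    """Build a parser with the flags shown in the train.py usage text.

    Types are assumptions inferred from the help strings; defaults are
    deliberately left unset (None) because the usage text does not give them.
    """
    parser = argparse.ArgumentParser(prog="train.py", description="PSMNet")
    parser.add_argument("--maxdisp", type=int, help="max disparity")
    parser.add_argument("--logdir", type=str, help="log directory")
    parser.add_argument("--datadir", type=str, help="data directory")
    parser.add_argument("--cuda", type=int, help="gpu number")
    parser.add_argument("--batch-size", type=int, help="batch size")
    parser.add_argument("--validate-batch-size", type=int, help="batch size")
    parser.add_argument("--log-per-step", type=int, help="log per step")
    parser.add_argument("--save-per-epoch", type=int, help="save model per epoch")
    parser.add_argument("--model-dir", type=str,
                        help="directory where save model checkpoint")
    parser.add_argument("--lr", type=float, help="learning rate")
    parser.add_argument("--num-epochs", type=int, help="number of training epochs")
    parser.add_argument("--num-workers", type=int,
                        help="num workers in loading data")
    return parser


if __name__ == "__main__":
    # Dashes in flag names become underscores on the parsed namespace.
    args = build_train_parser().parse_args()
    print(vars(args))
```

With the example command above, the parsed namespace would carry batch_size, logdir and num_epochs, and every other option would stay unset.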

3) Visualize result

This repository uses tensorboardX to visualize training results. Find your log directory and launch TensorBoard to inspect them. The default log directory is /log.

tensorboard --logdir <your_log_dir>

Here are some of my training results (after training for 1000 epochs on KITTI2015): [figures not available]

4) Inference

usage: inference.py [-h] [--maxdisp MAXDISP] [--left LEFT] [--right RIGHT]
                    [--model-path MODEL_PATH] [--save-path SAVE_PATH]

PSMNet inference

optional arguments:
  -h, --help            show this help message and exit
  --maxdisp MAXDISP     max disparity
  --left LEFT           path to the left image
  --right RIGHT         path to the right image
  --model-path MODEL_PATH
                        path to the model
  --save-path SAVE_PATH
                        path to save the disp image

For example:

python inference.py --left test/left.png \
                    --right test/right.png \
                    --model-path checkpoint/08/best_model.ckpt \
                    --save-path test/disp.png
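To run inference on several stereo pairs, the command above can be assembled from Python. The helper below is a hypothetical convenience wrapper, not part of the repository; it uses only the flags shown in the usage text and simply builds the argument list.

```python
import sys


def build_inference_command(left, right, model_path, save_path, maxdisp=None):
    """Return the argv list for one inference.py call.

    Hypothetical helper: it only uses flags from the usage text above and
    adds --maxdisp when a value is given explicitly.
    """
    cmd = [
        sys.executable, "inference.py",
        "--left", str(left),
        "--right", str(right),
        "--model-path", str(model_path),
        "--save-path", str(save_path),
    ]
    if maxdisp is not None:
        cmd += ["--maxdisp", str(maxdisp)]
    return cmd


if __name__ == "__main__":
    command = build_inference_command(
        "test/left.png", "test/right.png",
        "checkpoint/08/best_model.ckpt", "test/disp.png",
    )
    print(" ".join(command))
```

Passing the returned list to subprocess.run(cmd, check=True) from the repository root would execute it; building a list rather than a shell string avoids quoting problems with paths that contain spaces.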

5) Pretrained model

A model trained for 1000 epochs on the KITTI2015 dataset can be downloaded here. (I chose the best model among the 1000 epochs.)

state = {
    'epoch': 857,
    '3px-error': 3.466
}
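The '3px-error' value is presumably a percentage of pixels whose predicted disparity is off by more than 3 pixels. The sketch below shows one common way to compute such a metric; it is an assumption about the definition, not the repository's evaluation code. In particular, the official KITTI 2015 metric also requires the error to exceed 5% of the true disparity, which this simplified version leaves out.

```python
def three_pixel_error(predicted, ground_truth, threshold=3.0):
    """Percentage of valid pixels whose absolute disparity error exceeds `threshold`.

    Both inputs are flat sequences of disparities of equal length.
    Ground-truth values <= 0 are treated as invalid (no measurement) and skipped,
    which is an assumed convention for sparse ground truth.
    """
    errors = [abs(p - g) for p, g in zip(predicted, ground_truth) if g > 0]
    if not errors:
        raise ValueError("no valid ground-truth pixels")
    bad = sum(1 for e in errors if e > threshold)
    return 100.0 * bad / len(errors)


if __name__ == "__main__":
    pred = [10, 20, 30, 40, 50]
    truth = [10, 24, 0, 41, 58]  # the third pixel has no ground truth
    print(three_pixel_error(pred, truth))  # 2 of 4 valid pixels are off by more than 3 px
```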

Task List

Contact

Email: [email protected]

Any discussion is welcome!

Owner

XIAOTIAN LIU
