Official MegEngine implementation of CREStereo(CVPR 2022 Oral).

Last update: Dec 30, 2022

Overview

[CVPR 2022] Practical Stereo Matching via Cascaded Recurrent Network with Adaptive Correlation

This repository contains MegEngine implementation of our paper:

Practical Stereo Matching via Cascaded Recurrent Network with Adaptive Correlation
Jiankun Li, Peisen Wang, Pengfei Xiong, Tao Cai, Ziwei Yan, Lei Yang, Jiangyu Liu, Haoqiang Fan, Shuaicheng Liu
CVPR 2022

arXiv | BibTeX

Datasets

The Proposed Dataset

Download

There are two ways to download the dataset(~400GB) proposed in our paper:

Download using shell scripts dataset_download.sh

sh dataset_download.sh

the dataset will be downloaded and extracted in ./stereo_trainset/crestereo

Download from BaiduCloud here(Extraction code: aa3g) and extract the tar files manually.

Disparity Format

The disparity is saved as .png uint16 format which can be loaded using opencv imread function:

def get_disp(disp_path):
    disp = cv2.imread(disp_path, cv2.IMREAD_UNCHANGED)
    return disp.astype(np.float32) / 32

Other Public Datasets

Other public datasets we use including

Dependencies

CUDA Version: 10.1, Python Version: 3.6.9

MegEngine v1.8.2
opencv-python v3.4.0
numpy v1.18.1
Pillow v8.4.0
tensorboardX v2.1

python3 -m pip install -r requirements.txt

We also provide docker to run the code quickly:

docker run --gpus all -it -v /tmp:/tmp ylmegvii/crestereo
shotwell /tmp/disparity.png

Inference

Download the pretrained MegEngine model from here and run:

python3 test.py --model_path path_to_mge_model --left img/test/left.png --right img/test/right.png --size 1024x1536 --output disparity.png

Training

Modify the configurations in cfgs/train.yaml and run the following command:

python3 train.py

You can launch a TensorBoard to monitor the training process:

tensorboard --logdir ./train_log

and navigate to the page at http://localhost:6006 in your browser.

Acknowledgements

Part of the code is adapted from previous works:

RAFT(code base)
LoFTR(attention module)
HSMNet(data augmentaion)

We thank all the authors for their awesome repos.

Citation

If you find the code or datasets helpful in your research, please cite:

@misc{Li2022PracticalSM,
      title={Practical Stereo Matching via Cascaded Recurrent Network with Adaptive Correlation},
      author={Jiankun Li and Peisen Wang and Pengfei Xiong and Tao Cai and Ziwei Yan and Lei Yang and Jiangyu Liu and Haoqiang Fan and Shuaicheng Liu},
      year={2022},
      eprint={2203.11483},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Official MegEngine implementation of CREStereo(CVPR 2022 Oral).

Related tags

Overview

[CVPR 2022] Practical Stereo Matching via Cascaded Recurrent Network with Adaptive Correlation

Datasets

The Proposed Dataset

Download

Disparity Format

Other Public Datasets

Dependencies

Inference

Training

Acknowledgements

Citation

Owner

MEGVII Research

Hierarchical Aggregation for 3D Instance Segmentation (ICCV 2021)

TransZero++: Cross Attribute-guided Transformer for Zero-Shot Learning

MATLAB codes of the book "Digital Image Processing Fourth Edition" converted to Python

Wider or Deeper: Revisiting the ResNet Model for Visual Recognition

MLOps will help you to understand how to build a Continuous Integration and Continuous Delivery pipeline for an ML/AI project.

Nb workflows - A workflow platform which allows you to run parameterized notebooks programmatically

Implementation of "A MLP-like Architecture for Dense Prediction"

SpecAugmentPyTorch - A Pytorch (support batch and channel) implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition

Implementing Vision Transformer (ViT) in PyTorch

Code for Understanding Pooling in Graph Neural Networks

This repository gives an example on how to preprocess the data of the HECKTOR challenge

PyTorch implementation of Neural View Synthesis and Matching for Semi-Supervised Few-Shot Learning of 3D Pose

The official implementation of Equalization Loss v1 & v2 (CVPR 2020, 2021) based on MMDetection.

A robust camera and Lidar fusion based velocity estimator to undistort the pointcloud.

[ACM MM 2019 Oral] Cycle In Cycle Generative Adversarial Networks for Keypoint-Guided Image Generation

Predict stock movement with Machine Learning and Deep Learning algorithms

Yolov5+SlowFast: Realtime Action Detection Based on PytorchVideo

Regulatory Instruments for Fair Personalized Pricing.

"MST++: Multi-stage Spectral-wise Transformer for Efficient Spectral Reconstruction" (CVPRW 2022) & (Winner of NTIRE 2022 Challenge on Spectral Reconstruction from RGB)

TensorFlow Metal Backend on Apple Silicon Experiments (just for fun)