Unsupervised Learning of Multi-Frame Optical Flow with Occlusions

Last update: Nov 02, 2022

Overview

This is a Pytorch implementation of

Janai, J., Güney, F., Ranjan, A., Black, M. and Geiger, A., Unsupervised Learning of Multi-Frame Optical Flow with Occlusions. ECCV 2018.

[Link to Paper] [Project Page] [Original Torch Code]

Requirements

Runs and tested on Pytorch 0.3.1, it should be compatible with higher versions with little/no modifications.
Correlation package is taken from NVIDIA/flownet2-pytorch and it can be installed using

cd correlation_package
bash make.sh

If you are using Pytorch>0.3.1, you can use correlation layer from here.

Usage

To use the model, go to your favorite python environment

from back2future import Model
model = Model(pretrained='pretrained/path_to_your_favorite_model')

There are two pretrained models in pretrained/, that are fine tuned on Sintel and KITTI in an unsupervised way.

Refer to demo.py for more.

Testing

To test performance on KITTI, use

python3 test_back2future.py --pretrained-flow path/to/pretrained/model --kitti-dir path/to/kitti/2015/root

Training

Please use the [original torch code] for training new models.

License

This is a reimplementation. License for the original work can be found at JJanai/back2future.

While using this code, please cite

@inproceedings{Janai2018ECCV,
  title = {Unsupervised Learning of Multi-Frame Optical Flow with Occlusions },
  author = {Janai, Joel and G{"u}ney, Fatma and Ranjan, Anurag and Black, Michael J. and Geiger, Andreas},
  booktitle = {European Conference on Computer Vision (ECCV)},
  volume = {Lecture Notes in Computer Science, vol 11220},
  pages = {713--731},
  publisher = {Springer, Cham},
  month = sep,
  year = {2018},
  month_numeric = {9}
}

Unsupervised Learning of Multi-Frame Optical Flow with Occlusions

Related tags

Overview

Requirements

Usage

Testing

Training

License

While using this code, please cite

Owner

Anurag Ranjan

Playable Video Generation

🍅🍅🍅YOLOv5-Lite: lighter, faster and easier to deploy. Evolved from yolov5 and the size of model is only 1.7M (int8) and 3.3M (fp16). It can reach 10+ FPS on the Raspberry Pi 4B when the input size is 320×320~

Python script for performing depth completion from sparse depth and rgb images using the msg_chn_wacv20. model in ONNX

GoodNews Everyone! Context driven entity aware captioning for news images

Adaptive Graph Convolution for Point Cloud Analysis

Luminaire is a python package that provides ML driven solutions for monitoring time series data.

Edge-oriented Convolution Block for Real-time Super Resolution on Mobile Devices, ACM Multimedia 2021

[ICCV2021] Learning to Track Objects from Unlabeled Videos

[ICLR2021oral] Rethinking Architecture Selection in Differentiable NAS

Sandbox for training deep learning networks

A Pytorch Implementation of Domain adaptation of object detector using scissor-like networks

A python library for time-series smoothing and outlier detection in a vectorized way.

D2Go is a toolkit for efficient deep learning

Contains modeling practice materials and homework for the Computational Neuroscience course at Okinawa Institute of Science and Technology

Official project website for the CVPR 2021 paper "Exploring intermediate representation for monocular vehicle pose estimation"

Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech

FFCV: Fast Forward Computer Vision (and other ML workloads!)

[NeurIPS 2019] Learning Imbalanced Datasets with Label-Distribution-Aware Margin Loss

TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.

MAg: a simple learning-based patient-level aggregation method for detecting microsatellite instability from whole-slide images

Unsupervised Learning of Multi-Frame Optical Flow with Occlusions

Related tags

Overview

Requirements

Usage

Testing

Training

License

While using this code, please cite

Owner

Anurag Ranjan

Playable Video Generation

🍅🍅🍅YOLOv5-Lite: lighter, faster and easier to deploy. Evolved from yolov5 and the size of model is only 1.7M (int8) and 3.3M (fp16). It can reach 10+ FPS on the Raspberry Pi 4B when the input size is 320×320~

Python script for performing depth completion from sparse depth and rgb images using the msg_chn_wacv20. model in ONNX

GoodNews Everyone! Context driven entity aware captioning for news images

Adaptive Graph Convolution for Point Cloud Analysis

Luminaire is a python package that provides ML driven solutions for monitoring time series data.

Edge-oriented Convolution Block for Real-time Super Resolution on Mobile Devices, ACM Multimedia 2021

[ICCV2021] Learning to Track Objects from Unlabeled Videos

[ICLR2021oral] Rethinking Architecture Selection in Differentiable NAS

Sandbox for training deep learning networks

A Pytorch Implementation of Domain adaptation of object detector using scissor-like networks

A python library for time-series smoothing and outlier detection in a vectorized way.

D2Go is a toolkit for efficient deep learning

Contains modeling practice materials and homework for the Computational Neuroscience course at Okinawa Institute of Science and Technology

Official project website for the CVPR 2021 paper "Exploring intermediate representation for monocular vehicle pose estimation"

Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech

FFCV: Fast Forward Computer Vision (and other ML workloads!)

[NeurIPS 2019] Learning Imbalanced Datasets with Label-Distribution-Aware Margin Loss

​TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.

MAg: a simple learning-based patient-level aggregation method for detecting microsatellite instability from whole-slide images

TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.