an implementation of Video Frame Interpolation via Adaptive Separable Convolution using PyTorch

Last update: Jan 08, 2023

Overview

This work has now been superseded by: https://github.com/sniklaus/revisiting-sepconv

sepconv-slomo

This is a reference implementation of Video Frame Interpolation via Adaptive Separable Convolution [1] using PyTorch. Given two frames, it will make use of adaptive convolution [2] in a separable manner to interpolate the intermediate frame. Should you be making use of our work, please cite our paper [1].

For a reimplemntation of our work, see: https://github.com/martkartasev/sepconv
And for another adaptation, consider: https://github.com/HyeongminLEE/pytorch-sepconv
For softmax splatting, please see: https://github.com/sniklaus/softmax-splatting

setup

The separable convolution layer is implemented in CUDA using CuPy, which is why CuPy is a required dependency. It can be installed using pip install cupy or alternatively using one of the provided binary packages as outlined in the CuPy repository.

If you plan to process videos, then please also make sure to have pip install moviepy installed.

usage

To run it on your own pair of frames, use the following command. You can either select the l1 or the lf model, please see our paper for more details. In short, the l1 model should be used for quantitative evaluations and the lf model for qualitative comparisons.

python run.py --model lf --one ./images/one.png --two ./images/two.png --out ./out.png

To run in on a video, use the following command.

python run.py --model lf --video ./videos/car-turn.mp4 --out ./out.mp4

For a quick benchmark using examples from the Middlebury benchmark for optical flow, run python benchmark.py. You can use it to easily verify that the provided implementation runs as expected.

video

license

The provided implementation is strictly for academic purposes only. Should you be interested in using our technology for any commercial use, please feel free to contact us.

references

[1]  @inproceedings{Niklaus_ICCV_2017,
         author = {Simon Niklaus and Long Mai and Feng Liu},
         title = {Video Frame Interpolation via Adaptive Separable Convolution},
         booktitle = {IEEE International Conference on Computer Vision},
         year = {2017}
     }

[2]  @inproceedings{Niklaus_CVPR_2017,
         author = {Simon Niklaus and Long Mai and Feng Liu},
         title = {Video Frame Interpolation via Adaptive Convolution},
         booktitle = {IEEE Conference on Computer Vision and Pattern Recognition},
         year = {2017}
     }

acknowledgment

This work was supported by NSF IIS-1321119. The video above uses materials under a Creative Common license or with the owner's permission, as detailed at the end.

an implementation of Video Frame Interpolation via Adaptive Separable Convolution using PyTorch

Related tags

Overview

sepconv-slomo

setup

usage

video

license

references

acknowledgment

Owner

Simon Niklaus

A Graph Neural Network Tool for Recovering Dense Sub-graphs in Random Dense Graphs.

Convert Python 3 code to CUDA code.

Code & Data for Enhancing Photorealism Enhancement

This is an official implementation for "Self-Supervised Learning with Swin Transformers".

一些经典的CTR算法的复现; LR, FM, FFM, AFM, DeepFM，xDeepFM, PNN, DCN, DCNv2, DIFM, AutoInt, FiBiNet,AFN,ONN,DIN, DIEN ... （pytorch, tf2.0）

PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners for self-supervised ViT.

《Dual-Resolution Correspondence Network》(NeurIPS 2020)

This is the PyTorch implementation of GANs N’ Roses: Stable, Controllable, Diverse Image to Image Translation

Kaggle G2Net Gravitational Wave Detection : 2nd place solution

A PyTorch implementation of " EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks."

Official repository for "Restormer: Efficient Transformer for High-Resolution Image Restoration". SOTA results for single-image motion deblurring, image deraining, image denoising (synthetic and real data), and dual-pixel defocus deblurring.

Riemannian Geometry for Molecular Surface Approximation (RGMolSA)

Line-level Handwritten Text Recognition (HTR) system implemented with TensorFlow.

Code of paper Interact, Embed, and EnlargE (IEEE): Boosting Modality-specific Representations for Multi-Modal Person Re-identification.

Generative Modelling of BRDF Textures from Flash Images [SIGGRAPH Asia, 2021]

Codes for SIGIR'22 Paper 'On-Device Next-Item Recommendation with Self-Supervised Knowledge Distillation'

🏃‍♀️ A curated list about human motion capture, analysis and synthesis.

An excellent hash algorithm combining classical sponge structure and RNN.

Implementation of Hierarchical Transformer Memory (HTM) for Pytorch

Adaptive Denoising Training (ADT) for Recommendation.