RIFE - Real-Time Intermediate Flow Estimation for Video Frame Interpolation

Last update: Dec 09, 2022

Overview

RIFE - Real-Time Intermediate Flow Estimation for Video Frame Interpolation

YouTube | BiliBili

16X interpolation results from two input images:

Introduction

This project is an official implementation (MegEngine implementation) of RIFE: Real-Time Intermediate Flow Estimation for Video Frame Interpolation. For Pytorch implementation, please refers to this repo. Currently, our model can run 30+FPS for 2X 720p interpolation on a 2080Ti GPU. It supports arbitrary-timestep interpolation between a pair of images.

CLI Usage

Installation

git clone [email protected]:MegEngine/arXiv2020-RIFE
cd arXiv2020-RIFE
pip3 install -r requirements.txt

Download the pretrained HD models from here.
Unzip and move the pretrained parameters to train_log/*
This model is not reported by our paper, for our paper model please refer to evaluation.

Run

Image Interpolation

python3 inference_img.py --img img0.png img1.png --exp=4

(2^4=16X interpolation results) After that, you can use pngs to generate mp4:

ffmpeg -r 10 -f image2 -i output/img%d.png -s 448x256 -c:v libx264 -pix_fmt yuv420p output/slomo.mp4 -q:v 0 -q:a 0

You can also use pngs to generate gif:

ffmpeg -r 10 -f image2 -i output/img%d.png -s 448x256 -vf "split[s0][s1];[s0]palettegen=stats_mode=single[p];[s1][p]paletteuse=new=1" output/slomo.gif

Evaluation

Download RIFE model or RIFE_m model reported by our paper.

MiddleBury: Download MiddleBury OTHER dataset at ./other-data and ./other-gt-interp

HD: Download HD dataset at ./HD_dataset. We also provide a google drive download link.

We provide code for evaluating with datasets above, please follow lines:

python3 benchmark/HD_multi_4X.py
python3 benchmark/HD.py
python3 benchmark/MiddleBury_Other.py
python3 benchmark/yuv_frame_io.py
python3 testtime.py

Training and Reproduction

Download Vimeo90K dataset.

We use 16 CPUs, 4 GPUs and 20G memory for training:

python3 train.py --arbitrary=False

Citation

@article{huang2020rife,
  title={RIFE: Real-Time Intermediate Flow Estimation for Video Frame Interpolation},
  author={Huang, Zhewei and Zhang, Tianyuan and Heng, Wen and Shi, Boxin and Zhou, Shuchang},
  journal={arXiv preprint arXiv:2011.06294},
  year={2020}
}

Reference

Optical Flow: ARFlow pytorch-liteflownet RAFT pytorch-PWCNet

Video Interpolation: DVF TOflow SepConv DAIN CAIN MEMC-Net SoftSplat BMBC EDSC

RIFE - Real-Time Intermediate Flow Estimation for Video Frame Interpolation

Related tags

Overview

RIFE - Real-Time Intermediate Flow Estimation for Video Frame Interpolation

YouTube | BiliBili

Introduction

CLI Usage

Installation

Run

Evaluation

Training and Reproduction

Citation

Reference

Owner

旷视天元 MegEngine

Tensorflow implementation of our method: "Triangle Graph Interest Network for Click-through Rate Prediction".

Learning-Augmented Dynamic Power Management

MvtecAD unsupervised Anomaly Detection

PyTorch original implementation of Cross-lingual Language Model Pretraining.

Energy consumption estimation utilities for Jetson-based platforms

Unofficial JAX implementations of Deep Learning models

This repository contains notebook implementations of the following Neural Process variants: Conditional Neural Processes (CNPs), Neural Processes (NPs), Attentive Neural Processes (ANPs).

BisQue is a web-based platform designed to provide researchers with organizational and quantitative analysis tools for 5D image data. Users can extend BisQue by implementing containerized ML workflows.

A Convolutional Transformer for Keyword Spotting

Kalidokit is a blendshape and kinematics solver for Mediapipe/Tensorflow.js face, eyes, pose, and hand tracking models

This implementation contains the application of GPlearn's symbolic transformer on a commodity futures sector of the financial market.

Official PaddlePaddle implementation of Paint Transformer

A Streamlit demo demonstrating the Deep Dream technique. Adapted from the TensorFlow Deep Dream tutorial.

Myia prototyping

The Official TensorFlow Implementation for SPatchGAN (ICCV2021)

KSAI Lite is a deep learning inference framework of kingsoft, based on tensorflow lite

Consensus Learning from Heterogeneous Objectives for One-Class Collaborative Filtering

Accuracy Aligned. Concise Implementation of Swin Transformer

Code for CPM-2 Pre-Train

Code base for the paper "Scalable One-Pass Optimisation of High-Dimensional Weight-Update Hyperparameters by Implicit Differentiation"