an implementation of softmax splatting for differentiable forward warping using PyTorch

Last update: Dec 28, 2022

Overview

softmax-splatting

This is a reference implementation of the softmax splatting operator, which has been proposed in Softmax Splatting for Video Frame Interpolation [1], using PyTorch. Softmax splatting is a well-motivated approach for differentiable forward warping. It uses a translational invariant importance metric to disambiguate cases where multiple source pixels map to the same target pixel. Should you be making use of our work, please cite our paper [1].

setup

The softmax splatting is implemented in CUDA using CuPy, which is why CuPy is a required dependency. It can be installed using pip install cupy or alternatively using one of the provided binary packages as outlined in the CuPy repository.

The provided example script is using OpenCV to load and display images, as well as to read the provided optical flow file. An easy way to install OpenCV for Python is using the pip install opencv-contrib-python package.

usage

We provide a small script to replicate the third figure of our paper [1]. You can simply run python run.py to obtain the comparison between summation splatting, average splatting, linear splatting, and softmax splatting. Please see this exemplatory run.py for additional information on how to use the provided reference implementation of our proposed softmax splatting operator for differentiable forward warping.

xiph

In our paper, we propose to use 4K video clips from Xiph to evaluate video frame interpolation on high-resolution footage. Please see the supplementary benchmark.py on how to reproduce the shown metrics.

video

license

The provided implementation is strictly for academic purposes only. Should you be interested in using our technology for any commercial use, please feel free to contact us.

references

[1]  @inproceedings{Niklaus_CVPR_2020,
         author = {Simon Niklaus and Feng Liu},
         title = {Softmax Splatting for Video Frame Interpolation},
         booktitle = {IEEE Conference on Computer Vision and Pattern Recognition},
         year = {2020}
     }

acknowledgment

The video above uses materials under a Creative Common license as detailed at the end.

an implementation of softmax splatting for differentiable forward warping using PyTorch

Related tags

Overview

softmax-splatting

setup

usage

xiph

video

license

references

acknowledgment

Owner

Simon Niklaus

Personal implementation of paper "Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval"

Java and SHACL code commented in the paper "Towards compliance checking in reified I/O logic via SHACL" submitted to ICAIL 2021

Official repository for CVPR21 paper "Deep Stable Learning for Out-Of-Distribution Generalization".

This is a Machine Learning Based Hand Detector Project, It Uses Machine Learning Models and Modules Like Mediapipe, Developed By Google!

This is the official code for the paper "Ad2Attack: Adaptive Adversarial Attack for Real-Time UAV Tracking".

[SDM 2022] Towards Similarity-Aware Time-Series Classification

Supervised Contrastive Learning for Product Matching

The official code for paper "R2D2: Recursive Transformer based on Differentiable Tree for Interpretable Hierarchical Language Modeling".

An automated facial recognition based attendance system (desktop application)

Code for "Neural Body: Implicit Neural Representations with Structured Latent Codes for Novel View Synthesis of Dynamic Humans" CVPR 2021 best paper candidate

Code used to generate the results appearing in "Train longer, generalize better: closing the generalization gap in large batch training of neural networks"

It is a system used to detect bone fractures. using techniques deep learning and image processing

Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge

Unified MultiWOZ evaluation scripts for the context-to-response task.

Small utility to demangle Nim symbols in callgrind files

CVPR '21: In the light of feature distributions: Moment matching for Neural Style Transfer

Structure Information is the Key: Self-Attention RoI Feature Extractor in 3D Object Detection

Deep ViT Features as Dense Visual Descriptors

[CVPR2021 Oral] UP-DETR: Unsupervised Pre-training for Object Detection with Transformers

Material related to the Principles of Cloud Computing course.