official Pytorch implementation of ICCV 2021 paper FuseFormer: Fusing Fine-Grained Information in Transformers for Video Inpainting.

Last update: Dec 27, 2022

Related tags

Deep Learning FuseFormer

Overview

FuseFormer: Fusing Fine-Grained Information in Transformers for Video Inpainting

By Rui Liu, Hanming Deng, Yangyi Huang, Xiaoyu Shi, Lewei Lu, Wenxiu Sun, Xiaogang Wang, Jifeng Dai, Hongsheng Li.

This repo is the official Pytorch implementation of FuseFormer: Fusing Fine-Grained Information in Transformers for Video Inpainting.

Introduction

Usage

Prerequisites

Python >= 3.6
Pytorch >= 1.0 and corresponding torchvision (https://pytorch.org/)

Install

Clone this repo:

git clone https://github.com/ruiliu-ai/FuseFormer.git

Install other packages:

cd FuseFormer
pip install -r requirements.txt

Training

Dataset preparation

Download datasets (YouTube-VOS and DAVIS) into the data folder.

mkdir data

Training script

python train.py -c configs/youtube-vos.json

Test

Download pre-trained model into checkpoints folder.

mkdir checkpoints

Test script

python test.py -c checkpoints/fuseformer.pth -v data/DAVIS/JPEGImages/blackswan -m data/DAVIS/Annotations/blackswan

Citing FuseFormer

If you find FuseFormer useful in your research, please consider citing:

@InProceedings{Liu_2021_FuseFormer,
  title={FuseFormer: Fusing Fine-Grained Information in Transformers for Video Inpainting},
  author={Liu, Rui and Deng, Hanming and Huang, Yangyi and Shi, Xiaoyu and Lu, Lewei and Sun, Wenxiu and Wang, Xiaogang and Dai, Jifeng and Li, Hongsheng},
  booktitle = {International Conference on Computer Vision (ICCV)},
  year={2021}
}

Acknowledement

This code borrows heavily from the video inpainting framework spatial-temporal transformer net.

official Pytorch implementation of ICCV 2021 paper FuseFormer: Fusing Fine-Grained Information in Transformers for Video Inpainting.

Related tags

Overview

FuseFormer: Fusing Fine-Grained Information in Transformers for Video Inpainting

Introduction

Usage

Prerequisites

Install

Training

Dataset preparation

Training script

Test

Test script

Citing FuseFormer

Acknowledement

Owner

REGTR: End-to-end Point Cloud Correspondences with Transformers

Spectrum is an AI that uses machine learning to generate Rap song lyrics

Convert ONNX model graph to Keras model format.

Pytorch implementation of RED-SDS (NeurIPS 2021).

Camview - A CLI-tool used to stream CCTV online footage based on URL params

Reproducible research and reusable acyclic workflows in Python. Execute code on HPC systems as if you executed them on your personal computer!

DARTS-: Robustly Stepping out of Performance Collapse Without Indicators

HGCN: Harmonic Gated Compensation Network For Speech Enhancement

Continual reinforcement learning baselines: experiment specifications, implementation of existing methods, and common metrics. Easily extensible to new methods.

Geometry-Free View Synthesis: Transformers and no 3D Priors

This project aims to be a handler for input creation and running of multiple RICEWQ simulations.

Mixed Neural Likelihood Estimation for models of decision-making

Continual Learning of Electronic Health Records (EHR).

BabelCalib: A Universal Approach to Calibrating Central Cameras. In ICCV (2021)

A flexible framework of neural networks for deep learning

Scalable, event-driven, deep-learning-friendly backtesting library

[NeurIPS 2021] Source code for the paper "Qu-ANTI-zation: Exploiting Neural Network Quantization for Achieving Adversarial Outcomes"

Reviving Iterative Training with Mask Guidance for Interactive Segmentation

DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting

Use Python, OpenCV, and MediaPipe to control a keyboard with facial gestures