PyTorch implementation of Decoupled Spatial-Temporal Transformer for Video Inpainting

Last update: Dec 13, 2022

Overview

Decoupled Spatial-Temporal Transformer for Video Inpainting

By Rui Liu, Hanming Deng, Yangyi Huang, Xiaoyu Shi, Lewei Lu, Wenxiu Sun, Xiaogang Wang, Jifeng Dai, Hongsheng Li.

This repo is the official PyTorch implementation of Decoupled Spatial-Temporal Transformer for Video Inpainting.

Introduction

Usage

Prerequisites

Python >= 3.6
PyTorch >= 1.0 and the corresponding torchvision (https://pytorch.org/)
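
The minimums above can be checked before installing anything else. The following is a minimal sketch, not part of the repo: the helper names (`version_tuple`, `meets_minimum`, `check_prerequisites`) are hypothetical, and it only assumes that `torch` and `torchvision` expose a `__version__` attribute when installed.

```python
import importlib
import sys


def version_tuple(version):
    """Turn a string such as '1.13.1+cu117' into (1, 13, 1), dropping suffixes."""
    parts = []
    for piece in version.split("+")[0].split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break
        parts.append(int(digits))
    return tuple(parts)


def meets_minimum(installed, minimum):
    """Return True if the installed version is at least the minimum version."""
    have, need = version_tuple(installed), version_tuple(minimum)
    width = max(len(have), len(need))
    have += (0,) * (width - len(have))
    need += (0,) * (width - len(need))
    return have >= need


def installed_version(module_name):
    """Return the module's __version__ string, or None if it cannot be imported."""
    try:
        module = importlib.import_module(module_name)
    except ImportError:
        return None
    return getattr(module, "__version__", None)


def check_prerequisites():
    """Report whether Python, torch, and torchvision satisfy the stated minimums."""
    torch_version = installed_version("torch")
    return {
        "python >= 3.6": sys.version_info[:2] >= (3, 6),
        "torch >= 1.0": torch_version is not None
        and meets_minimum(torch_version, "1.0"),
        # Only presence is checked: the matching torchvision release
        # depends on the installed torch release.
        "torchvision installed": installed_version("torchvision") is not None,
    }


if __name__ == "__main__":
    for name, ok in check_prerequisites().items():
        print(f"{name}: {'ok' if ok else 'missing or too old'}")
```

The check does not verify that torchvision matches the installed PyTorch release; the compatibility table on the PyTorch site remains the reference for that.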

Install

Clone this repo:

git clone https://github.com/ruiliu-ai/DSTT.git

Install other packages:

cd DSTT
pip install -r requirements.txt

Training

Dataset preparation

Download datasets (YouTube-VOS and DAVIS) into the data folder.

mkdir data

Training script

python train.py -c configs/youtube-vos.json
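
Because training is driven by a JSON config, a malformed file is cheaper to catch before the run starts. The sketch below is an illustration, not code from the repo: `load_config` is a hypothetical helper, and it assumes only that the config is a plain JSON object. It makes no assumption about which keys `train.py` expects and simply lists whatever top-level keys it finds.

```python
import json
from pathlib import Path


def load_config(path):
    """Parse a JSON config file into a dict, failing early with a clear message."""
    config_path = Path(path)
    if not config_path.is_file():
        raise FileNotFoundError(f"Config file not found: {config_path}")
    with config_path.open("r", encoding="utf-8") as handle:
        try:
            config = json.load(handle)
        except json.JSONDecodeError as err:
            raise ValueError(f"{config_path} is not valid JSON: {err}") from err
    if not isinstance(config, dict):
        raise ValueError(f"{config_path} should contain a JSON object at the top level")
    return config


if __name__ == "__main__":
    # Path taken from the training command above; skipped if the file is absent.
    candidate = Path("configs/youtube-vos.json")
    if candidate.is_file():
        print("Top-level keys:", sorted(load_config(candidate)))
    else:
        print(f"{candidate} not found; run this from the repo root.")
```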

Test

Download the pre-trained model into the checkpoints folder.

mkdir checkpoints

Test script

python test.py -c checkpoints/dstt.pth -v data/DAVIS/JPEGImages/blackswan -m data/DAVIS/Annotations/blackswan
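
The test command takes a folder of frames (`-v`) and a folder of masks (`-m`). A quick pre-flight check can confirm the two folders line up. This is a sketch under a stated assumption: that each frame has a mask with the same file stem (for example `00000.jpg` and `00000.png`). Whether `test.py` requires this is not confirmed here, and the helper names are hypothetical.

```python
from pathlib import Path

IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png"}


def list_images(folder):
    """Return the image files in a folder, sorted by file name."""
    return sorted(
        p for p in Path(folder).iterdir()
        if p.is_file() and p.suffix.lower() in IMAGE_SUFFIXES
    )


def check_pairing(frame_dir, mask_dir):
    """Compare file stems; return (frames_without_mask, masks_without_frame)."""
    frame_stems = {p.stem for p in list_images(frame_dir)}
    mask_stems = {p.stem for p in list_images(mask_dir)}
    return sorted(frame_stems - mask_stems), sorted(mask_stems - frame_stems)


if __name__ == "__main__":
    # Paths taken from the test command above; skipped if they are absent.
    frames = Path("data/DAVIS/JPEGImages/blackswan")
    masks = Path("data/DAVIS/Annotations/blackswan")
    if frames.is_dir() and masks.is_dir():
        no_mask, no_frame = check_pairing(frames, masks)
        print(f"frames without a mask: {len(no_mask)}")
        print(f"masks without a frame: {len(no_frame)}")
    else:
        print("Frame or mask folder not found; download DAVIS into data/ first.")
```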

Citing DSTT

If you find DSTT useful in your research, please consider citing:

@article{Liu_2021_DSTT,
  title={Decoupled Spatial-Temporal Transformer for Video Inpainting},
  author={Liu, Rui and Deng, Hanming and Huang, Yangyi and Shi, Xiaoyu and Lu, Lewei and Sun, Wenxiu and Wang, Xiaogang and Dai, Jifeng and Li, Hongsheng},
  journal={arXiv preprint arXiv:2104.06637},
  year={2021}
}

Acknowledgement

This code relies heavily on the video inpainting framework from Spatial-Temporal Transformer Network (STTN).
