VSR-Transformer - This paper proposes a new Transformer for video super-resolution (called VSR-Transformer).

Last update: Nov 13, 2022

Related tags

Overview

VSR-Transformer

By Jiezhang Cao, Yawei Li, Kai Zhang, Luc Van Gool

This paper proposes a new Transformer for video super-resolution (called VSR-Transformer). Our VSR-Transformer block contains a spatial-temporal convolutional self-attention layer and a bidirectionaloptical flow-based feed-forward layer. Our VSR-Transformer is able to improve the performance of VSR. This repository is the official implementation of "Video Super-Resolution Transformer".

Dependencies and Installation

Python >= 3.7 (Recommend to use Anaconda or Miniconda)
PyTorch >= 1.3
NVIDIA GPU + CUDA

Clone repository

git clone https://github.com/caojiezhang/VSR-Transformer.git

Install dependent packages

cd VSR-Transformer
pip install -r requirements.txt

Compile environment
```
python setup.py develop
```

Dataset Preparation

Please refer to DatasetPreparation.md for more details.
The descriptions of currently supported datasets (torch.utils.data.Dataset classes) are in Datasets.md.

Training

Please refer to configuration of training for more details and pretrained models.

# Train on REDS
CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 python -m torch.distributed.launch --nproc_per_node=8 --master_port=4321 basicsr/train.py -opt options/train/train_vsrTransformer_x4_REDS.yml --launcher pytorch
# Train on Vimeo-90K
CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 python -m torch.distributed.launch --nproc_per_node=8 --master_port=4321 basicsr/train.py -opt options/train/train_vsrTransformer_x4_Vimeo.yml --launcher pytorch

Testing

Please refer to configuration of testing for more details.

# Test on REDS
CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 python -m torch.distributed.launch --nproc_per_node=8 --master_port=4321 basicsr/test.py -opt options/test/test_vsrTransformer_x4_REDS.yml --launcher pytorch

# Test on Vimeo-90K
CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 python -m torch.distributed.launch --nproc_per_node=8 --master_port=4321 basicsr/test.py -opt options/test/test_vsrTransformer_x4_Vimeo.yml --launcher pytorch

# Test on Vid4
CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 python -m torch.distributed.launch --nproc_per_node=8 --master_port=4321 basicsr/test.py -opt options/test/test_vsrTransformer_x4_Vid4.yml --launcher pytorch

Citation

If you use this code of our paper please cite:

@article{cao2021vsrt,
  title={Video Super-Resolution Transformer},
  author={Cao, Jiezhang and Li, Yawei and Zhang, Kai and Van Gool, Luc},
  journal={arXiv},
  year={2021}
}

Acknowledgments

This repository is implemented based on BasicSR. If you use the repository, please consider citing BasicSR.

VSR-Transformer - This paper proposes a new Transformer for video super-resolution (called VSR-Transformer).

Related tags

Overview

VSR-Transformer

Dependencies and Installation

Dataset Preparation

Training

Testing

Citation

Acknowledgments

Owner

Jiezhang Cao

Automatic number plate recognition using tech: Yolo, OCR, Scene text detection, scene text recognation, flask, torch

pytorch implementation of trDesign

EPSANet：An Efficient Pyramid Split Attention Block on Convolutional Neural Network

Revisiting Temporal Alignment for Video Restoration

An experimental technique for efficiently exploring neural architectures.

Securetar - A streaming wrapper around python tarfile and allow secure handling files and support encryption

Building blocks for uncertainty-aware cycle consistency presented at NeurIPS'21.

Implementation of the Paper: "Parameterized Hypercomplex Graph Neural Networks for Graph Classification" by Tuan Le, Marco Bertolini, Frank Noé and Djork-Arné Clevert

pip install python-office

This repo contains implementation of different architectures for emotion recognition in conversations.

Github project for Attention-guided Temporal Coherent Video Object Matting.

Example of semantic segmentation in Keras

Video Representation Learning by Recognizing Temporal Transformations. In ECCV, 2020.

Python scripts for performing 3D human pose estimation using the Mobile Human Pose model in ONNX.

Ensemble Learning Priors Driven Deep Unfolding for Scalable Snapshot Compressive Imaging [PyTorch]

PyTorch implementation of ''Background Activation Suppression for Weakly Supervised Object Localization''.

FirmWire is a full-system baseband firmware emulation platform for fuzzing, debugging, and root-cause analysis of smartphone baseband firmwares

TLDR: Twin Learning for Dimensionality Reduction

Circuit Training: An open-source framework for generating chip floor plans with distributed deep reinforcement learning

Implementation of momentum^2 teacher