Detail-Preserving Transformer for Light Field Image Super-Resolution

Last update: Jan 01, 2023

Overview

DPT

Official PyTorch implementation of the paper "Detail-Preserving Transformer for Light Field Image Super-Resolution", accepted by AAAI 2022.

Updates

2022.01: Our method is available in the newly released repository BasicLFSR, an open-source and easy-to-use toolbox for LF image SR.
2022.01: The code is released.

Requirements

Python 3.7.7
Pytorch=1.5.0
torchvision=0.6.0
h5py=2.8.0
Matlab
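As a quick sanity check before training, the pinned versions above can be compared with what is installed. The following is a minimal sketch, not part of the repository: the helper names `installed_version` and `check_requirements` are hypothetical, and it assumes each package exposes a `__version__` attribute (which torch, torchvision and h5py do).

```python
import importlib

# Versions listed in the Requirements section above.
REQUIRED = {"torch": "1.5.0", "torchvision": "0.6.0", "h5py": "2.8.0"}


def installed_version(module_name):
    """Return the module's __version__ string, or None if it cannot be imported."""
    try:
        module = importlib.import_module(module_name)
    except ImportError:
        return None
    return getattr(module, "__version__", None)


def check_requirements(required=REQUIRED):
    """Map each package name to a (required, installed, matches) tuple."""
    report = {}
    for name, wanted in required.items():
        found = installed_version(name)
        # A local build tag such as "1.5.0+cu101" still counts as a match.
        matches = found is not None and found.split("+")[0] == wanted
        report[name] = (wanted, found, matches)
    return report


if __name__ == "__main__":
    for name, (wanted, found, ok) in check_requirements().items():
        status = "OK" if ok else "MISMATCH"
        print(f"{name}: required {wanted}, found {found} -> {status}")
```

A mismatch does not necessarily mean the code will fail; the listed versions are simply the ones this README names.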

Dataset

We use the EPFL, HCInew, HCIold, INRIA and STFgantry datasets for both training and testing. You can download these datasets from Baidu Drive (key:912V).

Download the visual results

We share the super-resolved results generated by DPT so that researchers can compare their methods against it without running inference. The results are available at Baidu Drive (key:912V).

Prepare the datasets

To generate the training data, run `GenerateTrainingData.m` in Matlab.

To generate the testing data, run `GenerateTestData.m` in Matlab.

We also provide the processed datasets used in the paper. They are available at Baidu Drive (key:912V).

Train

To perform DPT training, please run

python train.py

Checkpoints will be saved to ./log/.
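To pick up the most recent checkpoint after a run, something like the sketch below may help. It is an illustration only: `latest_checkpoint` is a hypothetical helper, and because this README does not state the checkpoint file names or extensions, it simply returns the most recently modified file anywhere under ./log/.

```python
from pathlib import Path


def latest_checkpoint(log_dir="./log"):
    """Return the most recently modified file under log_dir, or None if there is none."""
    root = Path(log_dir)
    if not root.is_dir():
        return None
    # Search recursively, since the layout inside ./log/ is not documented here.
    files = [p for p in root.rglob("*") if p.is_file()]
    if not files:
        return None
    return max(files, key=lambda p: p.stat().st_mtime)


if __name__ == "__main__":
    print(latest_checkpoint())
```

If the log directory also holds text logs, filtering by the actual checkpoint extension would be a sensible refinement.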

Test

To evaluate DPT performance, please run

python test.py

The performance of DPT on the five datasets will be printed on the screen. The visual result of each scene will be saved in ./Results/. The PSNR and SSIM values of each scene will also be saved in ./PSNRSSIM/.
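For readers unfamiliar with the PSNR metric reported above, here is a minimal pure-Python sketch of its standard definition. It is not the repository's evaluation code: the function name `psnr` and the `peak` parameter are assumptions made for illustration, and the actual test script may differ in details such as which color channel is used, border cropping, or how values are averaged over views.

```python
import math


def psnr(reference, estimate, peak=1.0):
    """Peak signal-to-noise ratio in dB between two equally sized 2D images.

    Images are lists of rows of floats; `peak` is the maximum possible pixel value.
    """
    if len(reference) != len(estimate):
        raise ValueError("images must have the same number of rows")
    squared_error = 0.0
    count = 0
    for ref_row, est_row in zip(reference, estimate):
        if len(ref_row) != len(est_row):
            raise ValueError("rows must have the same length")
        for r, e in zip(ref_row, est_row):
            squared_error += (r - e) ** 2
            count += 1
    mse = squared_error / count
    if mse == 0:
        return math.inf  # identical images
    return 10.0 * math.log10(peak ** 2 / mse)


if __name__ == "__main__":
    ground_truth = [[0.0, 0.5], [1.0, 0.25]]
    restored = [[0.1, 0.6], [0.9, 0.35]]  # every pixel is off by 0.1
    print(round(psnr(ground_truth, restored), 2))
```

In the example every pixel differs by 0.1, so the mean squared error is 0.01 and the PSNR is 10·log10(1 / 0.01) = 20 dB.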

Generate visual results

To generate the visual super-resolved results, run `GenerateResultImages.m` in Matlab.

The '.mat' files in ./Results/ will be converted to '.png' images in ./SRimages/.

To generate the visual gradient results, please run

python generate_visual_gradient_map.py

Gradient results will be saved to ./GRAimages/.
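As background on what a gradient map is, the sketch below computes a gradient magnitude image with central differences. This is a generic illustration and an assumption on our part; it does not reproduce `generate_visual_gradient_map.py`, whose exact operator is not described in this README. The function name `gradient_magnitude` is hypothetical.

```python
import math


def gradient_magnitude(image):
    """Central-difference gradient magnitude of a 2D image given as a list of rows."""
    height, width = len(image), len(image[0])
    out = [[0.0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            # Clamp indices at the border (replicate padding).
            left = image[y][max(x - 1, 0)]
            right = image[y][min(x + 1, width - 1)]
            up = image[max(y - 1, 0)][x]
            down = image[min(y + 1, height - 1)][x]
            gx = (right - left) / 2.0  # horizontal derivative
            gy = (down - up) / 2.0     # vertical derivative
            out[y][x] = math.hypot(gx, gy)
    return out


if __name__ == "__main__":
    ramp = [[0.0, 1.0, 2.0] for _ in range(3)]  # intensity increases left to right
    for row in gradient_magnitude(ramp):
        print(row)
```

Flat regions map to zero and edges map to large values, which is why such maps are useful for inspecting how well fine details are preserved.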

Citation

If you find this work helpful, please consider citing the following paper:

@article{wang2022detail,
  title={Detail Preserving Transformer for Light Field Image Super-Resolution},
  author={Wang, Shunzhou and Zhou, Tianfei and Lu, Yao and Di, Huijun},
  journal={arXiv preprint arXiv:2201.00346},
  year={2022}
}

Acknowledgements

This code is heavily based on LF-DFNet. We also referred to the code of VSR-Transformer, COLA-Net, and SPSR, and we thank the authors for sharing their code. We would like to thank Yingqian Wang for his help with LFSR. We would also like to thank Zhengyu Liang for adding our DPT to the BasicLFSR repository.

Contact

If you have any questions about this work, feel free to contact me via [email protected].
