Image super-resolution through deep learning

Last update: Dec 28, 2022

srez

Image super-resolution through deep learning. This project uses deep learning to upscale 16x16 images by a 4x factor. The resulting 64x64 images display sharp features that are plausible based on the dataset that was used to train the neural net.

Here's a random, non-cherry-picked example of what this network can do. From left to right, the first column is the 16x16 input image, the second is what you would get from standard bicubic interpolation, the third is the output generated by the neural net, and the fourth is the ground truth.

As you can see, the network is able to produce a very plausible reconstruction of the original face. As the dataset is mainly composed of well-illuminated faces looking straight ahead, the reconstruction is poorer when the face is at an angle, poorly illuminated, or partially occluded by eyeglasses or hands.

This particular example was produced after training the network for 3 hours on a GTX 1080 GPU, equivalent to 130,000 batches or about 10 epochs.
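The figures quoted above can be sanity-checked with a little arithmetic. The sketch below derives the implied throughput and epoch size from the numbers in the text. The last step additionally assumes a batch size of 16, which this text does not state, together with the published CelebA size of 202,599 images.

```python
# Figures taken from the text above
batches = 130_000
training_hours = 3
quoted_epochs = 10

# Implied throughput on the GTX 1080
batches_per_second = batches / (training_hours * 3600)

# Implied number of batches in one pass over the dataset
batches_per_epoch = batches / quoted_epochs

# ASSUMPTIONS (not stated in the text): batch size of 16,
# and the published CelebA size of 202,599 images
assumed_batch_size = 16
celeba_images = 202_599
implied_epochs = batches * assumed_batch_size / celeba_images

print(f"{batches_per_second:.1f} batches/s")
print(f"{batches_per_epoch:.0f} batches per epoch")
print(f"{implied_epochs:.1f} epochs under the assumed batch size")
```

Under that assumed batch size the result lands close to the "about 10 epochs" quoted above, which suggests the assumption is at least plausible.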

How it works

In essence, the architecture is a DCGAN where the input to the generator network is the 16x16 image rather than a noise vector drawn from a multivariate Gaussian distribution.
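To make the difference in generator input concrete, here is a minimal NumPy sketch of the tensor shapes involved. The function toy_generator is a hypothetical stand-in with no learned weights, and the two 2x upsampling stages, the batch size of 8, and the noise dimension of 100 are illustrative assumptions rather than details of this project's network.

```python
import numpy as np

rng = np.random.default_rng(0)

def upsample_2x(x):
    """Nearest-neighbour 2x upsampling of a (batch, height, width, channels) array."""
    return x.repeat(2, axis=1).repeat(2, axis=2)

def toy_generator(low_res):
    """Hypothetical stand-in for the generator: 16x16 -> 32x32 -> 64x64.

    A real generator would apply learned convolutions around each
    upsampling stage; this only reproduces the shapes.
    """
    return upsample_2x(upsample_2x(low_res))

# Classic DCGAN: the generator input is a random noise vector
noise = rng.standard_normal((8, 100))

# Here: the generator input is the low-resolution image itself
low_res = rng.random((8, 16, 16, 3))
high_res = toy_generator(low_res)

print(noise.shape, low_res.shape, high_res.shape)
```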

In addition, the loss function of the generator has a term that measures the L1 difference between the 16x16 input and a downscaled version of the image produced by the generator.

The adversarial term of the loss function ensures the generator produces plausible faces, while the L1 term ensures that those faces resemble the low-res input data. We have found that this L1 term greatly accelerates the convergence of the network during the first batches and also appears to prevent the generator from getting stuck in a poor local solution.
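The two-term generator loss described above can be sketched in NumPy as follows. This is an illustration under stated assumptions, not the project's actual code: the 4x average-pooling downscale, the sigmoid cross-entropy form of the adversarial term, and the l1_weight blending factor are all choices made for this example.

```python
import numpy as np

def downscale_4x(images):
    """Average-pool a (batch, H, W, C) array by 4 in each spatial dimension.

    ASSUMPTION: average pooling is used here for simplicity.
    """
    b, h, w, c = images.shape
    return images.reshape(b, h // 4, 4, w // 4, 4, c).mean(axis=(2, 4))

def l1_term(generated, low_res):
    """Mean absolute difference between the downscaled output and the 16x16 input."""
    return np.mean(np.abs(downscale_4x(generated) - low_res))

def adversarial_term(disc_logits_on_fake):
    """Sigmoid cross-entropy against the label "real" for the generated images.

    -log(sigmoid(x)) is computed stably as log(1 + exp(-x)).
    """
    return np.mean(np.logaddexp(0.0, -disc_logits_on_fake))

def generator_loss(generated, low_res, disc_logits_on_fake, l1_weight=0.9):
    """Blend the two terms; l1_weight is a hypothetical hyperparameter."""
    adv = adversarial_term(disc_logits_on_fake)
    l1 = l1_term(generated, low_res)
    return (1.0 - l1_weight) * adv + l1_weight * l1

# Usage: an output that downsamples exactly to the input has zero L1 term
low_res = np.random.default_rng(0).random((2, 16, 16, 3))
generated = low_res.repeat(4, axis=1).repeat(4, axis=2)
loss = generator_loss(generated, low_res, disc_logits_on_fake=np.zeros(2))
print(loss)
```

The L1 term only constrains the generator's output after downscaling, so many different 64x64 images satisfy it equally well. Choosing a plausible face among them is left to the adversarial term.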

Finally, the generator network relies on ResNet modules, as we've found them to train substantially faster than older architectures. The adversarial (discriminator) network is much simpler, as ResNet modules did not provide an advantage there during our experimentation.
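For readers unfamiliar with ResNet modules, the core idea is a skip connection: the block outputs its input plus a learned correction. The NumPy sketch below illustrates that idea only. It uses 1x1 convolutions and omits normalization for brevity, whereas real ResNet blocks typically use 3x3 convolutions, and none of the names or shapes here are taken from this project's code.

```python
import numpy as np

def relu(x):
    return np.maximum(x, 0.0)

def conv1x1(x, weights):
    """1x1 convolution: the same linear map applied to every pixel's channel vector."""
    return x @ weights

def residual_block(x, w1, w2):
    """Compute y = x + F(x), where F is conv -> ReLU -> conv.

    The skip connection (the leading "x +") lets the block start out
    close to the identity, which tends to make deep networks easier to train.
    """
    return x + conv1x1(relu(conv1x1(x, w1)), w2)

rng = np.random.default_rng(0)
features = rng.random((2, 16, 16, 8))      # (batch, height, width, channels)
w1 = rng.standard_normal((8, 8)) * 0.1     # illustrative random weights
w2 = rng.standard_normal((8, 8)) * 0.1
out = residual_block(features, w1, w2)
print(out.shape)
```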

Requirements

You will need Python 3 with TensorFlow, numpy, scipy and moviepy. See requirements.txt for details.

Dataset

Once the required software is installed, you will also need the Large-scale CelebFaces Attributes (CelebA) Dataset. The model expects the Align&Cropped Images version. Extract all images to a subfolder named dataset, for example srez/dataset/lotsoffiles.jpg.
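Before starting a long training run, it can help to confirm that the images were extracted where the model expects them. The helper below is a hypothetical convenience and not part of the project. It assumes it is run from the srez folder and that the images carry a .jpg extension, as in the example path above.

```python
from pathlib import Path

def count_dataset_images(dataset_dir="dataset"):
    """Return the number of .jpg files directly inside dataset_dir.

    Raises FileNotFoundError if the folder does not exist.
    """
    path = Path(dataset_dir)
    if not path.is_dir():
        raise FileNotFoundError(f"Dataset folder not found: {path.resolve()}")
    return sum(1 for p in path.iterdir() if p.suffix.lower() == ".jpg")
```

Calling count_dataset_images() from the srez folder should return a count in the hundreds of thousands for the full CelebA set, which contains 202,599 images. A count of zero usually means the archive was extracted into a nested subfolder.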

Training the model

To train with default settings, run python3 srez_main.py --run train. The script will periodically write an example batch in PNG format to the srez/train folder, and checkpoint data will be stored in the srez/checkpoint folder.

After the network has trained, you can also produce an animation showing the evolution of the output by running python3 srez_main.py --run demo.

About the author

LinkedIn profile of David Garcia.

Owner

David Garcia
