Image morphing without reference points by applying warp maps and optimizing over them.

Last update: Dec 19, 2022

Overview

Differentiable Morphing

Image morphing without reference points by applying warp maps and optimizing over them.

Differentiable Morphing is machine learning algorithm that can morph any two images without reference points. It called "differentiable morphing" because neural network here is not used in traditional data to label mapping sense, but as an easy way to solve optimization problem where one image is mapped to another via warp maps that are found by gradient descent. So after maps are found there is no need for the network itself.

Results

Dependencies

Tensorflow 2.1.3 and above.

Usage

Install proper dependencies:

pip install -r requirements.txt

Use the program:

morph.py -s images/img_1.jpg -t images/img_2.jpg

-s Source file
-t Target file

Unnecessary parameters:
-e Number of epochs to train maps on training stage
-a Addition map multiplier
-m Multiplication map multiplier
-w Warp map multiplier
-add_first If true add map would be applied to the source image before mult map. (might work better in some cases)

Idea

Suppose we want to produce one image from another in a way that we use as much useful information as possible, so if two given images share any similarities between them we make use of these similarities.

After several trials I found out that the best way to achieve such effect is to use following formula.

Here "Mult map" removes unnecessary parts of an image and shifts color balance, "Add map" creates new colors that are not present in original image and "Warp map" distort an image in some way to reproduce shifting, rotation and scaling of objects. W operation is dense_image_warp method that present in tensorflow and usually used for optical flow estimation tasks.

All maps are found by gradient descent using very simple convolution network. Now, by applying alpha scaling parameter to every map we will get smooth transition from one image to another without any loss of useful data (at least for the given toy example).

Thoughts

Notice that all maps produced generate somewhat meaningful interpolation without any understanding of what exactly present in the images. That means that warp operation might be very useful in images processing tasks. In some sense warp operation might be thought as long range convolution, because it can "grab" data from any point of an image and reshape it in some useful way. Therefore it might be beneficial to use warp operation in classification tasks and might allow networks be less susceptible to small perturbations of the data. But especially, it should be beneficial to use in generation task. It should be much easier to produce new data by combining and perturbating several examples of known data points than to learn a function that represents all data points at ones.

Image morphing without reference points by applying warp maps and optimizing over them.

Related tags

Overview

Differentiable Morphing

Image morphing without reference points by applying warp maps and optimizing over them.

Results

Dependencies

Usage

Idea

Thoughts

Owner

Alex K

Human Pose estimation with TensorFlow framework

Neural network for digit classification powered by cuda

RaceBERT -- A transformer based model to predict race and ethnicty from names

Official PyTorch implementation of Less is More: Pay Less Attention in Vision Transformers.

Pixel Consensus Voting for Panoptic Segmentation (CVPR 2020)

RL and distillation in CARLA using a factorized world model

Experiments for Operating Systems Lab (ETCS-352)

MicroNet: Improving Image Recognition with Extremely Low FLOPs (ICCV 2021)

3D-CariGAN: An End-to-End Solution to 3D Caricature Generation from Normal Face Photos

A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more

PyTorch code for 'Efficient Single Image Super-Resolution Using Dual Path Connections with Multiple Scale Learning'

A tensorflow implementation of Fully Convolutional Networks For Semantic Segmentation

A PyTorch Toolbox for Face Recognition

SpineAI Bilsky Grading With Python

Pytorch implementation of VAEs for heterogeneous likelihoods.

Official implementation for "Image Quality Assessment using Contrastive Learning"

A set of simple scripts to process the Imagenet-1K dataset as TFRecords and make index files for NVIDIA DALI.

Minimisation of a negative log likelihood fit to extract the lifetime of the D^0 meson (MNLL2ELDM)

Code for One-shot Talking Face Generation from Single-speaker Audio-Visual Correlation Learning (AAAI 2022)

Voila - Voilà turns Jupyter notebooks into standalone web applications