a morph transfer UGATIT for image translation.

Last update: Nov 14, 2022


Morph-UGATIT


Introduction

Technical documentation in Chinese

This is a PyTorch implementation of UGATIT, from the paper "U-GAT-IT: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation".

Additionally, I modify the model by adding two components: an MLP module that learns a latent zone, and an identity-preserving loss. Together, these two components let UGATIT achieve a progressive domain transfer for image translation. I call this method Morph UGATIT.

My work has two aspects:

First, following the official TensorFlow code of UGATIT, I reimplement it in PyTorch, staying very close to the original TF model, including the network and the training hyperparameters.
Second, I add an MLP module that introduces a latent code for the generator, and I use an identity-preserving loss to learn more features that are common to the different domains.

I train the model on two datasets, "adult2child" and "selfie2anime".

Requirements

Python 3.7
PyTorch >= 1.6
dlib. Before installing dlib, you should install CMake and Boost:

pip install Cmake
pip install Boost
pip install dlib

Other commonly used libraries.
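
Before starting a training run, it can help to check which of the requirements listed above are already available. The following is a minimal sketch using only the standard library. The helper name `check_requirements` is hypothetical and not part of this repo, the import name `torch` for PyTorch is an assumption of the sketch, and the check only confirms that a module can be found; it does not verify the PyTorch version.

```python
import importlib.util
import sys

# Import names for the packages listed above.
# Assumption: PyTorch is importable as "torch" and dlib as "dlib".
REQUIRED_MODULES = ("torch", "dlib")


def check_requirements(modules=REQUIRED_MODULES, min_python=(3, 7)):
    """Return a dict mapping each requirement to True (found) or False (missing).

    Hypothetical helper for illustration; it is not part of the repo.
    """
    report = {"python": sys.version_info[:2] >= tuple(min_python)}
    for name in modules:
        # find_spec returns None when a top-level module is not installed.
        report[name] = importlib.util.find_spec(name) is not None
    return report


if __name__ == "__main__":
    for name, ok in check_requirements().items():
        print(f"{name}: {'found' if ok else 'missing'}")
```

If `dlib` is reported as missing, the install commands above are the place to start.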

How to Use

There are many models in my repo, but you only need two of them and their corresponding Python scripts.

UGATIT: "configs/cfgs_ugatit.py", "models/ugatit.py", "tool/train_ugatit.py", "tool/demo_ugatit.py"
Morph UGATIT: "configs/cfgs_s_ugatit_plus.py", "models/s_ugatit_plus.py", "tool/train_s_ugatit_plus.py", "tool/demo_morph_ugatit.py"

Train step

Get the datasets. The "adult2child" dataset comes from G-Lab and is generated by StyleGAN. You can download it here

The "selfie2anime" dataset comes from the official UGATIT repo.

Set the configuration. The configuration files can be found in the "configs" dir. You only need to focus on "cfgs_ugatit.py" and "cfgs_s_ugatit_plus.py". Please change:

dirA: domain A dataset path.
dirB: domain B dataset path.
anime: whether dataset is "selfie2anime".
tensorboard: tensorboard log path.
saved_dir: save model weight into "saved_dir".
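
As an illustration of the fields listed above, here is a minimal sketch of how they could be grouped and sanity-checked before training. This is an assumption-laden example, not the actual layout of "cfgs_ugatit.py": the `TrainPaths` class, its `problems` method, and all path values are hypothetical placeholders. Only the field names come from the list above.

```python
from dataclasses import dataclass
from pathlib import Path


@dataclass
class TrainPaths:
    """Hypothetical container mirroring the config fields listed above."""
    dirA: str         # domain A dataset path
    dirB: str         # domain B dataset path
    anime: bool       # True when the dataset is "selfie2anime"
    tensorboard: str  # tensorboard log path
    saved_dir: str    # directory where model weights are saved

    def problems(self):
        """Return a list of problems; an empty list means both dataset dirs exist."""
        issues = []
        for field_name in ("dirA", "dirB"):
            value = getattr(self, field_name)
            if not Path(value).is_dir():
                issues.append(f"{field_name} is not an existing directory: {value}")
        return issues


if __name__ == "__main__":
    # Placeholder paths for illustration only; replace them with your own.
    cfg = TrainPaths(
        dirA="path/to/domainA",
        dirB="path/to/domainB",
        anime=False,
        tensorboard="path/to/logs",
        saved_dir="path/to/weights",
    )
    for issue in cfg.problems():
        print(issue)
```

Checking the dataset paths up front avoids starting a run that fails only when the first batch is loaded.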

Start training.

cd tool
python train_ugatit.py   # ugatit
python train_s_ugatit_plus.py   #  morph ugatit

You can also use TensorBoard to check the loss curves and some visualizations.

Evaluation step

Since dlib is required, you should download the dlib model weights here. Then set "alignment_loc" in "tool/demo_xxx.py" to the path of your dlib model weights, where "xxx" means "ugatit" or "morph_ugatit". Finally, put a test image into a dir.

cd tool
python demo_ugatit.py --type ugatit --resume ${ckpt path}$ --input ${image dir}$ --saved-dir ${result location}$ --align
python demo_morph_ugatit.py --resume ${ckpt path}$ --input ${image dir}$ --saved-dir ${result location}$ --align

Note: if you want to try "selfie2anime", please add the extra flag "--anime".
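
When running the demo over several checkpoints or image dirs, it may be convenient to assemble the commands above programmatically. The following is a minimal sketch under stated assumptions: the helper `build_demo_command` is hypothetical and not part of this repo, and the checkpoint and directory values are placeholders. It uses only the flags shown above ("--type", "--resume", "--input", "--saved-dir", "--align", "--anime").

```python
import shlex
import sys


def build_demo_command(script, resume, input_dir, saved_dir,
                       model_type=None, align=True, anime=False):
    """Build the argument list for one of the demo scripts (hypothetical helper).

    model_type is only passed for demo_ugatit.py, which takes "--type ugatit"
    in the commands shown above.
    """
    cmd = [sys.executable, script]
    if model_type is not None:
        cmd += ["--type", model_type]
    cmd += ["--resume", resume, "--input", input_dir, "--saved-dir", saved_dir]
    if align:
        cmd.append("--align")
    if anime:
        # Extra flag needed for the "selfie2anime" dataset.
        cmd.append("--anime")
    return cmd


if __name__ == "__main__":
    # Placeholder values for illustration only.
    cmd = build_demo_command(
        "demo_ugatit.py",
        resume="path/to/ckpt",
        input_dir="path/to/images",
        saved_dir="path/to/results",
        model_type="ugatit",
        anime=True,
    )
    # Print a shell-safe version of the command instead of running it.
    print(" ".join(shlex.quote(arg) for arg in cmd))
```

The returned list can be passed to `subprocess.run(cmd, cwd="tool")` if you prefer to launch the script from Python rather than from the shell.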

Here I provide my pretrained model weights.

for "adult2child" dataset

ugatit

morph ugatit

for "selfie2anime" dataset

ugatit

More results can be seen here

References

official UGATIT repo
official CycleGAN repo
GLab, http://www.seeprettyface.com/
paper "Lifespan age transformation synthesis" and its official code.
