Official PyTorch implementation of U-GAT-IT: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation

Last update: Jan 04, 2023

Related tags

Deep Learning UGATIT-pytorch

Overview

U-GAT-IT — Official PyTorch Implementation

: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation

Paper | Official Tensorflow code

The results of the paper came from the Tensorflow code

U-GAT-IT: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation

Abstract We propose a novel method for unsupervised image-to-image translation, which incorporates a new attention module and a new learnable normalization function in an end-to-end manner. The attention module guides our model to focus on more important regions distinguishing between source and target domains based on the attention map obtained by the auxiliary classifier. Unlike previous attention-based methods which cannot handle the geometric changes between domains, our model can translate both images requiring holistic changes and images requiring large shape changes. Moreover, our new AdaLIN (Adaptive Layer-Instance Normalization) function helps our attention-guided model to flexibly control the amount of change in shape and texture by learned parameters depending on datasets. Experimental results show the superiority of the proposed method compared to the existing state-of-the-art models with a fixed network architecture and hyper-parameters.

Usage

├── dataset
   └── YOUR_DATASET_NAME
       ├── trainA
           ├── xxx.jpg (name, format doesn't matter)
           ├── yyy.png
           └── ...
       ├── trainB
           ├── zzz.jpg
           ├── www.png
           └── ...
       ├── testA
           ├── aaa.jpg 
           ├── bbb.png
           └── ...
       └── testB
           ├── ccc.jpg 
           ├── ddd.png
           └── ...

Train

> python main.py --dataset selfie2anime

If the memory of gpu is not sufficient, set --light to True

Test

> python main.py --dataset selfie2anime --phase test

Official PyTorch implementation of U-GAT-IT: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation

Related tags

Overview

U-GAT-IT — Official PyTorch Implementation

: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation

Paper | Official Tensorflow code

Usage

Train

Test

Architecture

Results

Ablation study

User study

Comparison

Owner

Hyeonwoo Kang

This is an open solution to the Home Credit Default Risk challenge 🏡

NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling

Code for the prototype tool in our paper "CoProtector: Protect Open-Source Code against Unauthorized Training Usage with Data Poisoning".

Code, pre-trained models and saliency results for the paper "Boosting RGB-D Saliency Detection by Leveraging Unlabeled RGB Images".

Build Low Code Automated Tensorflow, What-IF explainable models in just 3 lines of code.

Solve a Rubiks Cube using Python Opencv and Kociemba module

Indoor Panorama Planar 3D Reconstruction via Divide and Conquer

Repositório da disciplina de APC, no segundo semestre de 2021

Orthogonal Over-Parameterized Training

《Unsupervised 3D Human Pose Representation with Viewpoint and Pose Disentanglement》(ECCV 2020) GitHub: [fig9]

ETMO: Evolutionary Transfer Multiobjective Optimization

Pytorch implementation of the paper DocEnTr: An End-to-End Document Image Enhancement Transformer.

We utilize deep reinforcement learning to obtain favorable trajectories for visual-inertial system calibration.

这是一个facenet-pytorch的库，可以用于训练自己的人脸识别模型。

Object Tracking and Detection Using OpenCV

Digitalizing-Prescription-Image - PIRDS - Prescription Image Recognition and Digitalizing System is a OCR make with Tensorflow

Unified Pre-training for Self-Supervised Learning and Supervised Learning for ASR

ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for which no expressive speech corpus is available.

MediaPipe Kullanarak İleri Seviye Bilgisayarla Görü

PyTorch implementation of Spiking Neural Networks trained on surrogate gradient & BPTT using snntorch.