GLANet - The code for Global and Local Alignment Networks for Unpaired Image-to-Image Translation arxiv

Last update: Dec 14, 2022

Related tags

Deep Learning GLANet

Overview

GLANet

The code for Global and Local Alignment Networks for Unpaired Image-to-Image Translation arxiv

Framework: visualization results:

Getting Started

Installation

This code was tested with Pytorch 1.7.0, CUDA 10.2, and Python 3.7

Install Pytoch 1.7.0, torchvision, and other dependencies from http://pytorch.org
Install python libraries visdom and dominate for visualization

pip install visdom dominate

Clone this repo:

git clone https://github.com/ygjwd12345/GLANet.git
cd GLANet

Datasets

Please refer to the original CUT and CycleGAN to download datasets and learn how to create your own datasets.

    sh ./datasets/download_cyclegan_dataset.sh a2b

Available datasets are: apple2orange, summer2winter_yosemite, horse2zebra, monet2photo, cezanne2photo, ukiyoe2photo, vangogh2photo, maps, facades, iphone2dslr_flower, ae_photos

    sh ./datasets/download_pix2pix_dataset.sh xx

Available datasets are night2day, edges2handbags, edges2shoes, facades, maps

The Cityscapes dataset can be downloaded from https://cityscapes-dataset.com. After that, use the script ./datasets/prepare_cityscapes_dataset.py to prepare the dataset.

Training

Train the single-modal I2I translation model. Please check run.sh. For instance:

python train.py  \
--dataroot ./datasets/summer2winter \
--name summer2winter \
--model sc \
--gpu_ids 0 \
--lambda_spatial 10 \
--lambda_gradient 0 \
--attn_layers 4,7,9 \
--loss_mode cos \
--gan_mode lsgan \
--display_port 8093 \
--direction BtoA \
--patch_size 64

Testing

Test the FID score for all training epochs, please also check run.sh. For instance:

python test_fid.py \
--dataroot ./datasets/horse2zebra \
--checkpoints_dir ./checkpoints \
--name horse2zebra \
--gpu_ids 0 \
--model sc \
--num_test 0

Test the KID, cityscape score, D&C, LPIPS, please check run_dc_lpips.sh in evaluations folder. For instance:

python PerceptualSimilarity/lpips_2dirs.py -d0 /data2/gyang/TAGAN/results/summer2winter-F64-mixer/test_350/images/real_B -d1 /data2/gyang/TAGAN/results/summer2winter-F64-mixer/test_350/images/fake_B -o ./example_dists.txt --use_gpu
python3 segment.py test -d ./datasets/cityscapes -c 19 --arch drn_d_22 \
    --pretrained ./drn_d_22_cityscapes.pth --phase val --batch-size 1

Acknowledge

Our code is developed based on FSeSim and unguided. We also thank pytorch-fid for FID computation, LPIPS for diversity score, and D&C for density and coverage evaluation.

GLANet - The code for Global and Local Alignment Networks for Unpaired Image-to-Image Translation arxiv

Related tags

Overview

GLANet

Getting Started

Installation

Datasets

Training

Testing

Acknowledge

Owner

stanley

Implementation of the state of the art beat-detection, downbeat-detection and tempo-estimation model

Churn prediction

一个目标检测的通用框架(不需要cuda编译)，支持Yolo全系列(v2~v5)、EfficientDet、RetinaNet、Cascade-RCNN等SOTA网络。

Implementation of the paper: "SinGAN: Learning a Generative Model from a Single Natural Image"

TensorFlow, PyTorch and Numpy layers for generating Orthogonal Polynomials

A simple PyTorch Implementation of Generative Adversarial Networks, focusing on anime face drawing.

Official implementation of the ICCV 2021 paper: "The Power of Points for Modeling Humans in Clothing".

TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization Tasks

Yolact-keras实例分割模型在keras当中的实现

Simplified interface for TensorFlow (mimicking Scikit Learn) for Deep Learning

A tensorflow implementation of an HMM layer

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code

Adversarial Robustness Comparison of Vision Transformer and MLP-Mixer to CNNs

Learning Facial Representations from the Cycle-consistency of Face (ICCV 2021)

DTCN IJCAI - Sequential prediction learning framework and algorithm

Sound Source Localization for AI Grand Challenge 2021

Towards Flexible Blind JPEG Artifacts Removal (FBCNN, ICCV 2021)

Code repository for paper `Skeleton Merger: an Unsupervised Aligned Keypoint Detector`.

Code repository for our paper regarding the L3D dataset.

Sign-to-Speech for Sign Language Understanding: A case study of Nigerian Sign Language