Aligning Latent and Image Spaces to Connect the Unconnectable

Last update: Jan 03, 2023

Related tags

Overview

About

This repo contains the official implementation of the Aligning Latent and Image Spaces to Connect the Unconnectable paper. It is a GAN model which can generate infinite images of diverse and complex scenes.

[Project page] [Paper]

Installation

To install, run the following command:

conda env create --file environment.yml --prefix ./env
conda activate ./env

Note: the tensorboard requirement is crucial, because otherwise upfirdn2d will not compile for some magical reason.

Training

To train the model, navigate to the project directory and run:

python infra/launch_local.py hydra.run.dir=. +experiment_name=my_experiment_name +dataset=dataset_name num_gpus=4

where dataset_name is the name of the dataset without .zip extension inside data/ directory (you can easily override the paths in configs/main.yml). So make sure that data/dataset_name.zip exists and should be a plain directory of images. See StyleGAN2-ADA repo for additional data format details. This training command will create an experiment inside experiments/ directory and will copy the project files into it. This is needed to isolate the code which produces the model.

Inference

The inference example can be found in notebooks/generate.ipynb

Data format

We use the same data format as the original StyleGAN2-ADA repo: it is a zip of images. It is assumed that all data is located in a single directory, specified in configs/main.yml. Put your datasets as zip archives into data/ directory.

Pretrained checkpoints

We provide checkpoints for the following datasets:

LHQ 1024x1024 with FID = 7.8. Note: this checkpoint has patch size of 1024x512, i.e. the image is generated in just 2 halves.

License

The project is based on the StyleGAN2-ADA repo developed by NVidia. I am not a lawyer, but I suppose that NVidia License applies to this project then.

Aligning Latent and Image Spaces to Connect the Unconnectable

Related tags

Overview

About

Installation

Training

Inference

Data format

Pretrained checkpoints

License

Owner

Ivan Skorokhodov

Code for our paper Aspect Sentiment Quad Prediction as Paraphrase Generation in EMNLP 2021.

💛 Code and Dataset for our EMNLP 2021 paper: "Perspective-taking and Pragmatics for Generating Empathetic Responses Focused on Emotion Causes"

MINERVA: An out-of-the-box GUI tool for offline deep reinforcement learning

A Simplied Framework of GAN Inversion

Portfolio Optimization and Quantitative Strategic Asset Allocation in Python

SeqTR: A Simple yet Universal Network for Visual Grounding

Tensorflow Implementation of SMU: SMOOTH ACTIVATION FUNCTION FOR DEEP NETWORKS USING SMOOTHING MAXIMUM TECHNIQUE

Snscrape-jsonl-urls-extractor - Extracts urls from jsonl produced by snscrape

Python scripts using the Mediapipe models for Halloween.

Algorithmic trading with deep learning experiments

Reaction SMILES-AA mapping via language modelling

Code for the Lovász-Softmax loss (CVPR 2018)

Code for the paper Task Agnostic Morphology Evolution.

A distributed deep learning framework that supports flexible parallelization strategies.

Survival analysis (SA) is a well-known statistical technique for the study of temporal events.

Lightweight stereo matching network based on MobileNetV1 and MobileNetV2

Distinguishing Commercial from Editorial Content in News

3ds-Ghidra-Scripts - Ghidra scripts to help with 3ds reverse engineering

Code for GNMR in ICDE 2021

GARCH and Multivariate LSTM forecasting models for Bitcoin realized volatility with potential applications in crypto options trading, hedging, portfolio management, and risk management