Tensor Component Analysis for Interpreting the Latent Space of GANs

[ paper | project page ]

Code to reproduce the results in the paper "Tensor Component Analysis for Interpreting the Latent Space of GANs".

./images/teaser.png

dependencies

Firstly, to install the required packages, please run:

$ pip install -r requirements.txt

Pretrained weights

To replicate the results in the paper, you'll first need to download the pre-trained weights. To do so, simply run the following from the command line:

./download_weights.sh

Quantitative results

building the prediction matrices

To reproduce Fig. 5, one can then run the ./quant.ipynb notebook using the pre-computed classification scores (please see this notebook for more details).
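
For a rough picture of what this step produces, the sketch below averages per-image classification scores into an (edit direction x attribute) prediction matrix. The file layout and key names here are assumptions for illustration only; the actual format of the pre-computed scores is documented in ./quant.ipynb itself.

import json
from pathlib import Path

import numpy as np

# Hypothetical layout: one JSON file of per-image attribute scores per edit direction.
# See ./quant.ipynb for the real file names and format of the pre-computed scores.
EDITS = ["blonde", "yaw", "pitch"]

def prediction_matrix(score_dir="./quant/scores"):
    """Average per-image scores into an (edit direction x attribute) prediction matrix."""
    mat = np.zeros((len(EDITS), len(EDITS)))
    for i, edit in enumerate(EDITS):
        scores = json.loads(Path(score_dir, f"{edit}.json").read_text())
        for j, attr in enumerate(EDITS):
            mat[i, j] = np.mean([s[attr] for s in scores])
    return mat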

manually computing predictions

To call the Microsoft Azure Face API and regenerate the predictions from scratch, one can run the shell script at ./quant/classify.sh. First, however, you need to generate the synthetic images to classify, which we detail below.
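
For reference, the sketch below shows roughly what one such request to the Face API's detect endpoint looks like. It is illustrative only: the endpoint, key, and image path are placeholders, the choice of the hair and headPose attributes is an assumption based on the attributes edited in the paper, and ./quant/classify.sh remains the authoritative entry point.

import requests

# Illustrative sketch only: the endpoint, key, and image path are placeholders, not repo values.
ENDPOINT = "https://<your-region>.api.cognitive.microsoft.com"
KEY = "<your-face-api-subscription-key>"

def classify_face(image_path):
    """POST one synthetic face to the Face API 'detect' endpoint and return its attributes."""
    with open(image_path, "rb") as f:
        response = requests.post(
            f"{ENDPOINT}/face/v1.0/detect",
            params={"returnFaceId": "false", "returnFaceAttributes": "hair,headPose"},
            headers={"Ocp-Apim-Subscription-Key": KEY, "Content-Type": "application/octet-stream"},
            data=f,
        )
    response.raise_for_status()
    return response.json()  # one entry per detected face, each with a 'faceAttributes' dict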

Qualitative results

generating the images

Reproducing the qualitative results (i.e. Fig. 6) involves generating the synthetic faces along with three edited versions of each, one per attribute of interest (hair colour, yaw, and pitch). To generate these images (which are also used for the quantitative results), simply run:

$ ./generate_quant_edits.sh

mode-wise edits

./images/116-blonde.gif ./images/116-yaw.gif ./images/116-pitch.gif

Manual edits along individual modes of the tensor are made by calling main.py with the --mode edit_modewise flag. For example, one can reproduce the images from Fig. 3 with:

$ python main.py --cp_rank 0 --tucker_ranks "4,4,4,512" --model_name pggan_celebahq1024 --penalty_lam 0.001 --resume_iters 1000 \
  --n_to_edit 10 \
  --mode edit_modewise \
  --attribute_to_edit male

multilinear edits

./images/thick.gif

Edits with the 'multilinear mixing' are instead achieved by loading the relevant weights and supplying the --mode edit_multilinear flag. For example, the images in Fig. 4 are generated with:

$ python main.py --cp_rank 0 --tucker_ranks "256,4,4,512" --model_name pggan_celebahq1024 --penalty_lam 0.001 --resume_iters 200000 \
  --n_to_edit 10 \
  --mode edit_multilinear \
  --attribute_to_edit thick

Please feel free to get in touch at: [email protected], where x=oldfield


credits

All the code in ./architectures/ and utils.py is imported directly from https://github.com/genforce/genforce, only lightly modified to support running the forward pass through the models partially and returning the intermediate tensors.
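
As an illustration of the kind of change meant here (the class below is a toy example, not the genforce code), a forward pass that runs only a slice of the layers and also returns the intermediate tensors might look like:

import torch

class PartialForwardGenerator(torch.nn.Module):
    """Toy wrapper: run only layers [start, stop) and keep every intermediate tensor."""

    def __init__(self, layers):
        super().__init__()
        self.layers = torch.nn.ModuleList(layers)

    def forward(self, z, start=0, stop=None):
        intermediates = []
        x = z
        for layer in self.layers[start:stop]:
            x = layer(x)
            intermediates.append(x)
        return x, intermediates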

The structure of the codebase follows https://github.com/yunjey/stargan, which we used as a template to build on. For this reason, you will find that small helper functions (e.g. the first few lines of main.py) are borrowed from the StarGAN codebase.
