Official Implementation of LARGE: Latent-Based Regression through GAN Semantics

Last update: Dec 06, 2022

Overview

LARGE: Latent-Based Regression through GAN Semantics

[Project Website] [Google Colab] [Paper]

Yotam Nitzan^*, Rinon Gal^*, Ofir Brenner, and Daniel Cohen-Or

Abstract: We propose a novel method for solving regression tasks using few-shot or weak supervision. At the core of our method is the fundamental observation that GANs are incredibly successful at encoding semantic information within their latent space, even in a completely unsupervised setting. For modern generative frameworks, this semantic encoding manifests as smooth, linear directions which affect image attributes in a disentangled manner. These directions have been widely used in GAN-based image editing. We show that such directions are not only linear, but that the magnitude of change induced on the respective attribute is approximately linear with respect to the distance traveled along them. By leveraging this observation, our method turns a pre-trained GAN into a regression model, using as few as two labeled samples. This enables solving regression tasks on datasets and attributes which are difficult to produce quality supervision for. Additionally, we show that the same latent-distances can be used to sort collections of images by the strength of given attributes, even in the absence of explicit supervision. Extensive experimental evaluations demonstrate that our method can be applied across a wide range of domains, leverage multiple latent direction discovery frameworks, and achieve state-of-the-art results in few-shot and low-supervision settings, even when compared to methods designed to tackle a single task.
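The core idea of the abstract can be illustrated with a minimal sketch. This is not the code from this repository: the function names `latent_distance` and `fit_linear_calibration`, the 512-dimensional latent size, and the random stand-in data are all assumptions made for illustration. In practice the latent codes would come from a GAN encoder and the direction from a latent direction discovery method. The sketch measures how far each latent code lies along a direction, fits a line from those distances to attribute labels using only two labeled samples, and sorts the collection by distance.

```python
import numpy as np


def latent_distance(latents, direction, offset=0.0):
    """Signed distance of each latent code along a unit-normalized direction.

    `offset` is a hypothetical hyperplane bias; 0.0 measures from the origin.
    """
    normal = direction / np.linalg.norm(direction)
    return latents @ normal + offset


def fit_linear_calibration(distances, labels):
    """Least-squares fit of: label ~ slope * distance + intercept."""
    slope, intercept = np.polyfit(distances, labels, deg=1)
    return slope, intercept


# Synthetic stand-ins (assumptions): random latent codes and a random direction.
rng = np.random.default_rng(0)
dim = 512
direction = rng.normal(size=dim)
latents = rng.normal(size=(100, dim))

# Synthetic labels that are exactly linear in the distance, for demonstration only.
distances = latent_distance(latents, direction)
labels = 3.0 * distances + 40.0

# Calibrate the regressor with just two labeled samples.
few_shot = [0, 1]
slope, intercept = fit_linear_calibration(distances[few_shot], labels[few_shot])
predictions = slope * distances + intercept

# Sorting needs no labels at all: order the images by latent distance.
order = np.argsort(distances)
```

Because the synthetic labels are exactly linear, two samples recover the mapping here. With real images the relationship is only approximately linear, so more labeled samples or a more robust fit would likely help.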

Sorting Examples

Black to Blond hair

Age

Fur Fluffiness

Sickness

Credits

StyleGAN2 implementation:
https://github.com/rosinality/stylegan2-pytorch
Copyright (c) 2019 Kim Seonghyeon
License (MIT) https://github.com/rosinality/stylegan2-pytorch/blob/master/LICENSE

pSp model and implementation:
https://github.com/eladrich/pixel2style2pixel
Copyright (c) 2020 Elad Richardson, Yuval Alaluf
License (MIT) https://github.com/eladrich/pixel2style2pixel/blob/master/LICENSE

e4e model and implementation:
https://github.com/omertov/encoder4editing
Copyright (c) 2021 omertov
License (MIT) https://github.com/omertov/encoder4editing/blob/main/LICENSE

ReStyle model and implementation:
https://github.com/yuval-alaluf/restyle-encoder/
Copyright (c) 2021 Yuval Alaluf
License (MIT) https://github.com/yuval-alaluf/restyle-encoder/blob/main/LICENSE

Acknowledgement

We would like to thank Raja Gyres, Yangyan Li, Or Patashnik, Yuval Alaluf, Amit Attia, Noga Bar and Zonzge Wu for helpful comments. We additionally thank Zonzge Wu for the trained e4e models for AFHQ cats and dogs.

Citation

If you use this code for your research, please cite our paper:

@misc{nitzan2021large,
      title={LARGE: Latent-Based Regression through GAN Semantics}, 
      author={Yotam Nitzan and Rinon Gal and Ofir Brenner and Daniel Cohen-Or},
      year={2021},
      eprint={2107.11186},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}
