Learning What and Where to Draw

Last update: Nov 18, 2022

Related tags

Deep Learning nips2016

Overview

###Learning What and Where to Draw Scott Reed, Zeynep Akata, Santosh Mohan, Samuel Tenka, Bernt Schiele, Honglak Lee

This is the code for our NIPS 2016 paper on text- and location-controllable image synthesis using conditional GANs. Much of the code is adapted from reedscot/icml2016 and dcgan.torch.

####Setup Instructions

You will need to install Torch, CuDNN, stnbhwd and the display package.

####How to train a text to image model:

Download the data including captions, location annotations and pretrained models.
Download the birds and humans image data.
Modify the CONFIG file to point to your data.
Run one of the training scripts, e.g. ./scripts/train_cub_keypoints.sh

####How to generate samples:

./scripts/run_all_demos.sh.
html files will be generated with results like the following:

Moving the bird's position via bounding box:

Moving the bird's position via keypoints:

Birds text to image with ground-truth keypoints:

Birds text to image with generated keypoints:

Humans text to image with ground-truth keypoints:

Humans text to image with generated keypoints:

####Citation

If you find this useful, please cite our work as follows:

@inproceedings{reed2016learning,
  title={Learning What and Where to Draw},
  author={Scott Reed and Zeynep Akata and Santosh Mohan and Samuel Tenka and Bernt Schiele and Honglak Lee},
  booktitle={Advances in Neural Information Processing Systems},
  year={2016}
}

Learning What and Where to Draw

Related tags

Overview

Owner

Scott Ellison Reed

Py-FEAT: Python Facial Expression Analysis Toolbox

Monify: an Expense tracker Program implemented in a Graphical User Interface that allows users to keep track of their expenses

YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone

face property detection pytorch

This is the code for our KILT leaderboard submission to the T-REx and zsRE tasks. It includes code for training a DPR model then continuing training with RAG.

Clustergram - Visualization and diagnostics for cluster analysis in Python

nextPARS, a novel Illumina-based implementation of in-vitro parallel probing of RNA structures.

SphereFace: Deep Hypersphere Embedding for Face Recognition

Impelmentation for paper Feature Generation and Hypothesis Verification for Reliable Face Anti-Spoofing

TensorFlow GNN is a library to build Graph Neural Networks on the TensorFlow platform.

A PyTorch Reimplementation of TecoGAN: Temporally Coherent GAN for Video Super-Resolution

3D dataset of humans Manipulating Objects in-the-Wild (MOW)

InsTrim: Lightweight Instrumentation for Coverage-guided Fuzzing

Autonomous Ground Vehicle Navigation and Control Simulation Examples in Python

Flow is a computational framework for deep RL and control experiments for traffic microsimulation.

SCAN: Learning to Classify Images without Labels, incl. SimCLR. [ECCV 2020]

Auditing Black-Box Prediction Models for Data Minimization Compliance

Privacy-Preserving Machine Learning (PPML) Tutorial Presented at PyConDE 2022

Re-implement CycleGAN in Tensorlayer

Few-Shot Object Detection via Association and DIscrimination