Deep Convolutional Generative Adversarial Networks

Last update: Dec 29, 2022

Related tags

Deep Learning dcgan_code

Overview

Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks

Alec Radford, Luke Metz, Soumith Chintala

All images in this paper are generated by a neural network. They are NOT REAL.

Full paper here: http://arxiv.org/abs/1511.06434

###Other implementations of DCGAN

##Summary of DCGAN We

stabilize Generative Adversarial networks with some architectural constraints
- Replace any pooling layers with strided convolutions (discriminator) and fractional-strided convolutions (generator).
- Use batchnorm in both the generator and the discriminator
- Remove fully connected hidden layers for deeper architectures. Just use average pooling at the end.
- Use ReLU activation in generator for all layers except for the output, which uses Tanh.
- Use LeakyReLU activation in the discriminator for all layers.
use the discriminator as a pre-trained net for CIFAR-10 classification and show pretty decent results.
generate really cool bedroom images that look super real
To convince you that the network is not cheating:
- show the interpolated latent space, where transitions are really smooth and every image in the latent space is a bedroom.
- show bedrooms after one epoch of training (with a 0.0002 learning rate), come on the network cant really memorize at this stage.
To explore what the representations that the network learnt,
- show deconvolution over the filters, to show that maximal activations occur at objects like windows and beds
- figure out a way to identify and remove filters that draw windows in generation.
  - Now you can control the generator to not output certain objects.
Because we are tripping
- Smiling woman - neutral woman + neutral man = Smiling man. Whuttttt!
- man with glasses - man without glasses + woman without glasses = woman with glasses. Omg!!!!
learnt a latent space in a completely unsupervised fashion where ROTATIONS ARE LINEAR in this latent space. WHHHAAATT????!!!!!!
Figure 11, trained on imagenet has a plane with bird legs. so cooool.

Bedrooms after 5 epochs

Generated bedrooms after five epochs of training. There appears to be evidence of visual under-fitting via repeated textures across multiple samples.

Bedrooms after 1 epoch

Generated bedrooms after one training pass through the dataset. Theoretically, the model could learn to memorize training examples, but this is experimentally unlikely as we train with a small learning rate and minibatch SGD. We are aware of no prior empirical evidence demonstrating memorization with SGD and a small learning rate in only one epoch.

Walking from one point to another in bedroom latent space

Interpolation between a series of 9 random points in Z show that the space learned has smooth transitions, with every image in the space plausibly looking like a bedroom. In the 6th row, you see a room without a window slowly transforming into a room with a giant window. In the 10th row, you see what appears to be a TV slowly being transformed into a window.

Forgetting to draw windows

Top row: un-modified samples from model. Bottom row: the same samples generated with dropping out ”window” filters. Some windows are removed, others are transformed into objects with similar visual appearance such as doors and mirrors. Although visual quality decreased, overall scene composition stayed similar, suggesting the generator has done a good job disentangling scene representation from object representation. Extended experiments could be done to remove other objects from the image and modify the objects the generator draws.

Deep Convolutional Generative Adversarial Networks

Related tags

Overview

Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks

Alec Radford, Luke Metz, Soumith Chintala

Bedrooms after 5 epochs

Bedrooms after 1 epoch

Walking from one point to another in bedroom latent space

Forgetting to draw windows

Google image search from generations

Arithmetic on faces

Rotations are linear in latent space

More faces

Album covers

Imagenet generations

Owner

Alec Radford

An implementation of Fastformer: Additive Attention Can Be All You Need in TensorFlow

PyTorch implementation of Octave Convolution with pre-trained Oct-ResNet and Oct-MobileNet models

When are Iterative GPs Numerically Accurate?

Latent Execution for Neural Program Synthesis

PyTorch-LIT is the Lite Inference Toolkit (LIT) for PyTorch which focuses on easy and fast inference of large models on end-devices.

A developer interface for creating Chat AIs for the Chai app.

Gems & Holiday Package Prediction

The official implementation of Variable-Length Piano Infilling (VLI).

[RSS 2021] An End-to-End Differentiable Framework for Contact-Aware Robot Design

Code for the paper "Query Embedding on Hyper-relational Knowledge Graphs"

Time Series Cross-Validation -- an extension for scikit-learn

Code for ACL 2019 Paper: "COMET: Commonsense Transformers for Automatic Knowledge Graph Construction"

A toolkit for developing and comparing reinforcement learning algorithms.

subpixel: A subpixel convnet for super resolution with Tensorflow

PyContinual (An Easy and Extendible Framework for Continual Learning)

Tzer: TVM Implementation of "Coverage-Guided Tensor Compiler Fuzzing with Joint IR-Pass Mutation (OOPSLA'22)“.

This repository contains numerical implementation for the paper Intertemporal Pricing under Reference Effects: Integrating Reference Effects and Consumer Heterogeneity.

clDice - a Novel Topology-Preserving Loss Function for Tubular Structure Segmentation

Code release for the paper “Worldsheet Wrapping the World in a 3D Sheet for View Synthesis from a Single Image”, ICCV 2021.

Official implementation of Pixel-Level Bijective Matching for Video Object Segmentation