The code for the NeurIPS 2021 paper "A Unified View of cGANs with and without Classifiers".

Last update: May 28, 2022

Overview

Energy-based Conditional Generative Adversarial Network (ECGAN)

This is the code for the NeurIPS 2021 paper "A Unified View of cGANs with and without Classifiers". The repository is modified from StudioGAN. If you find our work useful, please consider citing the following paper:

@inproceedings{chen2021ECGAN,
  title   = {A Unified View of cGANs with and without Classifiers},
  author  = {Si-An Chen and Chun-Liang Li and Hsuan-Tien Lin},
  booktitle = {Advances in Neural Information Processing Systems},
  year    = {2021}
}

Please feel free to contact Si-An Chen if you have any questions about the code/paper.

Introduction

We propose a new Conditional Generative Adversarial Network (cGAN) framework called Energy-based Conditional Generative Adversarial Network (ECGAN) which provides a unified view of cGANs and achieves state-of-the-art results. We use the decomposition of the joint probability distribution to connect the goals of cGANs and classification as a unified framework. The framework, along with a classic energy model to parameterize distributions, justifies the use of classifiers for cGANs in a principled manner. It explains several popular cGAN variants, such as ACGAN, ProjGAN, and ContraGAN, as special cases with different levels of approximations. An illustration of the framework is shown below.

Requirements

Anaconda
Python >= 3.6
6.0.0 <= Pillow <= 7.0.0
scipy == 1.1.0 (Recommended for fast loading of Inception Network)
sklearn
seaborn
h5py
tqdm
torch >= 1.6.0 (Recommended for mixed precision training and knn analysis)
torchvision >= 0.7.0
tensorboard
5.4.0 <= gcc <= 7.4.0 (Recommended for proper use of adaptive discriminator augmentation module)

You can install the recommended environment as follows:

conda env create -f environment.yml -n studiogan

With docker, you can use:

docker pull mgkang/studiogan:0.1

Quick Start

Train (-t) and evaluate (-e) the model defined in CONFIG_PATH using GPU 0

CUDA_VISIBLE_DEVICES=0 python3 src/main.py -t -e -c CONFIG_PATH

Train (-t) and evaluate (-e) the model defined in CONFIG_PATH using GPUs (0, 1, 2, 3) and DataParallel

CUDA_VISIBLE_DEVICES=0,1,2,3 python3 src/main.py -t -e -c CONFIG_PATH

Try python3 src/main.py to see available options.

Dataset

CIFAR10: StudioGAN will automatically download the dataset once you execute main.py.
Tiny Imagenet, Imagenet, or a custom dataset:
1. download Tiny Imagenet and Imagenet. Prepare your own dataset.
2. make the folder structure of the dataset as follows:

┌── docs
├── src
└── data
    └── ILSVRC2012 or TINY_ILSVRC2012 or CUSTOM
        ├── train
        │   ├── cls0
        │   │   ├── train0.png
        │   │   ├── train1.png
        │   │   └── ...
        │   ├── cls1
        │   └── ...
        └── valid
            ├── cls0
            │   ├── valid0.png
            │   ├── valid1.png
            │   └── ...
            ├── cls1
            └── ...

Examples and Results

The src/configs directory contains config files used in our experiments.

CIFAR10 (3x32x32)

To train and evaluate ECGAN-UC on CIFAR10:

python3 src/main.py -t -e -c src/configs/CIFAR10/ecgan_v2_none_0_0p01.json

Method	Reference	IS(⭡)	FID(⭣)	F_1/8(⭡)	F_8(⭡)	Cfg	Log	Weights
BigGAN-Mod	StudioGAN	9.746	8.034	0.995	0.994	-	-	-
ContraGAN	StudioGAN	9.729	8.065	0.993	0.992	-	-	-
Ours	-	10.078	7.936	0.990	0.988	Cfg	Log	Link

Tiny ImageNet (3x64x64)

To train and evaluate ECGAN-UC on Tiny ImageNet:

python3 src/main.py -t -e -c src/configs/TINY_ILSVRC2012/ecgan_v2_none_0_0p01.json --eval_type valid

Method	Reference	IS(⭡)	FID(⭣)	F_1/8(⭡)	F_8(⭡)	Cfg	Log	Weights
BigGAN-Mod	StudioGAN	11.998	31.92	0.956	0.879	-	-	-
ContraGAN	StudioGAN	13.494	27.027	0.975	0.902	-	-	-
Ours	-	18.445	18.319	0.977	0.973	Cfg	Log	Link

ImageNet (3x128x128)

To train and evaluate ECGAN-UCE on ImageNet (~12 days on 8 NVIDIA V100 GPUs):

python3 src/main.py -t -e -l -sync_bn -c src/configs/ILSVRC2012/imagenet_ecgan_v2_contra_1_0p05.json --eval_type valid

Method	Reference	IS(⭡)	FID(⭣)	F_1/8(⭡)	F_8(⭡)	Cfg	Log	Weights
BigGAN	StudioGAN	28.633	24.684	0.941	0.921	-	-	-
ContraGAN	StudioGAN	25.249	25.161	0.947	0.855	-	-	-
Ours	-	80.685	8.491	0.984	0.985	Cfg	Log	Link

Generated Images

Here are some selected images generated by ECGAN.

The code for the NeurIPS 2021 paper "A Unified View of cGANs with and without Classifiers".

Related tags

Overview

Energy-based Conditional Generative Adversarial Network (ECGAN)

Introduction

Requirements

Quick Start

Dataset

Examples and Results

CIFAR10 (3x32x32)

Tiny ImageNet (3x64x64)

ImageNet (3x128x128)

Generated Images

Owner

sianchen

Prior-Guided Multi-View 3D Head Reconstruction

[NeurIPS 2020] Blind Video Temporal Consistency via Deep Video Prior

Python script for performing depth completion from sparse depth and rgb images using the msg_chn_wacv20. model in ONNX

《K-Adapter: Infusing Knowledge into Pre-Trained Models with Adapters》(2020)

Mip-NeRF: A Multiscale Representation for Anti-Aliasing Neural Radiance Fields.

Potato Disease Classification - Training, Rest APIs, and Frontend to test.

Source code for the paper "Periodic Traveling Waves in an Integro-Difference Equation With Non-Monotonic Growth and Strong Allee Effect"

Fuzzing tool (TFuzz): a fuzzing tool based on program transformation

BridgeGAN - Tensorflow implementation of Bridging the Gap between Label- and Reference-based Synthesis in Multi-attribute Image-to-Image Translation.

This Repostory contains the pretrained DTLN-aec model for real-time acoustic echo cancellation.

Python code for loading the Aschaffenburg Pose Dataset.

A hybrid framework (neural mass model + ML) for SC-to-FC prediction

Learnable Boundary Guided Adversarial Training (ICCV2021)

PyTorch implementation of MSBG hearing loss model and MBSTOI intelligibility metric

Dahua Camera and Doorbell Home Assistant Integration

Manim is an engine for precise programmatic animations, designed for creating explanatory math videos

covid question answering datasets and fine tuned models

Jiminy Cricket Environment (NeurIPS 2021)

Official Implementation of "Third Time's the Charm? Image and Video Editing with StyleGAN3" https://arxiv.org/abs/2201.13433

Genetic feature selection module for scikit-learn