Tensorflow Implementation of the paper "Spectral Normalization for Generative Adversarial Networks" (ICML 2017 workshop)

Last update: Nov 25, 2022

Overview

tf-SNDCGAN

Tensorflow implementation of the paper "Spectral Normalization for Generative Adversarial Networks" (https://www.researchgate.net/publication/318572189_Spectral_Normalization_for_Generative_Adversarial_Networks, ICML 2017)

The implementation is based on the author's original code at: https://github.com/pfnet-research/chainer-gan-lib

This implementation works for tensorflow default data format "NHWC"

Spectral Normalization for Generative Adversarial Networks:

This method enforces Lipschitz-1 condition on the Discrminator of Wasserstein-GAN by normalizing its weight matrices with their own respective maximum singular value. This can be used together with Gradient Penalty in the paper "Improved Training of Wasserstein GAN".

The author uses a fast approximation method to compute the maximum singular value of weight matrices.

Quick run:

Keras is required for loading Cifar10 data set

python3 train.py

How to use spectral normalization:

# Import spectral norm wrapper
from libs.sn import spectral_normed_weight
# Create weight variable
W = tf.Variable(np.random.normal(size=[784, 10], scale=0.02), name='W', dtype=tf.float32)
# name of tf collection used for storing the update ops (u)
SPECTRAL_NORM_UPDATE_OPS = "spectral_norm_update_ops"
# call wrapping function, W_bar will be the spectral normed weight matrix
W_bar = spectral_normed_weight(W, num_iters=1, update_collection=SPECTRAL_NORM_UPDATE_OPS)
# Get the update ops
spectral_norm_update_ops = tf.get_collection(SPECTRAL_NORM_UPDATE_OPS)
...
# During training, run the update ops at the end of the iteration
for iter in range(max_iters):
    # Training goes here
    ...
    # Update ops at the end
    for update_op in spectral_norm_update_ops:
        sess.run(update_op)

For an example, see the file test_sn_implementation.py

Training curve:

Generated image samples on Cifar10:

Inception score:

After using in place batch norm update and use the optimal training parameters from the paper, I was able to match their claimed Inception score at 100k iteration: 7.4055686 +/- 0.087728456

The official github repostiory has an inception score of 7.41

Issues:

GPU under-utilization: The original implementation of the author in chainer uses 80%+ GPU most of the time. On an NVIDIA GTX 1080TI, their implementation run at nearly 3 iterations/s. This implementation use less than 50% GPU and run at less than 2 iterations/s. Solved. It was the global_step assignment that makes tensorflow create new assign node for graph each iteration, slow down the execution. This also made the graph become very large over time leading to gigantic event files. GPU utilization is now around 85+%
No Fréchet Inception Distance (https://arxiv.org/abs/1706.08500) evaluation yet.

Tensorflow Implementation of the paper "Spectral Normalization for Generative Adversarial Networks" (ICML 2017 workshop)

Related tags

Overview

tf-SNDCGAN

Spectral Normalization for Generative Adversarial Networks:

Quick run:

How to use spectral normalization:

Training curve:

Generated image samples on Cifar10:

Inception score:

Issues:

Owner

Nhat M. Nguyen

Simultaneous Demand Prediction and Planning

Pansharpening by convolutional neural networks in the full resolution framework

The code for our paper CrossFormer: A Versatile Vision Transformer Based on Cross-scale Attention.

Решения, подсказки, тесты и утилиты для тренировки по алгоритмам от Яндекса.

Official PyTorch code for Hierarchical Conditional Flow: A Unified Framework for Image Super-Resolution and Image Rescaling (HCFlow, ICCV2021)

Implement face detection, and age and gender classification, and emotion classification.

An algorithm study of the 6th iOS 10 set of Boost Camp Web Mobile

Cooperative Driving Dataset: a dataset for multi-agent driving scenarios

Python scripts performing class agnostic object localization using the Object Localization Network model in ONNX.

Implementation for our ICCV2021 paper: Internal Video Inpainting by Implicit Long-range Propagation

A Jinja extension (compatible with Flask and other frameworks) to compile and/or compress your assets.

Pseudo lidar - (CVPR 2019) Pseudo-LiDAR from Visual Depth Estimation: Bridging the Gap in 3D Object Detection for Autonomous Driving

Denoising images with Fourier Ring Correlation loss

Simple implementation of Mobile-Former on Pytorch

Crawl & visualize ICLR papers and reviews

Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)

Code for the paper: Audio-Visual Scene Analysis with Self-Supervised Multisensory Features

Python package facilitating the use of Bayesian Deep Learning methods with Variational Inference for PyTorch

The PyTorch implementation of paper REST: Debiased Social Recommendation via Reconstructing Exposure Strategies

An Agnostic Computer Vision Framework - Pluggable to any Training Library: Fastai, Pytorch-Lightning with more to come