Evidential Softmax for Sparse Multimodal Distributions in Deep Generative Models

Abstract

Many applications of generative models rely on the marginalization of their high-dimensional output probability distributions. Normalization functions that yield sparse probability distributions can make exact marginalization more computationally tractable. However, sparse normalization functions usually require alternative loss functions for training because the log-likelihood can be undefined for sparse probability distributions. Furthermore, many sparse normalization functions collapse the multimodality of distributions. In this work, we present ev-softmax, a sparse normalization function that preserves the multimodality of probability distributions. We derive its properties, including its gradient in closed form, and introduce a continuous family of approximations to ev-softmax that have full support and can thus be trained with probabilistic loss functions such as negative log-likelihood and Kullback-Leibler divergence. We evaluate our method on a variety of generative models, including variational autoencoders and autoregressive models. Our method outperforms existing dense and sparse normalization techniques in distributional accuracy and classification performance. We demonstrate that ev-softmax successfully reduces the dimensionality of output probability distributions while maintaining multimodality.

Setup

Required packages are listed in requirements.txt.

Running

The implementation of the ev-softmax function and its loss function can be found in evsoftmax.py.
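
For readers who want a feel for the idea before opening evsoftmax.py, here is a minimal pure-Python sketch. It is not the code in this repository. This README does not state the exact definition, so the rule used here (mask out logits below the mean, then renormalize the exponentials) is an assumption based on the paper's description, and the names ev_softmax_sketch, nll, and eps are hypothetical. The sketch shows two points from the abstract: a sparse output makes the negative log-likelihood infinite for a zeroed class, and a small eps restores full support so the loss stays finite.

```python
import math


def ev_softmax_sketch(logits, eps=0.0):
    """Illustrative sparse normalization (assumed form, not the repository's code).

    Assumption: entries below the mean logit are masked out and the remaining
    exponentials are renormalized. With eps > 0, masked entries keep a small
    weight, so the output has full support.
    """
    mean = sum(logits) / len(logits)
    shift = max(logits)  # subtract the max for numerical stability
    weights = [
        ((1.0 if v >= mean else 0.0) + eps) * math.exp(v - shift)
        for v in logits
    ]
    total = sum(weights)  # > 0 because the largest logit is never below the mean
    return [w / total for w in weights]


def nll(probs, target):
    """Negative log-likelihood of one target index; infinite if its probability is 0."""
    p = probs[target]
    return math.inf if p == 0.0 else -math.log(p)


logits = [2.0, 1.9, -1.0, -3.0]  # two competing modes, two unlikely classes
sparse = ev_softmax_sketch(logits)            # exact zeros for the last two classes
smooth = ev_softmax_sketch(logits, eps=1e-3)  # every class keeps nonzero mass

print(sparse)
print(nll(sparse, 2))  # inf: the log-likelihood is undefined on a zeroed class
print(nll(smooth, 2))  # finite, so the approximation can be trained with NLL
```

In this example both high-scoring classes keep probability mass in the sparse output, which is the sense in which the abstract says multimodality is preserved. Refer to evsoftmax.py for the actual function and loss.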

The MNIST CVAE and VQ-VAE experiments can be run using run_mnist_cvae.sh and run_vqvae.sh, respectively. Instructions for the SSVAE experiment can be found in mnist_ssvae/README.md, and scripts used for preprocessing, training, and evaluating can be found in mnist_ssvae/scripts. Instructions for the translation experiment can be found in translation/README.md, and scripts used for preprocessing, training, and evaluating can be found in translation/scripts/iwslt.

Owner

Stanford Intelligent Systems Laboratory
