Code for the paper "Evidential Softmax for Sparse Multimodal Distributions in Deep Generative Models"

Last update: Jun 06, 2022

Overview

Evidential Softmax for Sparse Multimodal Distributions in Deep Generative Models

Abstract

Many applications of generative models rely on the marginalization of their high-dimensional output probability distributions. Normalization functions that yield sparse probability distributions can make exact marginalization more computationally tractable. However, sparse normalization functions usually require alternative loss functions for training because the log-likelihood can be undefined for sparse probability distributions. Furthermore, many sparse normalization functions often collapse the multimodality of distributions. In this work, we present ev-softmax, a sparse normalization function that preserves the multimodality of probability distributions. We derive its properties, including its gradient in closed-form, and introduce a continuous family of approximations to ev-softmax that have full support and can thus be trained with probabilistic loss functions such as negative log-likelihood and Kullback-Leibler divergence. We evaluate our method on a variety of generative models, including variational autoencoders and auto-regressive models. Our method outperforms existing dense and sparse normalization techniques in distributional accuracy and classification performance. We demonstrate that ev-softmax successfully reduces the dimensionality of output probability distributions while maintaining multimodality.

Setup

Required packages are listed in requirements.txt.

Running

The implementation for the ev-softmax function and its loss function can be found in evsoftmax.py.
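
For orientation, the idea described in the abstract can be sketched in a few lines of plain Python. This is not the code in evsoftmax.py. It assumes ev-softmax keeps only the inputs at or above the mean input and renormalizes their exponentials, and that the full-support training variant adds a small constant to the keep mask. The function name ev_softmax_sketch and the eps parameter are hypothetical and chosen only for illustration.

```python
import math


def ev_softmax_sketch(logits, eps=0.0):
    """Illustrative sketch only (assumed definition, not the repository's code).

    Entries below the mean logit get zero weight when eps == 0.
    With eps > 0 every entry keeps a small weight, so the output has
    full support and its log-likelihood is defined everywhere.
    """
    mean = sum(logits) / len(logits)
    peak = max(logits)  # subtracted before exponentiating, for numerical stability
    weights = []
    for v in logits:
        # Always keep the largest entry so that floating-point rounding of
        # the mean can never leave the kept set empty.
        keep = 1.0 if (v >= mean or v == peak) else 0.0
        weights.append((keep + eps) * math.exp(v - peak))
    total = sum(weights)
    return [w / total for w in weights]


logits = [2.0, 1.9, -1.0, -3.0]
sparse = ev_softmax_sketch(logits)           # two modes survive, the rest are exactly zero
dense = ev_softmax_sketch(logits, eps=1e-3)  # same shape, but strictly positive everywhere
```

In this example the two largest inputs are close together and both stay in the support, which illustrates the multimodality the abstract refers to, while the two inputs below the mean receive exactly zero probability in the sparse case.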

The MNIST CVAE and VQ-VAE experiments can be run with run_mnist_cvae.sh and run_vqvae.sh, respectively. Instructions for the SSVAE experiment are in mnist_ssvae/README.md, and the scripts for preprocessing, training, and evaluation are in mnist_ssvae/scripts. Instructions for the translation experiment are in translation/README.md, and the corresponding scripts are in translation/scripts/iwslt.

Owner

Stanford Intelligent Systems Laboratory
