Storchastic is a PyTorch library for stochastic gradient estimation in Deep Learning

Overview

Stochastic Deep Learning for PyTorch


Storchastic is a PyTorch library for stochastic gradient estimation in Deep Learning [1]. Many state-of-the-art deep learning models use gradient estimation, in particular within the fields of Variational Inference and Reinforcement Learning. While PyTorch computes gradients of deterministic computation graphs automatically, it will not estimate gradients on stochastic computation graphs [2].

With Storchastic, you can easily define any stochastic deep learning model and let it estimate the gradients for you. Storchastic provides a large range of gradient estimation methods that you can plug and play to find out which one works best for your problem. It also provides automatic broadcasting of sampled batch dimensions, which increases code readability and makes it easier to implement complex models.

When dealing with continuous random variables and differentiable functions, the popular reparameterization method [3] is usually very effective. However, it is not applicable to discrete random variables or non-differentiable functions. This is why Storchastic focuses on gradient estimators for discrete random variables, non-differentiable functions, and sequence models.
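To see the difference in plain PyTorch (independently of Storchastic), compare rsample(), which implements the reparameterization trick, with sample() on a discrete distribution:

    import torch
    from torch.distributions import Normal, Categorical

    # Reparameterization: rsample() keeps sampling inside the autograd graph,
    # so gradients flow back to the distribution's parameters.
    mu = torch.zeros(3, requires_grad=True)
    z = Normal(mu, 1.0).rsample()
    z.sum().backward()                        # mu.grad is now populated

    # Discrete sampling: sample() is detached from the graph, so no gradient
    # reaches the logits; a dedicated gradient estimator (score function,
    # Gumbel-Softmax, ...) is needed instead.
    logits = torch.zeros(3, requires_grad=True)
    k = Categorical(logits=logits).sample()   # k carries no gradient information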

Documentation on Read the Docs.

Example: Discrete Variational Auto-Encoder
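Below is a sketch of what the training loop of a VAE with a categorical latent looks like when the gradients are estimated with a score-function method. The names minibatches, encode, decode, KLD, and optimizer are hypothetical helpers from a user-defined vae module, and the exact signatures of ScoreFunction, denote_independent, add_cost, and backward should be verified against the documentation.

    import torch
    import storch
    from storch.method import ScoreFunction
    from vae import minibatches, encode, decode, KLD, optimizer  # hypothetical helpers

    # Score-function estimator with a batch-average baseline, drawing 8 samples
    # (constructor arguments are assumptions; see the Read the Docs pages).
    method = ScoreFunction("z", 8, baseline_factory="batch_average")

    for data in minibatches():
        optimizer.zero_grad()
        # Mark the minibatch dimension as independent so Storchastic can
        # automatically broadcast sampled dimensions over it.
        data = storch.denote_independent(data.view(-1, 784), 0, "data")
        # Variational posterior over the discrete latent, and a sample from it.
        q = torch.distributions.OneHotCategorical(logits=encode(data))
        z = method(q)
        # Generative distribution given the sampled latent.
        generative_distribution = decode(z)
        # Cost nodes: reconstruction loss and KL divergence (the negative ELBO).
        storch.add_cost(-generative_distribution.log_prob(data), "reconstruction")
        storch.add_cost(KLD(q), "KL-divergence")
        # Estimate gradients of the expected cost and update the parameters.
        storch.backward()
        optimizer.step()

Swapping in another estimator only requires changing the method object; the rest of the loop stays the same.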

Installation

pip install storchastic

Requires PyTorch 1.5 (older versions will not work) and Pyro. The code is built on Python 3.7. The master branch works with PyTorch 1.7, but the version on pip is not compatible with it. Binaries will be updated soon.

Algorithms

Feel free to create an issue if an estimator is missing here.

  • Reparameterization [1, 3]
  • Score Function (REINFORCE) with Moving Average baseline [1, 4]
  • Score Function with Batch Average Baseline [5, 6]
  • Expected value for enumerable distributions
  • (Straight through) Gumbel Softmax [7, 8]
  • LAX, RELAX [9]
  • REBAR [10]
  • REINFORCE Without Replacement [6]
  • Unordered Set Estimator [13]
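Since the estimators share the same method interface, trying a different one is intended to be a one-line change. A minimal sketch, assuming the class names below are exposed in storch.method (constructor arguments are assumptions to verify against the documentation):

    from storch.method import ScoreFunction, GumbelSoftmax, Expect

    # Pick one estimator; the rest of the model code stays unchanged.
    method = ScoreFunction("z", 8, baseline_factory="batch_average")
    # method = GumbelSoftmax("z", 8)
    # method = Expect("z")   # exact expectation for enumerable distributions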

In development

  • Memory Augmented Policy Optimization [11]
  • Rao-Blackwellized REINFORCE [12]

Planned

  • Measure valued derivatives [1, 14]
  • ARM [15]
  • Automatic Credit Assignment [16]
  • ...

References
