Code for "Infinitely Deep Bayesian Neural Networks with Stochastic Differential Equations"

Last update: Nov 26, 2021

Overview

Infinitely Deep Bayesian Neural Networks with SDEs

This library contains JAX and Pytorch implementations of neural ODEs and Bayesian layers for stochastic variational inference. A rudimentary JAX implementation of differentiable SDE solvers is also provided, refer to torchsde [2] for a full set of differentiable SDE solvers in Pytorch and similarly to torchdiffeq [3] for differentiable ODE solvers.

Continuous-depth hidden unit trajectories in Neural ODE vs uncertain posterior dynamics SDE-BNN.

Installation

This library runs on jax==0.1.77 and torch==1.6.0. To install all other requirements:

pip install -r requirements.txt

Note: Package versions may change, refer to official JAX installation instructions here.

JaxSDE: Differentiable SDE Solvers in JAX

The jaxsde library contains SDE solvers in the Ito and Stratonovich form. Solvers of different orders can be specified with the following method={euler_maruyama|milstein|euler_heun} (strong orders 0.5|1|0.5 and orders 1|1|1 in the case of an additive noise SDE). Stochastic adjoint (sdeint_ito) training mode does not work efficiently yet, use sdeint_ito_fixed_grid for now. Tradeoff solver speed for precision during training or inference by adjusting --nsteps <# steps>.

Usage

Default solver: Backpropagation through the solver.

from jaxsde.jaxsde.sdeint import sdeint_ito_fixed_grid

y1 = sdeint_ito_fixed_grid(f, g, y0, ts, rng, fw_params, method="euler_maruyama")

Stochastic adjoint: Using O(1) memory instead of solving an adjoint SDE in the backward pass.

from jaxsde.jaxsde.sdeint import sdeint_ito

y1 = sdeint_ito(f, g, y0, ts, rng, fw_params, method="milstein")

Brax: Bayesian SDE Framework in JAX

Implementation of composable Bayesian layers in the stax API. Our SDE Bayesian layers can be used with the SDEBNN block composed with multiple parameterizations of time-dependent layers in diffeq_layers. Sticking-the-landing (STL) trick can be enabled during training with --stl for improving convergence rate. Augment the inputs by a custom amount --aug <integer>, set the number of samples averaged over with --nsamples <integer>. If memory constraints pose a problem, train in gradient accumulation mode: --acc_grad and gradient checkpointing: --remat.

Samples from SDEBNN-learned predictive prior and posterior density distributions.

Usage

All examples can be swapped in with different vision datasets. For better readability, tensorboard logging has been excluded (see torchbnn instead).

Toy 1D regression to learn complex posteriors:

python examples/jax/sdebnn_toy1d.py --ds cos --activn swish --loss laplace --kl_scale 1. --diff_const 0.2 --driftw_scale 0.1 --aug_dim 2 --stl --prior_dw ou

Image Classification:

To train an SDEBNN model:

python examples/jax/sdebnn_classification.py --output <output directory> --model sdenet --aug 2 --nblocks 2-2-2 --diff_coef 0.2 --fx_dim 64 --fw_dims 2-64-2 --nsteps 20 --nsamples 1

To train a ResNet baseline, specify --model resnet and for a Bayesian ResNet baseline, specify --meanfield_sdebnn.

TorchBNN: SDE-BNN in Pytorch

A PyTorch implementation of the Brax framework powered by the torchsde backend.

Usage

All examples can be swapped in with different vision datasets and includes tensorboard logging for critical metrics.

Toy 1D regression to learn multi-modal posterior:

python examples/torch/sdebnn_toy1d.py --output_dir <dst_path>

Arbitrarily expression approximate posteriors from learning non-Gaussian marginals.

Image Classification:

All hyperparameters can be found in the training script. Train with adjoint for memory efficient backpropagation and adaptive mode for adaptive computation (and ensure --adjoint_adaptive True if training with adjoint and adaptive modes).

python examples/torch/sdebnn_classification.py --train-dir <output directory> --data cifar10 --dt 0.05 --method midpoint --adjoint True --adaptive True --adjoint_adaptive True --inhomogeneous True

References

[1] Winnie Xu, Ricky T. Q. Chen, Xuechen Li, David Duvenaud. "Infinitely Deep Bayesian Neural Networks with Stochastic Differential Equations." Preprint 2021. [arxiv]

[2] Xuechen Li, Ting-Kam Leonard Wong, Ricky T. Q. Chen, David Duvenaud. "Scalable Gradients for Stochastic Differential Equations." AISTATS 2020. [arxiv]

[3] Ricky T. Q. Chen, Yulia Rubanova, Jesse Bettencourt, David Duvenaud. "Neural Ordinary Differential Equations." NeurIPS. 2018. [arxiv]

If you found this library useful in your research, please consider citing

@article{xu2021sdebnn,
  title={Infinitely Deep Bayesian Neural Networks with Stochastic Differential Equations},
  author={Xu, Winnie and Chen, Ricky T. Q. and Li, Xuechen and Duvenaud, David},
  archivePrefix = {arXiv},
  year={2021}
}

Code for "Infinitely Deep Bayesian Neural Networks with Stochastic Differential Equations"

Related tags

Overview

Infinitely Deep Bayesian Neural Networks with SDEs

Installation

JaxSDE: Differentiable SDE Solvers in JAX

Usage

Brax: Bayesian SDE Framework in JAX

Usage

Toy 1D regression to learn complex posteriors:

Image Classification:

TorchBNN: SDE-BNN in Pytorch

Usage

Toy 1D regression to learn multi-modal posterior:

Image Classification:

References

Owner

Winnie Xu

unofficial pytorch implementation of RefineGAN

“英特尔创新大师杯”深度学习挑战赛赛道3：CCKS2021中文NLP地址相关性任务

Hierarchical Memory Matching Network for Video Object Segmentation (ICCV 2021)

Codebase for BMVC 2021 paper "Text Based Person Search with Limited Data"

The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.

DGN pymarl - Implementation of DGN on Pymarl, which could be trained by VDN or QMIX

Deep Sketch-guided Cartoon Video Inbetweening

Pretraining Representations For Data-Efficient Reinforcement Learning

Team Enigma at ArgMining 2021 Shared Task: Leveraging Pretrained Language Models for Key Point Matching

PyTorch/TorchScript compiler for NVIDIA GPUs using TensorRT

Implementation for our ICCV 2021 paper: Dual-Camera Super-Resolution with Aligned Attention Modules

This is an implementation of Googles Yogi-Optimizer in Keras (tf.keras)

Python version of the amazing Reaction Mechanism Generator (RMG).

Source code for Adaptively Calibrated Critic Estimates for Deep Reinforcement Learning

Python wrappers to the C++ library SymEngine, a fast C++ symbolic manipulation library.

A (PyTorch) imbalanced dataset sampler for oversampling low frequent classes and undersampling high frequent ones.

DWIPrep is a robust and easy-to-use pipeline for preprocessing of diverse dMRI data.

Code for the paper BERT might be Overkill: A Tiny but Effective Biomedical Entity Linker based on Residual Convolutional Neural Networks

Img-process-manual - Utilize Python Numpy and Matplotlib to realize OpenCV baisc image processing function

Official PyTorch implementation of "IntegralAction: Pose-driven Feature Integration for Robust Human Action Recognition in Videos", CVPRW 2021

Code for "Infinitely Deep Bayesian Neural Networks with Stochastic Differential Equations"

Related tags

Overview

Infinitely Deep Bayesian Neural Networks with SDEs

Installation

JaxSDE: Differentiable SDE Solvers in JAX

Usage

Brax: Bayesian SDE Framework in JAX

Usage

Toy 1D regression to learn complex posteriors:

Image Classification:

TorchBNN: SDE-BNN in Pytorch

Usage

Toy 1D regression to learn multi-modal posterior:

Image Classification:

References

Owner

Winnie Xu

unofficial pytorch implementation of RefineGAN

“英特尔创新大师杯”深度学习挑战赛 赛道3：CCKS2021中文NLP地址相关性任务

Hierarchical Memory Matching Network for Video Object Segmentation (ICCV 2021)

Codebase for BMVC 2021 paper "Text Based Person Search with Limited Data"

The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.

DGN pymarl - Implementation of DGN on Pymarl, which could be trained by VDN or QMIX

Deep Sketch-guided Cartoon Video Inbetweening

Pretraining Representations For Data-Efficient Reinforcement Learning

Team Enigma at ArgMining 2021 Shared Task: Leveraging Pretrained Language Models for Key Point Matching

PyTorch/TorchScript compiler for NVIDIA GPUs using TensorRT

Implementation for our ICCV 2021 paper: Dual-Camera Super-Resolution with Aligned Attention Modules

This is an implementation of Googles Yogi-Optimizer in Keras (tf.keras)

Python version of the amazing Reaction Mechanism Generator (RMG).

Source code for Adaptively Calibrated Critic Estimates for Deep Reinforcement Learning

Python wrappers to the C++ library SymEngine, a fast C++ symbolic manipulation library.

A (PyTorch) imbalanced dataset sampler for oversampling low frequent classes and undersampling high frequent ones.

DWIPrep is a robust and easy-to-use pipeline for preprocessing of diverse dMRI data.

Code for the paper BERT might be Overkill: A Tiny but Effective Biomedical Entity Linker based on Residual Convolutional Neural Networks

Img-process-manual - Utilize Python Numpy and Matplotlib to realize OpenCV baisc image processing function

Official PyTorch implementation of "IntegralAction: Pose-driven Feature Integration for Robust Human Action Recognition in Videos", CVPRW 2021

“英特尔创新大师杯”深度学习挑战赛赛道3：CCKS2021中文NLP地址相关性任务