A probabilistic programming library for Bayesian deep learning and generative models, based on TensorFlow

Last update: Dec 28, 2022

Overview

ZhuSuan is a Python probabilistic programming library for Bayesian deep learning, which combines the complementary advantages of Bayesian methods and deep learning. ZhuSuan is built upon TensorFlow. Unlike existing deep learning libraries, which are mainly designed for deterministic neural networks and supervised tasks, ZhuSuan provides deep learning style primitives and algorithms for building probabilistic models and applying Bayesian inference. The supported inference algorithms include:

Variational Inference (VI) with programmable variational posteriors, various objectives and advanced gradient estimators (SGVB, REINFORCE, VIMCO, etc.).
Importance Sampling (IS) for learning and evaluating models, with programmable proposals.
Hamiltonian Monte Carlo (HMC) with parallel chains, and optional automatic parameter tuning.
Stochastic Gradient Markov Chain Monte Carlo (SGMCMC): SGLD, PSGLD, SGHMC, and SGNHT.
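
To make the importance sampling item above concrete, here is a minimal sketch in plain Python. It does not use ZhuSuan's API; the toy model (z ~ N(0, 1), x | z ~ N(z, 1)) and all function names are assumptions chosen for illustration. The model is conjugate, so the exact log marginal likelihood is available for comparison.

```python
import math
import random


def log_normal(x, mean, std):
    """Log density of a univariate normal distribution."""
    return -0.5 * math.log(2 * math.pi * std ** 2) - 0.5 * ((x - mean) / std) ** 2


def log_joint(x, z):
    """Toy model (an assumption for this sketch): z ~ N(0, 1), x | z ~ N(z, 1)."""
    return log_normal(z, 0.0, 1.0) + log_normal(x, z, 1.0)


def is_log_marginal(x, q_mean, q_std, n_samples, rng):
    """Importance sampling estimate of log p(x) with proposal q(z) = N(q_mean, q_std^2)."""
    log_weights = []
    for _ in range(n_samples):
        z = rng.gauss(q_mean, q_std)
        log_weights.append(log_joint(x, z) - log_normal(z, q_mean, q_std))
    # Log-mean-exp of the weights, shifted by the maximum for numerical stability.
    shift = max(log_weights)
    return shift + math.log(sum(math.exp(w - shift) for w in log_weights) / n_samples)


def exact_log_marginal(x):
    """For this conjugate toy model, x ~ N(0, 2) marginally."""
    return log_normal(x, 0.0, math.sqrt(2.0))


if __name__ == "__main__":
    rng = random.Random(0)
    x = 1.0
    print("exact log p(x):          ", exact_log_marginal(x))
    print("IS with prior proposal:  ", is_log_marginal(x, 0.0, 1.0, 20000, rng))
    # The exact posterior here is N(x / 2, 1 / 2); using it as the proposal
    # makes every importance weight equal to p(x).
    print("IS with exact posterior: ", is_log_marginal(x, x / 2, math.sqrt(0.5), 100, rng))
```

The closer the proposal is to the true posterior, the lower the variance of the estimate, which is why a programmable proposal is useful.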

Installation

ZhuSuan is still under development. Before the first stable release (1.0), please clone the repository and run

pip install .

in the main directory. This will install ZhuSuan and its dependencies automatically. ZhuSuan also requires TensorFlow 1.13.0 or later. Because users should choose whether to install the CPU or GPU version of TensorFlow, we do not include it in the dependencies. See Installing TensorFlow.
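
Since TensorFlow is not installed automatically, it can help to check the installed version against the 1.13.0 minimum before running anything. The helper below is a hypothetical sketch, not part of ZhuSuan; the function names are assumptions, and it only compares the numeric major, minor and patch components.

```python
import re


def parse_version(version):
    """Turn a version string such as '1.13.0rc1' into a (major, minor, patch) tuple.

    Non-numeric suffixes are ignored, so a release candidate compares equal
    to the corresponding final release in this simplified sketch.
    """
    parts = []
    for piece in version.split(".")[:3]:
        match = re.match(r"\d+", piece)
        parts.append(int(match.group()) if match else 0)
    while len(parts) < 3:
        parts.append(0)
    return tuple(parts)


def meets_minimum(version, minimum=(1, 13, 0)):
    """Return True if the version string is at least the given minimum."""
    return parse_version(version) >= minimum


def installed_tensorflow_ok():
    """Return True/False for the installed TensorFlow, or None if it is missing."""
    try:
        import tensorflow as tf
    except ImportError:
        return None
    return meets_minimum(tf.__version__)
```

Calling `installed_tensorflow_ok()` returns `None` when TensorFlow is missing, which signals that either the CPU or the GPU package still needs to be installed.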

If you are developing ZhuSuan, you may want to install in an "editable" or "develop" mode. Please refer to the Contributing section below.

Documentation

Examples

We provide examples of traditional hierarchical Bayesian models and recent deep generative models.

To run the provided examples, you may need to install extra dependencies. This can be done with

pip install ".[examples]"

Gaussian: HMC
Toy 2D Intractable Posterior: SGVB
Bayesian Neural Networks: SGVB, SGMCMC
Variational Autoencoder (VAE): SGVB, IWAE
Convolutional VAE: SGVB
Semi-supervised VAE (Kingma, 2014): SGVB, Adaptive IS
Deep Sigmoid Belief Networks: Adaptive IS, VIMCO
Logistic Normal Topic Model: HMC
Probabilistic Matrix Factorization: HMC
Sparse Variational Gaussian Process: SGVB
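
For readers new to HMC, the idea behind the Gaussian example in the list above can be sketched in a few lines of plain Python. This is not the code of the provided example and does not use ZhuSuan's API; the function name, the target parameters and the step settings are all assumptions made for illustration.

```python
import math
import random


def hmc_gaussian(mean, std, n_samples, step_size, n_leapfrog, rng):
    """Sample from N(mean, std^2) with a basic Hamiltonian Monte Carlo loop."""

    def potential(x):
        # Negative log density of the target, up to an additive constant.
        return 0.5 * ((x - mean) / std) ** 2

    def grad_potential(x):
        return (x - mean) / std ** 2

    samples = []
    accepted = 0
    x = 0.0  # arbitrary starting point
    for _ in range(n_samples):
        p = rng.gauss(0.0, 1.0)  # resample the momentum
        x_new, p_new = x, p
        # Leapfrog integration: half step in momentum, alternating full steps,
        # then a final half step in momentum.
        p_new -= 0.5 * step_size * grad_potential(x_new)
        for i in range(n_leapfrog):
            x_new += step_size * p_new
            if i < n_leapfrog - 1:
                p_new -= step_size * grad_potential(x_new)
        p_new -= 0.5 * step_size * grad_potential(x_new)
        # Metropolis correction based on the change in total energy.
        h_old = potential(x) + 0.5 * p * p
        h_new = potential(x_new) + 0.5 * p_new * p_new
        if rng.random() < math.exp(min(0.0, h_old - h_new)):
            x = x_new
            accepted += 1
        samples.append(x)
    return samples, accepted / n_samples


if __name__ == "__main__":
    draws, rate = hmc_gaussian(1.0, 2.0, 5000, 0.3, 10, random.Random(0))
    sample_mean = sum(draws) / len(draws)
    sample_var = sum((d - sample_mean) ** 2 for d in draws) / len(draws)
    print("acceptance rate:", rate)
    print("sample mean:", sample_mean, "sample variance:", sample_var)
```

The sketch runs a single chain with a fixed step size; parallel chains and automatic parameter tuning, as listed in the overview, are not shown here.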

Citing ZhuSuan

If you find ZhuSuan useful, please cite it in your publications. We provide a BibTeX entry of the ZhuSuan white paper below.

@ARTICLE{zhusuan2017,
    title={Zhu{S}uan: A Library for {B}ayesian Deep Learning},
    author={Shi, Jiaxin and Chen, Jianfei and Zhu, Jun and Sun, Shengyang
    and Luo, Yucen and Gu, Yihong and Zhou, Yuhao},
    journal={arXiv preprint arXiv:1709.05870},
    year=2017,
}

Contributing

We always welcome contributions to help make ZhuSuan better. If you would like to contribute, please check out the guidelines here.

Owner

Tsinghua Machine Learning Group
