Code for Multinomial Diffusion

Abstract

Generative flows and diffusion models have been predominantly trained on ordinal data, for example natural images. This paper introduces two extensions of flows and diffusion for categorical data such as language or image segmentation: Argmax Flows and Multinomial Diffusion. Argmax Flows are defined by a composition of a continuous distribution (such as a normalizing flow), and an argmax function. To optimize this model, we learn a probabilistic inverse for the argmax that lifts the categorical data to a continuous space. Multinomial Diffusion gradually adds categorical noise in a diffusion process, for which the generative denoising process is learned. We demonstrate that our method outperforms existing dequantization approaches on text modelling and modelling on image segmentation maps in log-likelihood.

Link: https://arxiv.org/abs/2102.05379

Instructions

In the folder containing setup.py, run

pip install --user -e .

The --user option ensures the library will only be installed for your user. The -e option makes it possible to modify the library, and modifications will be loaded on the fly.

You should now be able to use it.

Running experiments.

Go to the experiment of interest (folder segmentation_diffusion or text_diffusion) and follow the readme instructions there.

Acknowledgements

The Robert Bosch GmbH is acknowledged for financial support.

Code for Multinomial Diffusion

Related tags

Overview

Code for Multinomial Diffusion

Abstract

Instructions

Running experiments.

Acknowledgements

Owner

This repository provides a PyTorch implementation and model weights for HCSC (Hierarchical Contrastive Selective Coding)

Flexible-CLmser: Regularized Feedback Connections for Biomedical Image Segmentation

Real-time 3D multi-person detection made easy with OpenPose and the ZED

MultiMix: Sparingly Supervised, Extreme Multitask Learning From Medical Images (ISBI 2021, MELBA 2021)

MinkLoc3D-SI: 3D LiDAR place recognition with sparse convolutions,spherical coordinates, and intensity

Code for our paper at ECCV 2020: Post-Training Piecewise Linear Quantization for Deep Neural Networks

GAN example for Keras. Cuz MNIST is too small and there should be something more realistic.

EMNLP 2021 Findings' paper, SCICAP: Generating Captions for Scientific Figures

Implementation of gaze tracking and demo

🚀 An end-to-end ML applications using PyTorch, W&B, FastAPI, Docker, Streamlit and Heroku

QAT(quantize aware training) for classification with MQBench

The official repository for BaMBNet

Code for the published paper : Learning to recognize rare traffic sign

Deep Learning segmentation suite designed for 2D microscopy image segmentation

Picasso: a methods for embedding points in 2D in a way that respects distances while fitting a user-specified shape.

Scrutinizing XAI with linear ground-truth data

Personalized Federated Learning using Pytorch (pFedMe)

PyTorch Connectomics: segmentation toolbox for EM connectomics

Algorithms for outlier, adversarial and drift detection

TDmatch is a Python library developed to perform matching tasks in three categories: