PyTorch implementation for the paper Pseudo Numerical Methods for Diffusion Models on Manifolds

Last update: Jan 05, 2023

Related tags

Overview

Pseudo Numerical Methods for Diffusion Models on Manifolds (PNDM)

This repo is the official PyTorch implementation for the paper Pseudo Numerical Methods for Diffusion Models on Manifolds

by Luping Liu, Yi Ren, Zhijie Lin, Zhou Zhao (Zhejiang University).

What does this code do?

This code is not only the official implementation for PNDM, but also a generic framework for DDIM-like models including:

Structure

This code contains three main objects including method, schedule and model. The following table shows the options supported by this code and the role of each object.

Object	Option	Role
method	DDIM, S-PNDM, F-PNDM, FON, PF	the numerical method used to generate samples
schedule	linear, quad, cosine	the schedule of adding noise to images
model	DDIM, iDDPM, PF, PF_deep	the neural network used to fit noise

All of them can be combined at will, so this code provide at least 5x3x4=60 choices to generate samples.

How to run the code

Dependencies

Run the following to install a subset of necessary python packages for our code.

pip install -r requirements.txt

Tip: mpi4py can make the generation process faster using multi-gpus. It is not necessary and can be removed freely.

Usage

Evaluate our models through main.py.

python main.py --runner sample --method F-PNDM --sample_speed 50 --device cuda --config ddim-cifar10.yml --image_path temp/results --model_path temp/models/ddim/ema_cifar10.ckpt

runner (train|sample): choose the mode of runner
method (DDIM|FON|S-PNDM|F-PNDM|PF): choose the numerical methods
sample_speed: control the total generation step
device (cpu|cuda:0): choose the device to use
config: choose the config file
image_path: choose the path to save images
model_path: choose the path of model

Train our models through main.py.

python main.py --runner train --device cuda --config ddim-cifar10.yml --train_path temp/train

train_path: choose the path to save training status

Checkpoints & statistics

All checkpoints of models and precalculated statistics for FID are provided in this Onedrive.

References

If you find the code useful for your research, please consider citing:

@inproceedings{liu2022pseudo,
    title={Pseudo Numerical Methods for Diffusion Models on Manifolds},
    author={Luping Liu and Yi Ren and Zhijie Lin and Zhou Zhao},
    booktitle={International Conference on Learning Representations},
    year={2022},
    url={https://openreview.net/forum?id=PlKWVd2yBkY}
}

This work is built upon some previous papers which might also interest you:

Jonathan Ho, Ajay Jain, and Pieter Abbeel. Denoising diffusion probabilistic models. Advances in Neural Information Processing Systems 33 (2020): 6840-6851.
Jiaming Song, Chenlin Meng, and Stefano Ermon. Denoising Diffusion Implicit Models. International Conference on Learning Representations. 2020.
Yang Song, Jascha Sohl-Dickstein, Diederik P. Kingma, Abhishek Kumar, Stefano Ermon, and Ben Poole. Score-Based Generative Modeling through Stochastic Differential Equations. International Conference on Learning Representations. 2020.

PyTorch implementation for the paper Pseudo Numerical Methods for Diffusion Models on Manifolds

Related tags

Overview

Pseudo Numerical Methods for Diffusion Models on Manifolds (PNDM)

What does this code do?

Structure

How to run the code

Dependencies

Usage

Checkpoints & statistics

References

Owner

Luping Liu (刘路平)

Source code for the ACL-IJCNLP 2021 paper entitled "T-DNA: Taming Pre-trained Language Models with N-gram Representations for Low-Resource Domain Adaptation" by Shizhe Diao et al.

Portfolio analytics for quants, written in Python

MetaBalance: Improving Multi-Task Recommendations via Adapting Gradient Magnitudes of Auxiliary Tasks

So-ViT: Mind Visual Tokens for Vision Transformer

Manim is an engine for precise programmatic animations, designed for creating explanatory math videos

Repository for "Toward Practical Monocular Indoor Depth Estimation" (CVPR 2022)

Open CV - Convert a picture to look like a cartoon sketch in python

Repo for CVPR2021 paper "QPIC: Query-Based Pairwise Human-Object Interaction Detection with Image-Wide Contextual Information"

Python scripts form performing stereo depth estimation using the high res stereo model in PyTorch .

Official PyTorch implementation of Spatial Dependency Networks.

Semi-Supervised Semantic Segmentation with Pixel-Level Contrastive Learning from a Class-wise Memory Bank

Repository for the electrical and ICT benchmark model developed in the ERIGrid 2.0 project.

3D2Unet: 3D Deformable Unet for Low-Light Video Enhancement (PRCV2021)

Generalizing Gaze Estimation with Outlier-guided Collaborative Adaptation

PyTorch implementation HoroPCA: Hyperbolic Dimensionality Reduction via Horospherical Projections

Official implementation of our paper "Learning to Bootstrap for Combating Label Noise"

Info and sample codes for "NTU RGB+D Action Recognition Dataset"

U-Net: Convolutional Networks for Biomedical Image Segmentation

MogFace: Towards a Deeper Appreciation on Face Detection

A script written in Python that returns a consensus string and profile matrix of a given DNA string(s) in FASTA format.