Imaginaire - NVIDIA's Deep Imagination Team's PyTorch Library

Last update: Dec 29, 2022

Related tags

Overview

Imaginaire

Docs | License | Installation | Model Zoo

Imaginaire is a pytorch library that contains optimized implementation of several image and video synthesis methods developed at NVIDIA.

License

Imaginaire is released under NVIDIA Software license. For commercial use, please consult NVIDIA Research Inquiries.

What's inside?

We have a tutorial for each model. Click on the model name, and your browser should take you to the tutorial page for the project.

Supervised Image-to-Image Translation

Algorithm Name	Feature	Publication
pix2pixHD	Learn a mapping that converts a semantic image to a high-resolution photorealistic image.	Wang et. al. CVPR 2018
SPADE	Improve pix2pixHD on handling diverse input labels and delivering better output quality.	Park et. al. CVPR 2019

Unsupervised Image-to-Image Translation

Algorithm Name	Feature	Publication
UNIT	Learn a one-to-one mapping between two visual domains.	Liu et. al. NeurIPS 2017
MUNIT	Learn a many-to-many mapping between two visual domains.	Huang et. al. ECCV 2018
FUNIT	Learn a style-guided image translation model that can generate translations in unseen domains.	Liu et. al. ICCV 2019
COCO-FUNIT	Improve FUNIT with a content-conditioned style encoding scheme for style code computation.	Saito et. al. ECCV 2020

Video-to-video Translation

Algorithm Name	Feature	Publication
vid2vid	Learn a mapping that converts a semantic video to a photorealistic video.	Wang et. al. NeurIPS 2018
fs-vid2vid	Learn a subject-agnostic mapping that converts a semantic video and an example image to a photoreslitic video.	Wang et. al. NeurIPS 2019

World-to-world Translation

Algorithm Name	Feature	Publication
wc-vid2vid	Improve vid2vid on view consistency and long-term consistency.	Mallya et. al. ECCV 2020
GANcraft	Convert semantic block worlds to realistic-looking worlds.	Hao et. al. ICCV 2021

Imaginaire - NVIDIA's Deep Imagination Team's PyTorch Library

Related tags

Overview

Imaginaire

Docs | License | Installation | Model Zoo

License

What's inside?

Supervised Image-to-Image Translation

Unsupervised Image-to-Image Translation

Video-to-video Translation

World-to-world Translation

Owner

NVIDIA Research Projects

Learning To Have An Ear For Face Super-Resolution

Predict multi paths to a moving person depending on his trajectory history.

A vision library for performing sliced inference on large images/small objects

Voxel Transformer for 3D object detection

Official code for "Maximum Likelihood Training of Score-Based Diffusion Models", NeurIPS 2021 (spotlight)

A modular active learning framework for Python

Volsdf - Volume Rendering of Neural Implicit Surfaces

[CIKM 2019] Code and dataset for "Fi-GNN: Modeling Feature Interactions via Graph Neural Networks for CTR Prediction"

StyleSpace Analysis: Disentangled Controls for StyleGAN Image Generation

This is an implementation for the CVPR2020 paper "Learning Invariant Representation for Unsupervised Image Restoration"

Instance Segmentation by Jointly Optimizing Spatial Embeddings and Clustering Bandwidth

NCVX (NonConVeX): A User-Friendly and Scalable Package for Nonconvex Optimization in Machine Learning.

Pytorch modules for paralel models with same architecture. Ideal for multi agent-based systems

Can we visualize a large scientific data set with a surrogate model? We're building a GAN for the Earth's Mantle Convection data set to see if we can!

This is the source code for our ICLR2021 paper: Adaptive Universal Generalized PageRank Graph Neural Network.

Sum-Product Probabilistic Language

LTR_CrossEncoder: Legal Text Retrieval Zalo AI Challenge 2021

Project NII pytorch scripts

The official code repository for examples in the O'Reilly book 'Generative Deep Learning'

An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)