Unsupervised Learning of Compositional Energy Concepts

This is the pytorch code for the paper Unsupervised Learning of Compositional Energy Concepts.

Demo

Please download a pretrained model at this link and then execute the following code to test a pretrained CelebA-HQ 128x128 COMET model

python demo.py im_path=im0.png

Global Factor Decomposition

Please utilize the following command to run global factor decomposition on CelebA-HQ (or other datasets)

python train.py --exp=celebahq --batch_size=12 --gpus=1 --cuda --train --dataset=celebahq --step_lr=500.0

You may further run the code on high-resolution 128x128 images below

python train.py --exp=celebahq_128 --batch_size=12 --gpus=1 --cuda --train --dataset=celebahq_128 --step_lr=500.0

Local Factor Decomposition

Please utilize the following command to run local factor decomposition on CLEVR

python train.py --exp=clevr_local_decomp --num_steps=5 --step_lr=1000.0 --components=4 --dataset=clevr --cuda --train --batch_size=24 --latent_dim=16 --recurrent_model --pos_embed

Dataset Download

Please utilize the following link to download the CLEVR dataset utilized in our experiments. Downloads for additional datasets will be posted soon. Feel free to raise an issue if there is a particular dataset you would like downloaded

Citing our Paper

If you find our code useful for your research, please consider citing

@inproceedings{du2021comet,
  title={Unsupervised Learning of Compositional Energy Concepts},
  author={Du, Yilun and Li, Shuang and Sharma, Yash and Tenenbaum, B. Joshua
  and Mordatch, Igor},
  booktitle={Advances in Neural Information Processing Systems},
  year={2021}
}

[NeurIPS 2021] Code for Unsupervised Learning of Compositional Energy Concepts

Related tags

Overview

Unsupervised Learning of Compositional Energy Concepts

Demo

Global Factor Decomposition

Local Factor Decomposition

Dataset Download

Citing our Paper

Owner

PyTorch implementation of the YOLO (You Only Look Once) v2

This is the accompanying toolbox for the paper "A Survey on GANs for Anomaly Detection"

A Comprehensive Empirical Study of Vision-Language Pre-trained Model for Supervised Cross-Modal Retrieval

The official implementation of "Rethink Dilated Convolution for Real-time Semantic Segmentation"

The all new way to turn your boring vector meshes into the new fad in town; Voxels!

Collection of common code that's shared among different research projects in FAIR computer vision team.

Implementation of Pooling by Sliced-Wasserstein Embedding (NeurIPS 2021)

Code for the paper Relation Prediction as an Auxiliary Training Objective for Improving Multi-Relational Graph Representations (AKBC 2021).

Spatial Single-Cell Analysis Toolkit

code and data for paper "GIANT: Scalable Creation of a Web-scale Ontology"

Sequence modeling benchmarks and temporal convolutional networks

[CVPR 2022] TransEditor: Transformer-Based Dual-Space GAN for Highly Controllable Facial Editing

masscan + nmap + Finger

Python wrapper class for OpenVINO Model Server. User can submit inference request to OVMS with just a few lines of code

Discovering Interpretable GAN Controls [NeurIPS 2020]

🐤 Nix-TTS: An Incredibly Lightweight End-to-End Text-to-Speech Model via Non End-to-End Distillation

Implementation of "Debiasing Item-to-Item Recommendations With Small Annotated Datasets" (RecSys '20)

In this project we combine techniques from neural voice cloning and musical instrument synthesis to achieve good results from as little as 16 seconds of target data.

NeuroLKH: Combining Deep Learning Model with Lin-Kernighan-Helsgaun Heuristic for Solving the Traveling Salesman Problem

PyTorch-centric library for evaluating and enhancing the robustness of AI technologies