Canonical Capsules: Unsupervised Capsules in Canonical Pose (NeurIPS 2021)

Last update: Dec 07, 2022

Introduction

This is the official repository for the PyTorch implementation of "Canonical Capsules: Unsupervised Capsules in Canonical Pose" by Weiwei Sun*, Andrea Tagliasacchi*, Boyang Deng, Sara Sabour, Soroosh Yazdani, Geoffrey Hinton, Kwang Moo Yi.

Download links

Project Website
PDF (arXiv)
PDF (github copy)

Citation

⚠️ If you use this source code or data in your research (in any shape or format), we require you to cite our paper as:

@conference{sun2020canonical,
   title={Canonical Capsules: Unsupervised Capsules in Canonical Pose},
   author={Weiwei Sun and Andrea Tagliasacchi and Boyang Deng and 
           Sara Sabour and Soroosh Yazdani and Geoffrey Hinton and
           Kwang Moo Yi},
   booktitle={Neural Information Processing Systems},
   year={2021}
}

Requirements

Please install dependencies with the provided environment.yml:

conda env create -f environment.yml

Datasets

We use the ShapeNet dataset as in AtlasNetV2: download the data from AtlasNetV2's official repo, then convert it into h5 files with the provided script (data_utils/ShapeNetLoader.py).
For faster experimentation, please use our 2D planes dataset, which we generated from ShapeNet (please cite both our paper and ShapeNet if you use this dataset).

Training/testing (2D)

To train the model on 2D planes (training takes only 50 epochs, and one epoch takes approximately 2.5 minutes on an NVIDIA GTX 1080 Ti):

./main.py --log_dir=plane_dim2 --indim=2 --scheduler=5
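
As a rough sanity check on the figures quoted above, the total 2D training time can be estimated from the epoch count and the per-epoch time. This is only a back-of-the-envelope sketch: the variable names are illustrative, and actual times will vary with hardware and data loading.

```python
# Rough estimate of total 2D training time from the numbers quoted above.
EPOCHS = 50                # epochs stated for the 2D planes experiment
MINUTES_PER_EPOCH = 2.5    # approximate, reported for an NVIDIA GTX 1080 Ti

total_minutes = EPOCHS * MINUTES_PER_EPOCH
hours, minutes = divmod(total_minutes, 60)
print(f"~{int(hours)} h {int(minutes)} min")  # ~2 h 5 min
```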

To visualize the decomposition and reconstruction:

./main.py --save_dir=gifs_plane2d --indim=2 --scheduler=5 --mode=vis --pt_file=logs/plane_dim2/checkpoint.pth

Training/testing (3D)

To train the model on the 3D dataset:

./main.py --log_dir=plane_dim3 --indim=3 --cat_id=-1

To test the model:

./main.py --log_dir=plane_dim3 --indim=3 --cat_id=-1 --mode=test

Note that the cat_id option selects the category whose h5 files are loaded, according to this look-up table:

id	category
-1	all
0	bench
1	cabinet
2	car
3	cellphone
4	chair
5	couch
6	firearm
7	lamp
8	monitor
9	plane
10	speaker
11	table
12	watercraft
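
For scripting runs over several categories, the table above can be mirrored as a small Python mapping. The sketch below is not part of the repository: `CAT_ID_TO_NAME` and `build_main_args` are hypothetical names, and the helper only assembles the flags already shown in this README (`--log_dir`, `--indim`, `--cat_id`, `--mode`).

```python
# Category look-up table copied from the table above (-1 selects all categories).
CAT_ID_TO_NAME = {
    -1: "all",
    0: "bench",
    1: "cabinet",
    2: "car",
    3: "cellphone",
    4: "chair",
    5: "couch",
    6: "firearm",
    7: "lamp",
    8: "monitor",
    9: "plane",
    10: "speaker",
    11: "table",
    12: "watercraft",
}


def build_main_args(cat_id, log_dir, mode=None):
    """Assemble a 3D main.py argument list (hypothetical helper, not in the repo)."""
    if cat_id not in CAT_ID_TO_NAME:
        raise ValueError(f"unknown cat_id: {cat_id}")
    args = ["./main.py", f"--log_dir={log_dir}", "--indim=3", f"--cat_id={cat_id}"]
    if mode is not None:
        # Omitting --mode matches the training command shown above.
        args.append(f"--mode={mode}")
    return args


if __name__ == "__main__":
    # Reproduces the 3D test command from this section as a single string.
    print(" ".join(build_main_args(-1, "plane_dim3", mode="test")))
```

The returned list can be passed to `subprocess.run` if the runs are to be launched from Python rather than from the shell.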

Pre-trained models (3D)

We release 3D pretrained models for both the single-category (airplanes) and the multi-category (all 13 classes) settings.

Classification

To use our classification script:

python classification.py --data_dir=/path/to/saved/features --feature_type=caca --method_type=svm --use_kpts
