Contrastively Disentangled Sequential Variational Autoencoder

Last update: Dec 24, 2022

Contrastively Disentangled Sequential Variational Autoencoder (C-DSVAE)

Overview

This repository contains the implementation of C-DSVAE, a novel self-supervised method for learning disentangled sequential representations.

Requirements

Python 3
PyTorch 1.7
NumPy 1.18.5

Dataset

Sprites

We provide the raw Sprites .npy files. The dataset is also available from a third-party repo.

For each split (train/test), we expect the following components for each sequence sample:

x: raw sample of shape [8, 3, 64, 64]
c_aug: content augmentation of shape [8, 3, 64, 64]
m_aug: motion augmentation of shape [8, 3, 64, 64]
motion factors: action (3 classes), direction (3 classes)
content factors: skin, tops, pants, hair (each with 6 classes)
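The layout above can be checked before training with a small sanity test. The following is a minimal sketch, not the repository's own loader: it assumes each sample is a dict keyed by the component names, that every factor label is stored as an integer class index starting at 0, and that the shape is ordered [frames, channels, height, width]. The names `check_sample` and `make_dummy_sample` are hypothetical helpers introduced only for illustration.

```python
import numpy as np

# Expected layout, taken from the component list above.
SEQ_SHAPE = (8, 3, 64, 64)  # assumed order: frames, channels, height, width
MOTION_FACTORS = {"action": 3, "direction": 3}
CONTENT_FACTORS = {"skin": 6, "tops": 6, "pants": 6, "hair": 6}


def check_sample(sample):
    """Return True if `sample` matches the expected layout, else raise ValueError.

    Assumption: `sample` is a dict; the real data files may be organized differently.
    """
    for key in ("x", "c_aug", "m_aug"):
        arr = np.asarray(sample[key])
        if arr.shape != SEQ_SHAPE:
            raise ValueError(f"{key}: expected shape {SEQ_SHAPE}, got {arr.shape}")
    all_factors = {**MOTION_FACTORS, **CONTENT_FACTORS}
    for name, n_classes in all_factors.items():
        label = int(sample[name])
        # Assumption: labels are class indices in [0, n_classes).
        if not 0 <= label < n_classes:
            raise ValueError(f"{name}: label {label} outside [0, {n_classes})")
    return True


def make_dummy_sample(rng):
    """Build a random sample with the expected shapes (for testing only)."""
    sample = {
        key: rng.random(SEQ_SHAPE, dtype=np.float32)
        for key in ("x", "c_aug", "m_aug")
    }
    for name, n_classes in {**MOTION_FACTORS, **CONTENT_FACTORS}.items():
        sample[name] = int(rng.integers(0, n_classes))
    return sample


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    print(check_sample(make_dummy_sample(rng)))
```

Running a check like this on a few samples from each split catches a channel-last array or an out-of-range label early, before it surfaces as a shape error inside the model.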

Running

Train

./run_cdsvae.sh

Test

./run_test_sprite.sh

Classification Judge

The judge classifiers are pretrained with full supervision separately.

Sprites judge

C-DSVAE Checkpoints

We provide a sample Sprites checkpoint. Checkpoint parameters can be found in ./run_test_sprite.sh.

Paper

If you are inspired by our work, please cite the following paper:

@inproceedings{bai2021contrastively,
  title={Contrastively Disentangled Sequential Variational Autoencoder},
  author={Bai, Junwen and Wang, Weiran and Gomes, Carla},
  booktitle={Advances in Neural Information Processing Systems},
  volume={},
  year={2021}
}

Owner

Junwen Bai
