Visual Adversarial Imitation Learning using Variational Models (VMAIL)

This is the official implementation of the NeurIPS 2021 paper.

Method

VMAIL simultaneously learns a variational dynamics model and trains an on-policy adversarial imitation learning algorithm in the latent space using only model-based rollouts. This allows for stable and sample efficient training, as well as zero-shot imitation learning by transfering the learned dynamics model

Instructions

Get dependencies:

conda env create -f vmail.yml
conda activate vmail
cd robel_claw/robel
pip install -e .

To train agents for each environmnet download the expert data from the provided link and run:

python3 -u vmail.py --logdir .logdir --expert_datadir expert_datadir

The training will generate tensorabord plots and GIFs in the log folder:

tensorboard --logdir ./logdir

Citation

If you find this code useful, please reference in your paper:

@article{rafailov2021visual,
      title={Visual Adversarial Imitation Learning using Variational Models}, 
      author={Rafael Rafailov and Tianhe Yu and Aravind Rajeswaran and Chelsea Finn},
      year={2021},
      journal={Neural Information Processing Systems}
}

Visual Adversarial Imitation Learning using Variational Models (VMAIL)

Related tags

Overview

Visual Adversarial Imitation Learning using Variational Models (VMAIL)

Method

Instructions

Citation

Owner

Differential fuzzing for the masses!

Text Summarization - WCN — Weighted Contextual N-gram method for evaluation of Text Summarization

IPATool-py: download ipa easily

Video Instance Segmentation using Inter-Frame Communication Transformers (NeurIPS 2021)

VIL-100: A New Dataset and A Baseline Model for Video Instance Lane Detection (ICCV 2021)

Node for thenewboston digital currency network.

Tensorflow implementation for "Improved Transformer for High-Resolution GANs" (NeurIPS 2021).

A library for building and serving multi-node distributed faiss indices.

Deep Learning Visuals contains 215 unique images divided in 23 categories

DAFNe: A One-Stage Anchor-Free Deep Model for Oriented Object Detection

LineBoard - Python+React+MySQL-白板即時系統改善人群行為

Create UIs for prototyping your machine learning model in 3 minutes

MAVE: : A Product Dataset for Multi-source Attribute Value Extraction

Keyword-BERT: Keyword-Attentive Deep Semantic Matching

GULAG: GUessing LAnGuages with neural networks

This is official implementaion of paper "Token Shift Transformer for Video Classification".

Replication Code for "Self-Supervised Bug Detection and Repair" NeurIPS 2021

Single Image Deraining Using Bilateral Recurrent Network (TIP 2020)

Encode and decode text application