Automatic Data-Regularized Actor-Critic (Auto-DrAC)

Last update: Dec 13, 2022

Related tags

Deep Learning auto-drac

Overview

Auto-DrAC: Automatic Data-Regularized Actor-Critic

This is a PyTorch implementation of the methods proposed in

Automatic Data Augmentation for Generalization in Deep Reinforcement Learning by

Roberta Raileanu, Max Goldstein, Denis Yarats, Ilya Kostrikov, and Rob Fergus.

Citation

If you use this code in your own work, please cite our paper:

@article{raileanu2020automatic,
  title={Automatic Data Augmentation for Generalization in Deep Reinforcement Learning},
  author={Raileanu, Roberta and Goldstein, Max and Yarats, Denis and Kostrikov, Ilya and Fergus, Rob},
  journal={arXiv preprint arXiv:2006.12862},
  year={2020}
}

Requirements

The code was run on a GPU with CUDA 10.2. To install all the required dependencies:

conda create -n auto-drac python=3.7
conda activate auto-drac

git clone [email protected]:rraileanu/auto-drac.git
cd auto-drac
pip install -r requirements.txt

git clone https://github.com/openai/baselines.git
cd baselines 
python setup.py install 

pip install procgen

Instructions

cd auto-drac

Train DrAC with crop augmentation on BigFish

python train.py --env_name bigfish --aug_type crop

Train UCB-DrAC on BigFish

python train.py --env_name bigfish --use_ucb

Train RL2-DrAC on BigFish

python train.py --env_name bigfish --use_rl2

Train Meta-DrAC on BigFish

python train.py --env_name bigfish --use_meta

Procgen Results

UCB-DrAC achieves state-of-the-art performance on the Procgen benchmark (easy mode), significantly improving the agent's generalization ability over standard RL methods such as PPO.

Test Results on Procgen

Train Results on Procgen

Agent Videos

You can find some videos of the agent's behavior while training on our website.

Acknowledgements

This code was based on an open sourced PyTorch implementation of PPO.

We also used kornia for some of the augmentations.

Automatic Data-Regularized Actor-Critic (Auto-DrAC)

Related tags

Overview

Auto-DrAC: Automatic Data-Regularized Actor-Critic

Citation

Requirements

Instructions

Train DrAC with crop augmentation on BigFish

Train UCB-DrAC on BigFish

Train RL2-DrAC on BigFish

Train Meta-DrAC on BigFish

Procgen Results

Agent Videos

Acknowledgements

Owner

Honours project, on creating a depth estimation map from two stereo images of featureless regions

The PyTorch re-implement of a 3D CNN Tracker to extract coronary artery centerlines with state-of-the-art (SOTA) performance. (paper: 'Coronary artery centerline extraction in cardiac CT angiography using a CNN-based orientation classiﬁer')

A Planar RGB-D SLAM which utilizes Manhattan World structure to provide optimal camera pose trajectory while also providing a sparse reconstruction containing points, lines and planes, and a dense surfel-based reconstruction.

Minimal PyTorch implementation of Generative Latent Optimization from the paper "Optimizing the Latent Space of Generative Networks"

A basic neural network for image segmentation.

Official code for "Mean Shift for Self-Supervised Learning"

An open-source Kazakh named entity recognition dataset (KazNERD), annotation guidelines, and baseline NER models.

A Jupyter notebook to play with NVIDIA's StyleGAN3 and OpenAI's CLIP for a text-based guided image generation.

A Simple Key-Value Data-store written in Python

Discriminative Condition-Aware PLDA

(3DV 2021 Oral) Filtering by Cluster Consistency for Large-Scale Multi-Image Matching

Code for intrusion detection system (IDS) development using CNN models and transfer learning

Data and code for ICCV 2021 paper Distant Supervision for Scene Graph Generation.

General neural ODE and DAE modules for power system dynamic modeling.

Election Exit Poll Prediction and U.S.A Presidential Speech Analysis using Machine Learning

A repo for Causal Imitation Learning under Temporally Correlated Noise

Code for our EMNLP 2021 paper “Heterogeneous Graph Neural Networks for Keyphrase Generation”

ByteTrack: Multi-Object Tracking by Associating Every Detection Box

An efficient PyTorch implementation of the winning entry of the 2017 VQA Challenge.

Official PyTorch implementation of Joint Object Detection and Multi-Object Tracking with Graph Neural Networks