DrQ-v2: Improved Data-Augmented Reinforcement Learning

Last update: Jan 01, 2023

Related tags

Overview

DrQ-v2: Improved Data-Augmented RL Agent

Method

DrQ-v2 is a model-free off-policy algorithm for image-based continuous control. DrQ-v2 builds on DrQ, an actor-critic approach that uses data augmentation to learn directly from pixels. We introduce several improvements including:

Switch the base RL learner from SAC to DDPG.
Incorporate n-step returns to estimate TD error.
Introduce a decaying schedule for exploration noise.
Make implementation 3.5 times faster.
Find better hyper-parameters.

These changes allow us to significantly improve sample efficiency and wall-clock training time on a set of challening tasks from the DeepMind Control Suite compared to prior methods. Furthermore, DrQ-v2 is able to solve complex humanoid locomotion tasks directly from pixel observations, previously unattained by model-free RL.

Citation

If you use this repo in your research, please consider citing the paper as follows:

@article{yarats2021drqv2,
  title={Mastering Visual Continuous Control: Improved Data-Augmented Reinforcement Learning},
  author={Denis Yarats and Rob Fergus and Alessandro Lazaric and Lerrel Pinto},
  journal={arXiv preprint arXiv:},
  year={2021}
}

Instructions

Install dependencies:

conda env create -f conda_env.yml
conda activate drqv2

Train the agent:

python train.py task=quadruped_walk

Monitor results:

tensorboard --logdir exp_local

License

The majority of DrQ-v2 is licensed under the MIT license, however portions of the project are available under separate license terms: DeepMind is licensed under the Apache 2.0 license.

DrQ-v2: Improved Data-Augmented Reinforcement Learning

Related tags

Overview

DrQ-v2: Improved Data-Augmented RL Agent

Method

Citation

Instructions

License

Owner

Facebook Research

95.47% on CIFAR10 with PyTorch

Repo for the Video Person Clustering dataset, and code for the associated paper

The repository contains reproducible PyTorch source code of our paper Generative Modeling with Optimal Transport Maps, ICLR 2022.

Python KNN model: Predicting a probability of getting a work visa. Tableau: Non-immigrant visas over the years.

PyTorch implementation of the end-to-end coreference resolution model with different higher-order inference methods.

Learning Compatible Embeddings, ICCV 2021

[ICCV 2021] FaPN: Feature-aligned Pyramid Network for Dense Image Prediction

Implementation of Neural Style Transfer in Pytorch

duralava is a neural network which can simulate a lava lamp in an infinite loop.

PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis

So-ViT: Mind Visual Tokens for Vision Transformer

DLL: Direct Lidar Localization

Python script for performing depth completion from sparse depth and rgb images using the msg_chn_wacv20. model in ONNX

Learning cell communication from spatial graphs of cells

I created My own Virtual Artificial Intelligence named genesis, He can assist with my Tasks and also perform some analysis,,

This project is used for the paper Differentiable Programming of Isometric Tensor Network

To prepare an image processing model to classify the type of disaster based on the image dataset

Fully convolutional networks for semantic segmentation

CVPR 2021 - Official code repository for the paper: On Self-Contact and Human Pose.

The official implementation for "FQ-ViT: Fully Quantized Vision Transformer without Retraining".