Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021]

Last update: Jan 01, 2023

Overview

Offline Meta-Reinforcement Learning with Advantage Weighting (MACAW)

MACAW code used for the experiments in the ICML 2021 paper.

Installing the environment

# Install Python 3.7.9 if necessary
$ pyenv install 3.7.9
$ pyenv shell 3.7.9

$ python --version
Python 3.7.9

$ python -m venv env
$ source env/bin/activate
$ pip install -r requirements.txt

Downloading the data

The offline data used for MACAW can be found here. Download it and keep the default folder name (macaw_offline_data) for the folder that contains the four data directories. If downloading through the Google Drive web interface is not an option, gDrive might be useful.
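After downloading, a quick sanity check can confirm that the folder holds the four data directories mentioned above. The snippet below is a minimal sketch, not part of the repository: the helper name count_data_dirs is hypothetical, and it assumes the macaw_offline_data folder sits in the current working directory.

```shell
# Hypothetical helper (not part of the MACAW repo): print the number of
# immediate subdirectories inside the data folder.
count_data_dirs() {
    data_root="${1:-macaw_offline_data}"   # default folder name from the README
    if [ ! -d "$data_root" ]; then
        echo "missing data folder: $data_root" >&2
        return 1
    fi
    # Count only first-level directories; tr strips padding some wc builds add.
    find "$data_root" -mindepth 1 -maxdepth 1 -type d | wc -l | tr -d ' '
}

# Assumption: the data folder is in the current directory.
count_data_dirs macaw_offline_data || echo "download the data first"
```

If the download is complete, the printed count should match the four data directories; any other number suggests a partial download or an extra level of nesting.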

Running MACAW 🦜

Run offline meta-training with periodic online evaluations using any of the scripts in scripts/, e.g.:

$ . scripts/macaw_dir.sh # MACAW training on Cheetah-Direction (Figure 1)
$ . scripts/macaw_vel.sh # MACAW training on Cheetah-Velocity (Figure 1)
$ . scripts/macaw_quality_ablation.sh # Data quality ablation (Figure 5-left)
...

Outputs (tensorboard logs) will be written to the log/ directory.
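To inspect those logs, something like the following should work. This is a sketch under assumptions: the wrapper name view_macaw_logs is hypothetical, and it assumes the tensorboard command is available in the activated environment and that you run it from the directory containing log/.

```shell
# Hypothetical wrapper (not part of the MACAW repo) around the standard
# TensorBoard command-line interface.
view_macaw_logs() {
    log_dir="${1:-log}"   # log/ is the output directory named in the README
    if [ ! -d "$log_dir" ]; then
        echo "no log directory at $log_dir yet; run a training script first" >&2
        return 1
    fi
    # Starts a local TensorBoard server reading event files under log_dir.
    tensorboard --logdir "$log_dir"
}
```

Calling view_macaw_logs with no argument reads from log/; pass another path if the logs were moved.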

Reach out!

If you're having issues with the code or data, feel free to open an issue or send me an email.

Citation

If our code or research was useful for your own work, you can cite us with the following attribution:

@InProceedings{mitchell2021offline,
    title = {Offline Meta-Reinforcement Learning with Advantage Weighting},
    author = {Mitchell, Eric and Rafailov, Rafael and Peng, Xue Bin and Levine, Sergey and Finn, Chelsea},
    booktitle = {Proceedings of the 38th International Conference on Machine Learning},
    year = {2021}
}

Owner

Eric Mitchell
