RE3: State Entropy Maximization with Random Encoders for Efficient Exploration

Last update: Nov 29, 2022

Related tags

Overview

State Entropy Maximization with Random Encoders for Efficient Exploration (RE3) (ICML 2021)

Code for State Entropy Maximization with Random Encoders for Efficient Exploration.

In this repository, we provide code for RE3 algorithm described in the paper linked above. We provide code in three sub-directories: rad_re3 containing code for the combination of RE3 and RAD, dreamer_re3 containing code for the combination of RE3 and Dreamer, and a2c_re3 containing code for the combination of RE3 and A2C.

We also provide raw data(.csv) and code for visualization in the data directory.

If you find this repository useful for your research, please cite:

@inproceedings{seo2021state,
  title={State Entropy Maximization with Random Encoders for Efficient Exploration},
  author={Seo, Younggyo and Chen, Lili and Shin, Jinwoo and Lee, Honglak and Abbeel, Pieter and Lee, Kimin},
  booktitle={International Conference on Machine Learning},
  year={2021}
}

RAD + RE3

Our code is built on top of the DrQ repository.

Installation

You could install all dependencies by following command:

conda env install -f conda_env.yml

You should also install custom version of dm_control to run experiments on Walker Run Sparse and Cheetah Run Sparse. You could do this by following command:

cd ../envs/dm_control
pip install .

Instructions

RAD

python train.py env=hopper_hop batch_size=512 action_repeat=2 logdir=runs_rad_re3 use_state_entropy=false

RAD + RE3

python train.py env=hopper_hop batch_size=512 action_repeat=2 logdir=runs_rad_re3

We provide all scripts to reproduce Figure 4 (RAD, RAD + RE3) in scripts directory.

Dreamer + RE3

Our code is built on top of the Dreamer repository.

Installation

You could install all dependencies by following command:

pip3 install --user tensorflow-gpu==2.2.0
pip3 install --user tensorflow_probability
pip3 install --user git+git://github.com/deepmind/dm_control.git
pip3 install --user pandas
pip3 install --user matplotlib

# Install custom dm_control environments for walker_run_sparse / cheetah_run_sparse
cd ../envs/dm_control
pip3 install .

Instructions

Dreamer

python dreamer.py --logdir ./logdir/dmc_pendulum_swingup/dreamer/12345 --task dmc_pendulum_swingup --precision 32 --beta 0.0 --seed 12345

Dreamer + RE3

python dreamer.py --logdir ./logdir/dmc_pendulum_swingup/dreamer_re3/12345 --task dmc_pendulum_swingup --precision 32 --k 53 --beta 0.1 --seed 12345

We provide all scripts to reproduce Figure 4 (Dreamer, Dreamer + RE3) in scripts directory.

A2C + RE3

Training code can be found in rl-starter-files directory, which is forked from rl-starter-files, which uses a modified A2C implementation from torch-ac. Note that currently there is only support for A2C.

Installation

All of the dependencies are in the requirements.txt file in rl-starter-files. They can be installed manually or with the following command:

pip3 install -r requirements.txt

You will also need to install our cloned version of torch-ac with these commands:

cd torch-ac
pip3 install -e .

Instructions

See instructions in rl-starter-files directory. Example scripts can be found in rl-starter-files/rl-starter-files/run_sent.sh.

RE3: State Entropy Maximization with Random Encoders for Efficient Exploration

Related tags

Overview

State Entropy Maximization with Random Encoders for Efficient Exploration (RE3) (ICML 2021)

RAD + RE3

Installation

Instructions

RAD

RAD + RE3

Dreamer + RE3

Installation

Instructions

Dreamer

Dreamer + RE3

A2C + RE3

Installation

Instructions

Owner

Younggyo Seo

Official code for "Distributed Deep Learning in Open Collaborations" (NeurIPS 2021)

Python Library for learning (Structure and Parameter) and inference (Statistical and Causal) in Bayesian Networks.

Source code of "Hold me tight! Influence of discriminative features on deep network boundaries"

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

Pre-Training Graph Neural Networks for Cold-Start Users and Items Representation.

Official implementation of "Dynamic Anchor Learning for Arbitrary-Oriented Object Detection" (AAAI2021).

A Flow-based Generative Network for Speech Synthesis

Official implementation of "CrossPoint: Self-Supervised Cross-Modal Contrastive Learning for 3D Point Cloud Understanding" (CVPR, 2022)

Repository for the Bias Benchmark for QA dataset.

Graph Transformer Architecture. Source code for

naked is a Python tool which allows you to strip a model and only keep what matters for making predictions.

PyTorch code of my WACV 2022 paper Improving Model Generalization by Agreement of Learned Representations from Data Augmentation

Differentiable Wavetable Synthesis

[Official] Exploring Temporal Coherence for More General Video Face Forgery Detection(ICCV 2021)

Pytorch ImageNet1k Loader with Bounding Boxes.

OntoProtein: Protein Pretraining With Ontology Embedding

Robust Lane Detection via Expanded Self Attention (WACV 2022)

Eff video representation - Efficient video representation through neural fields

Self-supervised Product Quantization for Deep Unsupervised Image Retrieval - ICCV2021

Code for the SIGGRAPH 2021 paper "Consistent Depth of Moving Objects in Video".