Implementation of the ALPHAMEPOL algorithm, presented in Unsupervised Reinforcement Learning in Multiple Environments.

Last update: Dec 23, 2021

ALPHAMEPOL

This repository contains the implementation of the ALPHAMEPOL algorithm, presented in Unsupervised Reinforcement Learning in Multiple Environments.

Installation

This codebase requires Python >= 3.6 and a working MuJoCo setup with a valid MuJoCo license. To set up MuJoCo, have a look here. To avoid conflicts with your existing Python setup and to keep this project self-contained, we suggest working in a virtual environment with virtualenv. To install virtualenv:

pip install --upgrade virtualenv

Create a virtual environment, activate it and install the requirements:

virtualenv venv
source venv/bin/activate
pip install -r requirements.txt
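
If you are unsure whether your interpreter meets the Python >= 3.6 requirement stated above, a small check like the following can be run before creating the virtual environment. This is only a convenience sketch and is not part of the repository; the helper name python_is_supported is hypothetical.

```python
import sys

# Minimum version stated in the Installation section.
MIN_PYTHON = (3, 6)


def python_is_supported(version_info=sys.version_info, minimum=MIN_PYTHON):
    """Return True if the (major, minor) version is at least `minimum`."""
    return tuple(version_info[:2]) >= minimum


if __name__ == "__main__":
    if not python_is_supported():
        sys.exit("Python >= 3.6 is required, found " + sys.version.split()[0])
    print("Python version OK: " + sys.version.split()[0])
```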

Usage

Unsupervised Pre-Training

To reproduce the Unsupervised Pre-Training experiments in the paper, run:

./scripts/exploration/[gridworld_with_slope.sh | multigrid.sh | ant.sh | minigrid.sh]

Supervised Fine-Tuning

To reproduce the Supervised Fine-Tuning experiments, run:

./scripts/goal_rl/[gridworld_with_slope.sh | multigrid.sh | ant.sh | minigrid.sh]

By default, this will launch TRPO with ALPHAMEPOL initialization. To launch TRPO with a random initialization, simply omit the policy_init argument in the scripts.

Note that the scripts for the GridWorld with Slope and MultiGrid experiments set num_goals = 50, so a single run trains on 50 goals sequentially, one goal at a time. To speed this up, you can use several processes (ideally one per goal), passing num_goals = 1 to each and incrementing the seed from one process to the next. For the Ant and MiniGrid experiments the goals are predefined, so you can also set the goal_index argument to select a specific goal (from 0 to 7 for Ant and from 0 to 12 for MiniGrid).
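
The per-goal parallelization described above can be planned with a short helper that enumerates one argument set per process. This is a minimal sketch, not code from the repository: the function names and the dictionary layout are hypothetical, the goal ranges are assumed to be inclusive, and only the argument names num_goals, seed and goal_index come from the scripts. How these values are passed to the training entry point depends on the scripts themselves, so the sketch only builds the settings and does not launch anything.

```python
from typing import Dict, List

# Goal index ranges quoted in this README, assumed inclusive:
# Ant uses 0..7 and MiniGrid uses 0..12.
PREDEFINED_GOALS = {"ant": range(0, 8), "minigrid": range(0, 13)}


def sampled_goal_jobs(total_goals: int = 50, base_seed: int = 0) -> List[Dict[str, int]]:
    """One job per goal for GridWorld with Slope / MultiGrid.

    Each job uses num_goals = 1 and its own seed, incremented from base_seed.
    """
    return [{"num_goals": 1, "seed": base_seed + i} for i in range(total_goals)]


def predefined_goal_jobs(env: str) -> List[Dict[str, int]]:
    """One job per predefined goal for Ant or MiniGrid."""
    return [{"goal_index": g} for g in PREDEFINED_GOALS[env]]


if __name__ == "__main__":
    # Print the settings for the first three per-goal processes.
    for job in sampled_goal_jobs(total_goals=3):
        print(job)
```

Each dictionary corresponds to one process; the values would then be substituted into the arguments of the relevant script in scripts/goal_rl.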

Results Visualization

Once launched, each experiment will log statistics in the results folder. You can visualize everything by launching tensorboard targeting that directory:

python -m tensorboard.main --logdir=./results --port 8080

and visiting the board at http://localhost:8080.
