PAIRED in PyTorch 🔥

Last update: Dec 12, 2022

Related tags

Overview

PAIRED

This codebase provides a PyTorch implementation of Protagonist Antagonist Induced Regret Environment Design (PAIRED), which was first introduced in "Emergent Complexity and Zero-Shot Transfer via Unsupervised Environment Design" (Dennis et al, 2020). This implementation comes integrated with custom adversarial maze environments based on MiniGrid environment (Chevalier-Boisvert et al, 2018), as used in Dennis et al, 2020.

Unsupervised environment design (UED) methods propose a curriculum of tasks or environment instances (levels) that aims to foster more sample efficient learning and robust policies. PAIRED performs unsupervised environment design (UED) using a three-player game among two student agents—the protagonist and antagonist—and an adversary. The antagonist is allied with the adversary, which proposes new environment instances (or levels) aiming to maximize the regret of the protagonist, estimated as the difference in returns achieved by the student agents across a batch of rollouts on proposed levels.

PAIRED has a strong guarantee of robustness in that at Nash equilibrium, it provably induces a minimax regret policy for the protagonist, which means that the protagonist optimally trades off regret across all possible levels that can be proposed by the adversary.

UED algorithms included

PAIRED (Protagonist Antagonist Induced Regret Environment Design)
Minimax
Domain randomization

Set up

To install the necessary dependencies, run the following commands:

conda create --name paired python=3.8
conda activate paired
pip install -r requirements.txt

git clone https://github.com/openai/baselines.git
cd baselines
pip install -e .
cd ..

Configuration

Detailed descriptions of the various command-line arguments for the main training script, train.py can be found in arguments.py.

Experiments

For convenience, configuration json files are provided to generate the commands to run the specific experimental settings featured in Dennis et al, 2020. To generate the command to launch 1 run of the experiment codified by the configuration file config.json in the local folder train_scripts/configs, simply run the following, and copy and paste the output into your command line.

python train_scripts/make_cmd.py --json config --num_trials 1

Alternatively, you can run the following to copy the command directly to your clipboard:

python train_scripts/make_cmd.py --json config --num_trials 1 | pbcopy

By default, each experiment run will generate a folder in ~/logs/paired named after the --xpid argument passed into the the train command. This folder will contain log outputs in logs.csv and periodic screenshots of generated levels in the directory screenshots. Each screenshot uses the naming convention update_<number of PPO updates>.png. The latest model checkpoint will be output to model.tar, and archived model checkpoints are also saved according to the naming convention model_<number of PPO updates>.tar.

The json files for reproducing various MiniGrid experiments from Dennis et al, 2020 are listed below:

Method	json config
PAIRED	minigrid/paired.json
Minimax	minigrid/minimax.json
DR	minigrid/dr.json

Evaluation

You can use the following command to batch evaluate all trained models whose output directory shares the same <xpid_prefix> before the indexing _[0-9]+ suffix:

python -m eval \
--base_path "~/logs/paired" \
--prefix '<xpid prefix>' \
--num_processes 2 \
--env_names \
'MultiGrid-SixteenRooms-v0,MultiGrid-Labyrinth-v0,MultiGrid-Maze-v0'
--num_episodes 100 \
--model_tar model

PAIRED in PyTorch 🔥

Related tags

Overview

PAIRED

UED algorithms included

Set up

Configuration

Experiments

Evaluation

Owner

UCL DARK Lab

torchsummaryDynamic: support real FLOPs calculation of dynamic network or user-custom PyTorch ops

Collection of TensorFlow2 implementations of Generative Adversarial Network varieties presented in research papers.

Source code to accompany Defunctland's video "FASTPASS: A Complicated Legacy"

Code for our paper "Interactive Analysis of CNN Robustness"

ivadomed is an integrated framework for medical image analysis with deep learning.

This repository contains PyTorch models for SpecTr (Spectral Transformer).

Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".

Code for the TPAMI paper: "Syntax Customized Video Captioning by Imitating Exemplar Sentences"

Official PyTorch implementation of the paper "Self-Supervised Relational Reasoning for Representation Learning", NeurIPS 2020 Spotlight.

Denoising Diffusion Implicit Models

Structured Edge Detection Toolbox

Решения, подсказки, тесты и утилиты для тренировки по алгоритмам от Яндекса.

A web porting for NVlabs' StyleGAN2, to facilitate exploring all kinds characteristic of StyleGAN networks

Draw like Bob Ross using the power of Neural Networks (With PyTorch)!

Occlusion robust 3D face reconstruction model in CFR-GAN (WACV 2022)

This application explain how we can easily integrate Deepface framework with Python Django application

Do you like Quick, Draw? Well what if you could train/predict doodles drawn inside Streamlit? Also draws lines, circles and boxes over background images for annotation.

The PyTorch implementation of Directed Graph Contrastive Learning (DiGCL), NeurIPS-2021

AI virtual gym is an AI program which can be used to exercise and can be used to see if we are doing the exercises

A machine learning malware analysis framework for Android apps.