Evaluating different engineering tricks that make RL work

Last update: Dec 26, 2022

This repository contains the code for the paper "Distilling Reinforcement Learning Tricks for Video Games".

Short story shorter: RL algorithms are neat and all, but getting them to work in video games (RL competitions and whatnot) involves some nifty little tricks that need a bit of expertise in the domain. These include reward shaping, curriculum learning, splitting the task into subtasks by hand, and guiding the agent's actions. We took some of these tricks and tried them on three environments with DQN. With the right setup, you get more out of DQN.
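
To make one of these tricks concrete, here is a minimal sketch of reward shaping. It is not the code used in the paper: the ShapedRewardEnv wrapper, the ToyEnv environment, the progress_bonus function and the scale value are all hypothetical names made up for this illustration, and the environment interface (reset/step returning obs, reward, done, info) is an assumption.

```python
class ShapedRewardEnv:
    """Wraps an environment and adds a scaled shaping bonus to its reward."""

    def __init__(self, env, bonus_fn, scale=0.1):
        self.env = env
        self.bonus_fn = bonus_fn  # hypothetical: maps (obs, info) to a float
        self.scale = scale

    def reset(self):
        return self.env.reset()

    def step(self, action):
        obs, reward, done, info = self.env.step(action)
        # The original reward is kept; the bonus only adds a denser signal.
        shaped = reward + self.scale * self.bonus_fn(obs, info)
        return obs, shaped, done, info


class ToyEnv:
    """Hypothetical stand-in environment: sparse reward only on the last step."""

    def __init__(self, length=3):
        self.length = length
        self.t = 0

    def reset(self):
        self.t = 0
        return self.t

    def step(self, action):
        self.t += 1
        done = self.t >= self.length
        reward = 1.0 if done else 0.0
        return self.t, reward, done, {"progress": self.t / self.length}


def progress_bonus(obs, info):
    """Hypothetical shaping signal: fraction of the episode completed."""
    return info["progress"]


def run_episode(env):
    """Runs one episode with a fixed action and returns the summed reward."""
    env.reset()
    total, done = 0.0, False
    while not done:
        _, reward, done, _ = env.step(0)
        total += reward
    return total


if __name__ == "__main__":
    print("sparse reward:", run_episode(ToyEnv()))
    print("shaped reward:", run_episode(ShapedRewardEnv(ToyEnv(), progress_bonus)))
```

The wrapper leaves the underlying environment untouched, so the same agent code can be trained with or without the shaping bonus for comparison.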

Code authors: Anssi Kanervisto, Christian Scheller and Yanick Schraner.

The experiments in the three environments are split into three git branches:

vizdoom for ViZDoom Deathmatch experiments
minerl for MineRL ObtainDiamond experiments
gfootball for Football environment experiments

To run the experiments, check out the branch you want to run experiments for with git checkout [branch name], and follow the instructions in the README file there.

After running all the experiments, collect the results as described in the respective branches. You should have three directories:

vizdoom-runs
minerl-runs
football-runs

After this, running python plot_paper.py should create a figures/learning_curves.pdf file which summarizes the results.
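
Before plotting, a small check like the one below can confirm that the three result directories are in place. This is only a sketch and not part of the repository: the missing_run_dirs helper is a hypothetical name, and it assumes the directories sit in the folder you run plot_paper.py from.

```python
from pathlib import Path

# Directory names as listed above.
EXPECTED_DIRS = ("vizdoom-runs", "minerl-runs", "football-runs")


def missing_run_dirs(root="."):
    """Returns the expected result directories that do not exist under root."""
    root = Path(root)
    return [name for name in EXPECTED_DIRS if not (root / name).is_dir()]


if __name__ == "__main__":
    missing = missing_run_dirs()
    if missing:
        print("Missing result directories:", ", ".join(missing))
    else:
        print("All result directories found.")
```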
