Do you want a RL agent nicely moving on Atari?

Rainbow is all you need!

This is a step-by-step tutorial from DQN to Rainbow. Every chapter contains both of theoretical backgrounds and object-oriented implementation. Just pick any topic in which you are interested, and learn! You can execute them right away with Colab even on your smartphone.

Please feel free to open an issue or a pull-request if you have any idea to make it better. :)

If you want a tutorial for policy gradient methods, please see PG is All You Need.

DQN [NBViewer] [Colab]
DoubleDQN [NBViewer] [Colab]
PrioritizedExperienceReplay [NBViewer] [Colab]
DuelingNet [NBViewer] [Colab]
NoisyNet [NBViewer] [Colab]
CategoricalDQN [NBViewer] [Colab]
N-stepLearning [NBViewer] [Colab]
Rainbow [NBViewer] [Colab]

Prerequisites

This repository is tested on Anaconda virtual environment with python 3.7+

$ conda create -n rainbow-is-all-you-need python=3.7
$ conda activate rainbow-is-all-you-need

Installation

First, clone the repository.

git clone https://github.com/Curt-Park/rainbow-is-all-you-need.git
cd rainbow-is-all-you-need

Secondly, install packages required to execute the code. Just type:

make setup

Contributors

Thanks goes to these wonderful people (emoji key):

_{Jinwoo Park (Curt)}

_{Kyunghwan Kim}

_{Wei Chen}

_{WANG Lei}

_leeyaf

_ahmadF

This project follows the all-contributors specification. Contributions of any kind welcome!

Rainbow is all you need! A step-by-step tutorial from DQN to Rainbow

Related tags

Overview

Rainbow is all you need!

Contents

Prerequisites

Installation

Related Papers

Contributors

Owner

Jinwoo Park (Curt)

On the Limits of Pseudo Ground Truth in Visual Camera Re-Localization

For storing the complete exploration of Visual Question Answering for our B.Tech Project

Sequential model-based optimization with a `scipy.optimize` interface

Improving Calibration for Long-Tailed Recognition (CVPR2021)

An open-source, low-cost, image-based weed detection device for fallow scenarios.

Aws-machine-learning-university-accelerated-tab - Machine Learning University: Accelerated Tabular Data Class

Keepsake is a Python library that uploads files and metadata (like hyperparameters) to Amazon S3 or Google Cloud Storage

The code for Bi-Mix: Bidirectional Mixing for Domain Adaptive Nighttime Semantic Segmentation

Minimal But Practical Image Classifier Pipline Using Pytorch, Finetune on ResNet18, Got 99% Accuracy on Own Small Datasets.

Unsupervised Foreground Extraction via Deep Region Competition

Iowa Project - My second project done at General Assembly, focused on feature engineering and understanding Linear Regression as a concept

the official implementation of the paper "Isometric Multi-Shape Matching" (CVPR 2021)

Generic Foreground Segmentation in Images

Simple-System-Convert--C--F - Simple System Convert With Python

Sequence lineage information extracted from RKI sequence data repo

Compressed Video Action Recognition

Official repository of DeMFI (arXiv.)

Repo for my Tensorflow/Keras CV experiments. Mostly revolving around the Danbooru20xx dataset

Randomized Correspondence Algorithm for Structural Image Editing

Sign Language Translation with Transformers (COLING'2020, ECCV'20 SLRTP Workshop)