Rethinking the Importance of Implementation Tricks in Multi-Agent Reinforcement Learning

Last update: Dec 25, 2022

Overview

MARL Tricks

Code for RIIT: Rethinking the Importance of Implementation Tricks in Multi-Agent Reinforcement Learning. We implemented the SOTA MARL algorithms and standardized their hyperparameters.

Python MARL framework

PyMARL is WhiRL's framework for deep multi-agent reinforcement learning and includes implementations of the following algorithms:

Value-based Methods:

Actor Critic Methods:

PyMARL is written in PyTorch and uses SMAC as its environment.

Installation instructions

Install Python packages

# requires Anaconda 3 or Miniconda 3
bash install_dependecies.sh

Set up StarCraft II and SMAC:

bash install_sc2.sh

This downloads SC2 into the 3rdparty folder and copies over the maps needed to run the experiments.

Run an experiment

# For SMAC
python3 src/main.py --config=qmix --env-config=sc2 with env_args.map_name=corridor

# For Cooperative Predator-Prey
python3 src/main.py --config=qmix_prey --env-config=stag_hunt with env_args.map_name=stag_hunt

The config files act as defaults for an algorithm or environment.

They are all located in src/config: --config refers to the config files in src/config/algs, and --env-config refers to the config files in src/config/envs.
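To make the layering of defaults and overrides concrete, here is a minimal Python sketch of how values from config files and `with key=value` arguments could be combined. The dictionaries stand in for the contents of files under src/config, and the helper names (`recursive_update`, `apply_override`) and all default values are illustrative assumptions, not PyMARL's actual code.

```python
from copy import deepcopy


def recursive_update(base, extra):
    """Merge `extra` into a copy of `base`, descending into nested dicts."""
    merged = deepcopy(base)
    for key, value in extra.items():
        if isinstance(value, dict) and isinstance(merged.get(key), dict):
            merged[key] = recursive_update(merged[key], value)
        else:
            merged[key] = deepcopy(value)
    return merged


def apply_override(config, assignment):
    """Apply one 'a.b=value' override in place; the value stays a string here."""
    dotted_key, value = assignment.split("=", 1)
    *parents, leaf = dotted_key.split(".")
    node = config
    for name in parents:
        node = node.setdefault(name, {})
    node[leaf] = value
    return config


# Hypothetical stand-ins for an environment config and an algorithm config.
env_defaults = {"env": "sc2", "env_args": {"map_name": "3m", "difficulty": "7"}}
alg_defaults = {"name": "qmix", "epsilon_anneal_time": 100000}

config = recursive_update(env_defaults, alg_defaults)
config = apply_override(config, "env_args.map_name=corridor")
print(config["env_args"])
```

In this sketch the command-line override wins over the file defaults, and keys that are not overridden (such as the hypothetical `difficulty`) keep their default values.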

Run parallel experiments:

# bash run.sh config_name map_name_list (threads_num arg_list gpu_list experiments_num)
bash run.sh qmix corridor 2 epsilon_anneal_time=500000 0,1 5

Each xxx_list argument is a comma-separated list.

All results are stored in the Results folder and named after map_name.
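As an illustration of how the list arguments fan out into individual runs, the Python sketch below expands a map list, an argument list, a GPU list, and a run count into separate commands. The scheduling shown (one run per map per repetition, GPUs assigned round-robin through CUDA_VISIBLE_DEVICES) is an assumption rather than a transcription of run.sh, `build_jobs` is a hypothetical helper, and the thread limit is left out for brevity.

```python
from itertools import cycle


def build_jobs(config, map_list, arg_list="", gpu_list="0", runs=1, env_config="sc2"):
    """Return (gpu, command) pairs, one per map and repetition."""
    maps = [m for m in map_list.split(",") if m]
    extra_args = [a for a in arg_list.split(",") if a]
    gpus = cycle(g for g in gpu_list.split(",") if g)  # round-robin GPU assignment
    jobs = []
    for map_name in maps:
        for _ in range(runs):
            command = [
                "python3", "src/main.py",
                f"--config={config}", f"--env-config={env_config}",
                "with", f"env_args.map_name={map_name}", *extra_args,
            ]
            jobs.append((next(gpus), " ".join(command)))
    return jobs


# Mirrors: bash run.sh qmix corridor 2 epsilon_anneal_time=500000 0,1 5
jobs = build_jobs("qmix", "corridor", "epsilon_anneal_time=500000", "0,1", runs=5)
for gpu, command in jobs:
    print(f"CUDA_VISIBLE_DEVICES={gpu} {command}")
```

Under these assumptions the example command yields five runs on the corridor map, alternating between GPUs 0 and 1.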

Force all processes to exit

# All Python and game processes of the current user will be terminated.
bash clean.sh

Some test results on Super Hard scenarios

Cite

@article{hu2021riit,
      title={RIIT: Rethinking the Importance of Implementation Tricks in Multi-Agent Reinforcement Learning}, 
      author={Jian Hu and Haibin Wu and Seth Austin Harding and Siyang Jiang and Shih-wei Liao},
      year={2021},
      eprint={2102.03479},
      archivePrefix={arXiv},
      primaryClass={cs.LG}
}
