Official Implementation of 'UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers' ICLR 2021(spotlight)

Last update: Dec 22, 2022

Related tags

Deep Learning UPDeT

Overview

UPDeT

Official Implementation of UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers (ICLR 2021 spotlight)

The framework is inherited from PyMARL. UPDeT is written in pytorch and uses SMAC as its environment.

Installation instructions

Installing dependencies:

pip install -r requirements.txt

Download SC2 into the `3rdparty/` folder and copy the maps necessary to run over.

bash install_sc2.sh

Run an experiment

Before training your own transformer-based multi-agent model, there are a list of things to note.

Currently, this repository supports marine-based battle scenarios. e.g. 3m, 8m, 5m_vs_6m.
If you are interested in training a different unit type, carefully modify the Transformer Parameters block at src/config/default.yaml and revise the _build_input_transformer function in basic_controller.python.
Before running the experiment, check the agent type in Agent Parameters block at src/config/default.yaml.
This repository contains two new transformer-based agents from the UPDeT paper including
- Standard UPDeT
- Aggregation Transformer

Training script

python3 src/main.py --config=vdn --env-config=sc2 with env_args.map_name=5m_vs_6m

All results will be stored in the Results/ folder.

Performance

Single battle scenario

Surpass the GRU baseline on hard 5m_vs_6m with:

Multiple battle scenarios

Zero-shot generalize to different tasks:

Result on 7m-5m-3m transfer learning.

Note: Only UPDeT can be deployed to other scenarios without changing the model's architecture.

More details please refer to UPDeT paper.

Bibtex

@article{hu2021updet,
  title={UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers},
  author={Hu, Siyi and Zhu, Fengda and Chang, Xiaojun and Liang, Xiaodan},
  journal={arXiv preprint arXiv:2101.08001},
  year={2021}
}

License

The MIT License

Official Implementation of 'UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers' ICLR 2021(spotlight)

Related tags

Overview

UPDeT

Installation instructions

Installing dependencies:

Download SC2 into the `3rdparty/` folder and copy the maps necessary to run over.

Run an experiment

Training script

Performance

Single battle scenario

Multiple battle scenarios

Bibtex

License

Owner

hhhusiyi

Deep motion generator collections

PaddleBoBo是基于PaddlePaddle和PaddleSpeech、PaddleGAN等开发套件的虚拟主播快速生成项目

Code in PyTorch for the convex combination linear IAF and the Householder Flow, J.M. Tomczak & M. Welling

The repo of the preprinting paper "Labels Are Not Perfect: Inferring Spatial Uncertainty in Object Detection"

Fast and robust certifiable relative pose estimation

PyTorch implementation of "Learning to Discover Cross-Domain Relations with Generative Adversarial Networks"

A PyTorch implementation of DenseNet.

NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling

OpenABC-D: A Large-Scale Dataset For Machine Learning Guided Integrated Circuit Synthesis

PyTorch implementation of NeurIPS 2021 paper: "CoFiNet: Reliable Coarse-to-fine Correspondences for Robust Point Cloud Registration"

A real-time approach for mapping all human pixels of 2D RGB images to a 3D surface-based model of the body

KUIELAB-MDX-Net got the 2nd place on the Leaderboard A and the 3rd place on the Leaderboard B in the MDX-Challenge ISMIR 2021

Plato: A New Framework for Federated Learning Research

Method for facial emotion recognition compitition of Xunfei and Datawhale .

Unofficial implementation of Google "CutPaste: Self-Supervised Learning for Anomaly Detection and Localization" in PyTorch

Morphable Detector for Object Detection on Demand

A simple editor for captions in .SRT file extension

Free course that takes you from zero to Reinforcement Learning PRO 🦸🏻‍🦸🏽

Dynamic Divide-and-Conquer Adversarial Training for Robust Semantic Segmentation （ICCV2021）

A collection of inference modules for fastai2

Official Implementation of 'UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers' ICLR 2021(spotlight)

Related tags

Overview

UPDeT

Installation instructions

Installing dependencies:

Download SC2 into the 3rdparty/ folder and copy the maps necessary to run over.

Run an experiment

Training script

Performance

Single battle scenario

Multiple battle scenarios

Bibtex

License

Owner

hhhusiyi

Deep motion generator collections

PaddleBoBo是基于PaddlePaddle和PaddleSpeech、PaddleGAN等开发套件的虚拟主播快速生成项目

Code in PyTorch for the convex combination linear IAF and the Householder Flow, J.M. Tomczak & M. Welling

The repo of the preprinting paper "Labels Are Not Perfect: Inferring Spatial Uncertainty in Object Detection"

Fast and robust certifiable relative pose estimation

PyTorch implementation of "Learning to Discover Cross-Domain Relations with Generative Adversarial Networks"

A PyTorch implementation of DenseNet.

NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling

OpenABC-D: A Large-Scale Dataset For Machine Learning Guided Integrated Circuit Synthesis

PyTorch implementation of NeurIPS 2021 paper: "CoFiNet: Reliable Coarse-to-fine Correspondences for Robust Point Cloud Registration"

A real-time approach for mapping all human pixels of 2D RGB images to a 3D surface-based model of the body

KUIELAB-MDX-Net got the 2nd place on the Leaderboard A and the 3rd place on the Leaderboard B in the MDX-Challenge ISMIR 2021

Plato: A New Framework for Federated Learning Research

Method for facial emotion recognition compitition of Xunfei and Datawhale .

Unofficial implementation of Google "CutPaste: Self-Supervised Learning for Anomaly Detection and Localization" in PyTorch

Morphable Detector for Object Detection on Demand

A simple editor for captions in .SRT file extension

Free course that takes you from zero to Reinforcement Learning PRO 🦸🏻‍🦸🏽

Dynamic Divide-and-Conquer Adversarial Training for Robust Semantic Segmentation （ICCV2021）

A collection of inference modules for fastai2

Download SC2 into the `3rdparty/` folder and copy the maps necessary to run over.