A Pytorch implementation of the multi agent deep deterministic policy gradients (MADDPG) algorithm

Last update: Dec 28, 2022

Overview

Multi-Agent-Deep-Deterministic-Policy-Gradients

A Pytorch implementation of the multi agent deep deterministic policy gradients(MADDPG) algorithm

This is my implementation of the algorithm presented in the paper: Multi Agent Actor Critic for Mixed Cooperative-Competitive Environments. You can find this paper here: https://arxiv.org/pdf/1706.02275.pdf

You will need to install the Multi Agent Particle Environment(MAPE), which you can find here: https://github.com/openai/multiagent-particle-envs

Make sure to create a virtual environment with the dependencies for the MAPE, since they are somewhat out of date. I also recommend running this with PyTorch version 1.4.0, as the latest version (1.8) seems to have an issue with an in place operation I use in the calculation of the critic loss.

It's probably easiest to just clone this repo into the same directory as the MAPE, as the main file requires the make_env function from that package.

The video for this tutorial is found here: https://youtu.be/tZTQ6S9PfkE

A Pytorch implementation of the multi agent deep deterministic policy gradients (MADDPG) algorithm

Related tags

Overview

Multi-Agent-Deep-Deterministic-Policy-Gradients

Owner

Phil Tabor

VGG16 model-based classification project about brain tumor detection.

Computationally efficient algorithm that identifies boundary points of a point cloud.

Unified tracking framework with a single appearance model

FedMM: Saddle Point Optimization for Federated Adversarial Domain Adaptation

A time series processing library

A deep learning based semantic search platform that computes similarity scores between provided query and documents

Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network

Example scripts for the detection of lanes using the ultra fast lane detection model in ONNX.

Framework for Spectral Clustering on the Sparse Coefficients of Learned Dictionaries

Ranking Models in Unlabeled New Environments （iccv21）

YoloV5 implemented by TensorFlow2 , with support for training, evaluation and inference.

K-Means Clustering and Hierarchical Clustering Unsupervised Learning Solution in Python3.

Python package for downloading ECMWF reanalysis data and converting it into a time series format.

TCTrack: Temporal Contexts for Aerial Tracking (CVPR2022)

Voice Conversion Using Speech-to-Speech Neuro-Style Transfer

Code and Experiments for ACL-IJCNLP 2021 Paper Mind Your Outliers! Investigating the Negative Impact of Outliers on Active Learning for Visual Question Answering.

Accelerated deep learning R&D

a minimal terminal with python 😎😉

Repositorio oficial del curso IIC2233 Programación Avanzada 🚀✨

VGGVox models for Speaker Identification and Verification trained on the VoxCeleb (1 & 2) datasets