Sample Code for "Pessimism Meets Invariance: Provably Efficient Offline Mean-Field Multi-Agent RL"

Last update: Sep 19, 2022

This is the official codebase for "Pessimism Meets Invariance: Provably Efficient Offline Mean-Field Multi-Agent RL". It provides a sample implementation of SAFARI on the cooperative navigation environment. This specific repository is untested; however, many of the files exactly match the code used to run the experiments in the paper. For the SAFARI implementation, refer to agents/safari.py.

Requirements

To install requirements, run:

pip install -r requirements.txt

Some of the dependencies listed in requirements.txt may be unused; however, every dependency that the code needs is included there.

Run

To start a training run of SAFARI, first add a dataset to the data/ folder. Then run:

python main.py safari

This starts the script from the entry point, main.py.

Data Format

SAFARI expects a dataset to be present at data/ / for each parallel seed that is run. Three files are expected:

actions.txt (Shape: [N, H])
rewards.txt (Shape: [N, H])
obs.txt (Shape: [N, H, O])

In each file, every line is one episodic trajectory. To produce the files, we (1) convert each buffer into a list, (2) cast each trajectory to str, and (3) print each trajectory on a separate line of the file.
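The three steps above can be sketched as follows. This is a minimal illustration, not the repository's own code. The helper names `write_buffer` and `read_buffer`, the toy data, and the use of `ast.literal_eval` for reading are all assumptions; the loader in this codebase may parse the files differently.

```python
import ast
import os
import tempfile


def write_buffer(path, trajectories):
    """Write one trajectory per line: list -> str -> one line of the file.

    Hypothetical helper. For NumPy buffers, call .tolist() first so that
    str() produces plain nested Python lists.
    """
    with open(path, "w") as f:
        for traj in trajectories:
            f.write(str(list(traj)) + "\n")


def read_buffer(path):
    """Parse each non-empty line back into a Python list (assumed reader)."""
    with open(path) as f:
        return [ast.literal_eval(line) for line in f if line.strip()]


# Toy data, made up for illustration: 2 trajectories, 3 steps each,
# with 2-dimensional observations.
actions = [[0, 1, 2], [2, 1, 0]]
rewards = [[0.0, -1.0, 0.5], [1.0, 0.0, -0.5]]
obs = [
    [[0.1, 0.2], [0.3, 0.4], [0.5, 0.6]],
    [[0.6, 0.5], [0.4, 0.3], [0.2, 0.1]],
]

# A temporary directory stands in for the real dataset folder.
out_dir = tempfile.mkdtemp()
for name, buf in [("actions.txt", actions), ("rewards.txt", rewards), ("obs.txt", obs)]:
    write_buffer(os.path.join(out_dir, name), buf)

loaded_actions = read_buffer(os.path.join(out_dir, "actions.txt"))
loaded_rewards = read_buffer(os.path.join(out_dir, "rewards.txt"))
loaded_obs = read_buffer(os.path.join(out_dir, "obs.txt"))
```

Writing each trajectory with `str` keeps the files human-readable, and `ast.literal_eval` parses the lines back without executing arbitrary code.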
