Cooperative multi-agent reinforcement learning for high-dimensional nonequilibrium control

Last update: Nov 16, 2021

Official implementation of:

Cooperative multi-agent reinforcement learning for high-dimensional nonequilibrium control

Shriram Chennakesavalu and Grant M. Rotskoff

https://arxiv.org/abs/2111.06875

Abstract: Experimental advances enabling high-resolution external control create new opportunities to produce materials with exotic properties. In this work, we investigate how a multi-agent reinforcement learning approach can be used to design external control protocols for self-assembly. We find that a fully decentralized approach performs remarkably well even with a "coarse" level of external control. More importantly, we see that a partially decentralized approach, where we include information about the local environment, allows us to better control our system towards some target distribution. We explain this by analyzing our approach as a partially-observed Markov decision process. With a partially decentralized approach, the agent is able to act more presciently, both by preventing the formation of undesirable structures and by better stabilizing target structures as compared to a fully decentralized approach.

Installing prerequisites (using conda)

conda env create -f environment.yml -n marldesign
conda activate marldesign

Possible --centralize_approach values are ("plaquette", "all", "grid_n"), where 1 < n < region_num/2

Sample training commands

python train.py --active --centralize_states --centralize_approach plaquette
python train.py --active --centralize_rewards --centralize_approach all
python train.py --centralize_rewards --centralize_states --centralize_approach grid_1

Sample testing commands

python test.py --active --num_samples 10 --centralize_states --centralize_approach plaquette
python test.py --active --num_samples 10 --centralize_rewards --centralize_approach grid_1
python test.py --centralize_rewards --num_samples 10 --centralize_states --centralize_approach grid_2
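
When running several of the sample training and testing configurations above in sequence, it can help to assemble the command lines programmatically. The following is a minimal sketch, not part of the repository: the helper names build_command and run are hypothetical, and it uses only the flags that appear in the sample commands (--active, --num_samples, --centralize_states, --centralize_rewards, --centralize_approach). It assumes train.py and test.py are in the current working directory and that the marldesign environment is active.

```python
import subprocess
import sys


def build_command(script, approach, active=False, centralize_states=False,
                  centralize_rewards=False, num_samples=None):
    """Return the argument list for one train.py or test.py invocation.

    Hypothetical helper: it only mirrors the flags shown in the sample commands.
    """
    # sys.executable keeps the call inside the currently active environment.
    cmd = [sys.executable, script]
    if active:
        cmd.append("--active")
    if num_samples is not None:
        cmd += ["--num_samples", str(num_samples)]
    if centralize_states:
        cmd.append("--centralize_states")
    if centralize_rewards:
        cmd.append("--centralize_rewards")
    cmd += ["--centralize_approach", approach]
    return cmd


def run(cmd, dry_run=True):
    """Print the command; execute it only when dry_run is False."""
    print(" ".join(cmd))
    if not dry_run:
        # check=True raises CalledProcessError if the script exits with an error.
        subprocess.run(cmd, check=True)


if __name__ == "__main__":
    # Mirrors the first sample training command and the first sample testing command.
    run(build_command("train.py", "plaquette", active=True, centralize_states=True))
    run(build_command("test.py", "plaquette", active=True, centralize_states=True,
                      num_samples=10))
```

By default the sketch only prints each command; pass dry_run=False to run to actually launch the script.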

For a more theoretical description of the systems described here, please visit https://github.com/rotskoff-group/dissipative-design
