[NeurIPS 2021] Official implementation of paper "Learning to Simulate Self-driven Particles System with Coordinated Policy Optimization".

Last update: Dec 19, 2022

Overview

Code for Coordinated Policy Optimization

Webpage | Code | Paper | Talk (English) | Talk (Chinese)

Hi there! This is the source code of the paper “Learning to Simulate Self-driven Particles System with Coordinated Policy Optimization”.

Please follow the tutorial below to reproduce our results.

Installation

# Create virtual environment
conda create -n copo python=3.7
conda activate copo

# Install dependencies
pip install metadrive-simulator==0.2.3
pip install torch  # Make sure your torch is successfully installed! Especially when using GPU!

# Install environment and algorithm.
cd code
pip install -e .
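
Since the install step above asks you to make sure torch is installed correctly, especially on GPU machines, a quick check from Python can confirm it before you start training. The snippet below is a minimal sketch, not part of this repository: the helper name `check_torch` is hypothetical, and it relies only on `torch.__version__` and `torch.cuda.is_available()`.

```python
import importlib


def check_torch():
    """Report whether torch imports and whether it can see a CUDA device.

    Hypothetical helper for illustration; it is not shipped with this repo.
    """
    try:
        torch = importlib.import_module("torch")
    except ImportError:
        # torch is missing from the active environment.
        return {"installed": False, "version": None, "cuda": False}
    return {
        "installed": True,
        "version": torch.__version__,
        "cuda": bool(torch.cuda.is_available()),
    }


if __name__ == "__main__":
    status = check_torch()
    if not status["installed"]:
        print("torch is not installed in this environment")
    else:
        print("torch", status["version"], "| CUDA available:", status["cuda"])
```

If `cuda` is reported as unavailable on a GPU machine, the installed torch build probably does not match your CUDA setup, and training will fall back to CPU.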

Training

As a quick start, you can train CoPO in the Intersection environment immediately after installation by running:

cd code/copo/
python inter/train_copo_dist.py --exp-name inter_copo_dist

The general form of the training command is:

cd code/copo/
python ENV/train_ALGO.py --exp-name EXPNAME

Here ENV is the shorthand for the environment:

round  # Roundabout
inter  # Intersection
bottle  # Bottleneck
parking  # Parking Lot
tollgate  # Tollgate

and ALGO is the shorthand for algorithms:

ippo  # Individual Policy Optimization
ccppo  # Mean Field Policy Optimization
cl  # Curriculum Learning
copo_dist  # Coordinated Policy Optimization (Ours)
copo_dist_cc  # Coordinated Policy Optimization with Centralized Critics

Finally, EXPNAME is an arbitrary name that identifies the experiment (which may include multiple concurrent trials), such as roundabout_copo.
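
The naming pattern above can be sketched as a small helper that checks the shorthands and assembles the command. This is an illustrative sketch under assumptions: `build_train_command` is a hypothetical name, not part of this repository, the default experiment name `ENV_ALGO` is our own choice, and we have not verified that every ENV/ALGO combination has a matching script, so confirm the file exists before running.

```python
# Shorthands copied from the lists above.
ENVS = {
    "round": "Roundabout",
    "inter": "Intersection",
    "bottle": "Bottleneck",
    "parking": "Parking Lot",
    "tollgate": "Tollgate",
}
ALGOS = {"ippo", "ccppo", "cl", "copo_dist", "copo_dist_cc"}


def build_train_command(env, algo, exp_name=None):
    """Return the argv list for `python ENV/train_ALGO.py --exp-name EXPNAME`.

    Hypothetical helper; the command is meant to be run from code/copo/.
    """
    if env not in ENVS:
        raise ValueError("unknown environment shorthand: %r" % env)
    if algo not in ALGOS:
        raise ValueError("unknown algorithm shorthand: %r" % algo)
    if exp_name is None:
        # Assumed default: combine both shorthands, e.g. inter_copo_dist.
        exp_name = "%s_%s" % (env, algo)
    return ["python", "%s/train_%s.py" % (env, algo), "--exp-name", exp_name]


if __name__ == "__main__":
    # Prints the quick-start command shown earlier.
    print(" ".join(build_train_command("inter", "copo_dist")))
```

The returned list can be passed to `subprocess.run(..., cwd="code/copo")` if you prefer to launch runs from a script instead of the shell.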

Visualization

We provide trained models for all algorithms in all environments. A single command visualizes the behavior of the trained populations:

cd copo
python vis.py 

# By default, this shows the CoPO population in the Intersection environment.
# To see other populations, try:
python vis.py --env round --algo ippo

# Or you can use the native renderer for 3D rendering:
# (Press H to show helper message)
python vis.py --env tollgate --algo cl --use_native_render

We hope you enjoy the interesting behaviors learned in this work! Please feel free to contact us if you have any questions, thanks!

Citation

@misc{peng2021learning,
      title={Learning to Simulate Self-Driven Particles System with Coordinated Policy Optimization}, 
      author={Zhenghao Peng and Quanyi Li and Ka Ming Hui and Chunxiao Liu and Bolei Zhou},
      year={2021},
      eprint={2110.13827},
      archivePrefix={arXiv},
      primaryClass={cs.LG}
}

Owner

DeciForce: Crossroads of Machine Perception and Autonomy
