Reinforcement learning library(framework) designed for PyTorch, implements DQN, DDPG, A2C, PPO, SAC, MADDPG, A3C, APEX, IMPALA ...

Last update: Dec 24, 2022

Overview

Automatic, Readable, Reusable, Extendable

Machin is a reinforcement library designed for pytorch.

Build status

Platform	Status
Linux
Windows

Supported Models

Anything, including recurrent networks.

Supported algorithms

Currently Machin has implemented the following algorithms, the list is still growing:

Single agent algorithms:

Multi-agent algorithms:

Multi-agent DDPG (MADDPG)

Immitation learning algorithms (Behavioral Cloning, Inverse RL, GAIL)

Generative Adversarial Imitation Learning (GAIL)

Massively parallel algorithms:

Enhancements:

Algorithms to be supported:

Evolution Strategies
QMIX (multi agent)
Model-based methods

Features

1. Automatic

Starting from version 0.4.0, Machin now supports automatic config generation, you can get a configuration through:

python -m machin.auto generate --algo DQN --env openai_gym --output config.json

And automatically launch the experiment with pytorch lightning:

python -m machin.auto launch --config config.json

2. Readable

Compared to other reinforcement learning libraries such as the famous rlpyt, ray, and baselines. Machin tries to just provide a simple, clear implementation of RL algorithms.

All algorithms in Machin are designed with minimial abstractions and have very detailed documents, as well as various helpful tutorials.

3. Reusable

Machin takes a similar approach to that of pytorch, encasulating algorithms, data structures in their own classes. Users do not need to setup a series of data collectors, trainers, runners, samplers... to use them, just import.

The only restriction placed on your models is their input / output format, however, these restrictions are minimal, making it easy to adapt algorithms to your custom environments.

4. Extendable

Machin is built upon pytorch, it and thanks to its powerful rpc api, we may construct complex distributed programs. Machin provides implementations for enhanced parallel execution pools, automatic model assignment, role based rpc scaling, rpc service discovery and registration, etc.

Upon these core functions, Machin is able to provide tested high-performance distributed training algorithm implementations, such as A3C, APEX, IMPALA, to ease your design.

5. Reproducible

Machin is weakly reproducible, for each release, our test framework will directly train every RL framework, if any framework cannot reach the target score, the test will fail directly.

However, currently, the tests are not guaranteed to be exactly the same as the tests in original papers, due to the large variety of different environments used in original research papers.

Documentation

See here. Examples are located in examples.

Installation

Machin is hosted on PyPI. Python >= 3.6 and PyTorch >= 1.6.0 is required. You may install the Machin library by simply typing:

pip install machin

You are suggested to create a virtual environment first if you are using conda to manage your environments, to prevent PIP changes your packages without letting conda know.

conda create -n some_env pip
conda activate some_env
pip install machin

Note: Currently only a fraction of all functions is supported on platforms other than linux (mainly distributed algorithms), to test whether the code is running correctly, you can run the corresponding test script for your platform in the root directory:

run_win_test.bat
run_linux_test.sh
run_macos_test.sh

Some errors may occur due to incorrect setup of libraries, make sure you have installed graphviz etc.

Contributing

Any contribution would be great, don't hesitate to submit a PR request to us! Please follow the instructions in this file.

Issues

If you have any issues, please use the template markdown files in .github/ISSUE_TEMPLATE folder and format your issue before opening a new one. We would try our best to respond to your feature requests and problems.

Citing

We would be very grateful if you can cite our work in your publications:

@misc{machin,
  author = {Muhan Li},
  title = {Machin},
  year = {2020},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/iffiX/machin}},
}

Roadmap

Please see Roadmap for the exciting new features we are currently working on!

Reinforcement learning library(framework) designed for PyTorch, implements DQN, DDPG, A2C, PPO, SAC, MADDPG, A3C, APEX, IMPALA ...

Related tags

Overview

Build status

Supported Models

Supported algorithms

Single agent algorithms:

Multi-agent algorithms:

Immitation learning algorithms (Behavioral Cloning, Inverse RL, GAIL)

Massively parallel algorithms:

Enhancements:

Algorithms to be supported:

Features

1. Automatic

2. Readable

3. Reusable

4. Extendable

5. Reproducible

Documentation

Installation

Contributing

Issues

Citing

Roadmap

Owner

Iffi

pytorch implementation of ABC : Auxiliary Balanced Classifier for Class-imbalanced Semi-supervised Learning

We have implemented shaDow-GNN as a general and powerful pipeline for graph representation learning. For more details, please find our paper titled Deep Graph Neural Networks with Shallow Subgraph Samplers, available on arXiv (https//arxiv.org/abs/2012.01380).

Source code and data in paper "MDFEND: Multi-domain Fake News Detection (CIKM'21)"

Lipstick ain't enough: Beyond Color-Matching for In-the-Wild Makeup Transfer (CVPR 2021)

Active Offline Policy Selection With Python

Sematic-Segmantation - Semantic Segmentation on MIT ADE20K dataset in PyTorch

This repo generates the training data and the model for Morpheus-Deblend

[ACMMM 2021, Oral] Code release for "Elastic Tactile Simulation Towards Tactile-Visual Perception"

SNIPS: Solving Noisy Inverse Problems Stochastically

I-BERT: Integer-only BERT Quantization

When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Dataset of 53,000+ Legal Holdings

Kaggle Ultrasound Nerve Segmentation competition [Keras]

An adaptive hierarchical energy management strategy for hybrid electric vehicles

The repo of the preprinting paper "Labels Are Not Perfect: Inferring Spatial Uncertainty in Object Detection"

Code for Talk-to-Edit (ICCV2021). Paper: Talk-to-Edit: Fine-Grained Facial Editing via Dialog.

A general and strong 3D object detection codebase that supports more methods, datasets and tools (debugging, recording and analysis).

AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.

QI-Q RoboMaster2022 CV Algorithm

PyTorch implementation of GLOM

Code for NAACL 2021 full paper "Efficient Attentions for Long Document Summarization"