Reinforcement learning library in JAX.

Last update: Oct 30, 2022

Overview

Magi RL library in JAX

Installation | Agents | Examples | Contributing | Documentation

Magi is a RL library developed on top of Acme.

Note: Magi is in alpha development so expect breaking changes!

Installation

Create a new Python virtual environment

python3 -m venv venv
source venv/bin/activate

Install dependencies and the package in editable mode by running

pip install -U pip setuptools wheel
pip install -r requirements.txt # This uses pinned dependencies, you may adjust this for your needs.
pip install -e .

If for some reason installation fails, first check out GitHub Actions badge to see if this fails on the latest CI run. If the CI is successful, then it's likely that there are some issues to setting up your own environment. Refer to .github/workflows/ci.yaml as the official source for how to set up the environment.

Agents

magi includes popular RL algorithm implementation such as SAC, DrQ, SAC-AE and PETS. Refer to magi/agents for a full list of agents.

Examples

Check out magi/examples where we include examples of using our RL agents on popular benchmark tasks.

Testing

On Linux, you can run tests with

JAX_PLATFORM_NAME=cpu pytest -n `grep -c ^processor /proc/cpuinfo` magi

Contributing

Refer to CONTRIBUTING.md.

Acknowledgements

Magi is inspired by many of the open-source RL projects out there. Here is a (non-exhaustive) list of related libraries and packages that Magi references:

License

Apache License 2.0

Citation

If you use Magi in your work, please cite us according to the CITATION file. You may learn more about the CITATION file from here.

Reinforcement learning library in JAX.

Related tags

Overview

Magi RL library in JAX

Installation

Agents

Examples

Testing

Contributing

Acknowledgements

License

Citation

Owner

Yicheng Luo

The "breathing k-means" algorithm with datasets and example notebooks

[arXiv22] Disentangled Representation Learning for Text-Video Retrieval

Lunar is a neural network aimbot that uses real-time object detection accelerated with CUDA on Nvidia GPUs.

pcnaDeep integrates cutting-edge detection techniques with tracking and cell cycle resolving models.

Julia and Matlab codes to simulated all problems in El-Hachem, McCue and Simpson (2021)

Create animations for the optimization trajectory of neural nets

Spherical Confidence Learning for Face Recognition, accepted to CVPR2021.

Code for EMNLP 2021 main conference paper "Text AutoAugment: Learning Compositional Augmentation Policy for Text Classification"

Code for the tech report Toward Training at ImageNet Scale with Differential Privacy

Alternatives to Deep Neural Networks for Function Approximations in Finance

DEMix Layers for Modular Language Modeling

Learning to Simulate Dynamic Environments with GameGAN (CVPR 2020)

A set of tools for creating and testing machine learning features, with a scikit-learn compatible API

Türkiye Canlı Mobese Görüntülerinde Profesyonel Nesne Takip Sistemi

Official PyTorch implementation of MAAD: A Model and Dataset for Attended Awareness

Largest list of models for Core ML (for iOS 11+)

Saliency - Framework-agnostic implementation for state-of-the-art saliency methods (XRAI, BlurIG, SmoothGrad, and more).

Technical Indicators implemented in Python only using Numpy-Pandas as Magic - Very Very Fast! Very tiny! Stock Market Financial Technical Analysis Python library . Quant Trading automation or cryptocoin exchange

Official Implementation (PyTorch) of "Point Cloud Augmentation with Weighted Local Transformations", ICCV 2021

Learning RGB-D Feature Embeddings for Unseen Object Instance Segmentation