Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

Last update: Jan 07, 2023

Related tags

Overview

Decision Transformer

Lili Chen*, Kevin Lu*, Aravind Rajeswaran, Kimin Lee, Aditya Grover, Michael Laskin, Pieter Abbeel, Aravind Srinivas†, and Igor Mordatch†

*equal contribution, †equal advising

A link to our paper can be found on arXiv.

Overview

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling. Contains scripts to reproduce experiments.

Instructions

We provide code in two sub-directories: atari containing code for Atari experiments and gym containing code for OpenAI Gym experiments. See corresponding READMEs in each folder for instructions; scripts should be run from the respective directories. It may be necessary to add the respective directories to your PYTHONPATH.

Citation

Please cite our paper as:

@article{chen2021decisiontransformer,
  title={Decision Transformer: Reinforcement Learning via Sequence Modeling},
  author={Lili Chen and Kevin Lu and Aravind Rajeswaran and Kimin Lee and Aditya Grover and Michael Laskin and Pieter Abbeel and Aravind Srinivas and Igor Mordatch},
  journal={arXiv preprint arXiv:2106.01345},
  year={2021}
}

Note: this is not an official Google or Facebook product.

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

Related tags

Overview

Decision Transformer

Overview

Instructions

Citation

Owner

Kevin Lu

Public implementation of "Learning from Suboptimal Demonstration via Self-Supervised Reward Regression" from CoRL'21

Multi-tool reverse engineering collaboration solution.

Illuminated3D This project participates in the Nasa Space Apps Challenge 2021.

ADOP: Approximate Differentiable One-Pixel Point Rendering

Faster RCNN pytorch windows

Build Low Code Automated Tensorflow, What-IF explainable models in just 3 lines of code.

Edge Restoration Quality Assessment

Named Entity Recognition with Small Strongly Labeled and Large Weakly Labeled Data

Implementation for paper MLP-Mixer: An all-MLP Architecture for Vision

Code repository of the paper Neural circuit policies enabling auditable autonomy published in Nature Machine Intelligence

Calibrate your listeners! Robust communication-based training for pragmatic speakers. Findings of EMNLP 2021.

Yet Another Robotics and Reinforcement (YARR) learning framework for PyTorch.

Implementation of ML models like Decision tree, Naive Bayes, Logistic Regression and many other

Code + pre-trained models for the paper Keeping Your Eye on the Ball Trajectory Attention in Video Transformers

ML-PersonalWork - Big assignment PersonalWork in Machine Learning, 2021 autumn BUAA.

Face recognize system

Development kit for MIT Scene Parsing Benchmark

Implicit Deep Adaptive Design (iDAD)

Code for visualizing the loss landscape of neural nets

[NeurIPS 2021] Code for Unsupervised Learning of Compositional Energy Concepts