PyTorch implementation of the paper StARformer: Transformer with State-Action-Reward Representations.

Last update: Dec 09, 2022

Overview

StARformer

This repository contains the PyTorch implementation of our paper, StARformer: Transformer with State-Action-Reward Representations. StARformer learns local State-Action-Reward representations (StAR-representations) to improve long-sequence modeling for reinforcement learning and imitation learning.

Results

Installation

Dependencies can be installed with Conda:

conda env create -f my_env.yml

Then install the Atari ROMs.

Datasets

Please follow these instructions for the datasets.

Example usage

See run.sh or below:

python run_star_atari.py --seed 123 --data_dir_prefix [data_directory] --epochs 10 --num_steps 500000 --num_buffers 50 --batch_size 64 --seq_len 30 --model_type 'star' --game 'Breakout'

[data_directory] is where you place the Atari dataset.

Variants (`model_type`):

'star' (imitation)
'star_rwd' (offline RL)
'star_fusion' (see Figure 4a in our paper)
'star_stack' (see Figure 4b in our paper)
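To try several variants without retyping the command, it can be assembled in Python. The following is a minimal sketch, not part of this repository: the helper name `build_command` is hypothetical, and the flag values are simply copied from the example command above. It only builds and prints the commands; it does not launch training.

```python
import shlex

# Variant names copied from the list above.
MODEL_TYPES = ("star", "star_rwd", "star_fusion", "star_stack")


def build_command(data_dir, model_type="star", game="Breakout", seed=123):
    """Return the argument list for one run_star_atari.py invocation.

    Hypothetical helper: defaults mirror the example command in this README.
    """
    if model_type not in MODEL_TYPES:
        raise ValueError(f"unknown model_type: {model_type!r}")
    return [
        "python", "run_star_atari.py",
        "--seed", str(seed),
        "--data_dir_prefix", data_dir,
        "--epochs", "10",
        "--num_steps", "500000",
        "--num_buffers", "50",
        "--batch_size", "64",
        "--seq_len", "30",
        "--model_type", model_type,
        "--game", game,
    ]


if __name__ == "__main__":
    # Print one shell-quoted command per variant; replace the placeholder
    # with the directory that holds the Atari dataset.
    for model_type in MODEL_TYPES:
        print(shlex.join(build_command("[data_directory]", model_type)))
```

Each returned list can be passed to `subprocess.run(cmd, check=True)` from the repository root to start a run. Rejecting unknown `model_type` values up front avoids launching a long job with a misspelled variant.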

Acknowledgement

This code is based on Decision-Transformer.

Owner

Jinghuan Shang
