AAAI-22 paper: SimSR: Simple Distance-based State Representationfor Deep Reinforcement Learning

Last update: Dec 19, 2022

Related tags

Overview

SimSR

Code and dataset for the paper SimSR: Simple Distance-based State Representationfor Deep Reinforcement Learning (AAAI-22).

Requirements

We assume you have access to a gpu that can run CUDA 11. All of the dependencies are in the conda_env.yml file.

conda env create -f conda_env.yml

After the instalation ends you can activate your environment with

conda activate simsr

Instructions

To train a SimSR agent on the cartpole swingup task from image-based observations run bash run.sh from the root of this directory. The run.sh file contains the following command, which you can modify to try different environments / hyperparamters.

DOMAIN=cartpole
TASK=swingup
SEED=1

MUJOCO_GL="egl" CUDA_VISIBLE_DEVICES=0 nohup python -u train.py \
	--domain_name ${DOMAIN} \
	--task_name ${TASK} \
	--encoder_type pixel \
	--action_repeat 4 \
	--pre_transform_image_size 84 \
	--image_size 84 \
	--work_dir ./tmp \
	--agent simsr_sac \
	--frame_stack 3\
	--seed ${SEED} --critic_lr 1e-3 \
	--actor_lr 1e-3 \
	--eval_freq 10000 \
	--batch_size 128 \
	--num_train_steps 260000 > ${DOMAIN}_${TASK}_${SEED}.log &

Note that the MuJoCo Python bindings support three different OpenGL rendering backends: "glfw", "egl", or "osmesa". You can also specify a particular backend to use by setting the MUJOCO_GL= environment variable to one of them.

To visualize progress with tensorboard run:

tensorboard --logdir ./path/to/your/log --port 6006

References

Please cite the paper SimSR: Simple Distance-based State Representationfor Deep Reinforcement Learning if you found the resources in the repository useful.

AAAI-22 paper: SimSR: Simple Distance-based State Representationfor Deep Reinforcement Learning

Related tags

Overview

SimSR

Requirements

Instructions

References

Owner

Code for KDD'20 "Generative Pre-Training of Graph Neural Networks"

🗣️ Microsoft Edge TTS for Home Assistant, no need for app_key

Unofficial Implementation of MLP-Mixer in TensorFlow

StyleGAN2 Webtoon / Anime Style Toonify

E-Ink Magic Calendar that automatically syncs to Google Calendar and runs off a battery powered Raspberry Pi Zero

Pytorch Implementation of LNSNet for Superpixel Segmentation

FairMOT for Multi-Class MOT using YOLOX as Detector

Code repository for the work "Multi-Domain Incremental Learning for Semantic Segmentation", accepted at WACV 2022

This toolkit provides codes to download and pre-process the SLUE datasets, train the baseline models, and evaluate SLUE tasks.

PyTorch Code for NeurIPS 2021 paper Anti-Backdoor Learning: Training Clean Models on Poisoned Data.

Experiments for Fake News explainability project

Codes for the compilation and visualization examples to the HIF vegetation dataset

Fuse radar and camera for detection

Pytorch implementation of MaskFlownet

CoRe: Contrastive Recurrent State-Space Models

Optimizers-visualized - Visualization of different optimizers on local minimas and saddle points.

Official Implementation of "Learning Disentangled Behavior Embeddings"

BBScan py3 - BBScan py3 With Python

CLIP+FFT text-to-image

Age and Gender prediction using Keras