Anonymous implementation of KSL

Last update: Nov 10, 2021

Related tags

Deep Learning ksl

Overview

k-Step Latent (KSL)

Implementation of k-Step Latent (KSL) in PyTorch.

Representation Learning for Data-Efficient Reinforcement Learning

[Paper]

Code is built on top of the DrQ repo from Denis Yarats.

Getting Started

First, create and activate conda env:

conda env create -f conda_env.yml

conda activate ksl

This repo relies on environments from DMControl, and therefore assumes that you can run MuJoCo.

From within ./ksl, simply run:

python train.py

Altering training schemes can be done by feeding additional args, such as:

python train.py env=cheetah_run lr=2e-4

For a full list of customizable args, see ./ksl/configs.yaml.

Observing Runs

Just as in the DrQ repo, train.py will produce the runs folder, where all the outputs are going to be stored including train/eval logs, tensorboard blobs, and evaluation episode videos. To launch tensorboard run

tensorboard --logdir runs

The console output is also available in a form:

| train | E: 5 | S: 5000 | R: 11.4359 | D: 66.8 s | BR: 0.0581 | ALOSS: -1.0640 | CLOSS: 0.0996 | TLOSS: -23.1683 | TVAL: 0.0945 | AENT: 3.8132

a training entry decodes as

train - training episode
E - total number of episodes
S - total number of environment steps
R - episode return
D - duration in seconds
BR - average reward of a sampled batch
ALOSS - average loss of the actor
CLOSS - average loss of the critic
TLOSS - average loss of the temperature parameter
TVAL - the value of temperature
AENT - the actor's entropy

while an evaluation entry

| eval  | E: 20 | S: 20000 | R: 10.9356

contains

E - evaluation was performed after E episodes
S - evaluation was performed after S environment steps
R - average episode return computed over `num_eval_episodes` (usually 10)

Anonymous implementation of KSL

Related tags

Overview

k-Step Latent (KSL)

Getting Started

Observing Runs

Owner

Official repository of the paper Privacy-friendly Synthetic Data for the Development of Face Morphing Attack Detectors

Relaxed-machines - explorations in neuro-symbolic differentiable interpreters

Here I will explain the flow to deploy your custom deep learning models on Ultra96V2.

A Domain-Agnostic Benchmark for Self-Supervised Learning

KITTI-360 Annotation Tool is a framework that developed based on python(cherrypy + jinja2 + sqlite3) as the server end and javascript + WebGL as the front end.

Replication attempt for the Protein Folding Model

VolumeGAN - 3D-aware Image Synthesis via Learning Structural and Textural Representations

The 2nd place solution of 2021 google landmark retrieval on kaggle.

PyTorch code for ICLR 2021 paper Unbiased Teacher for Semi-Supervised Object Detection

This repo in the implementation of EMNLP'21 paper "SPARQLing Database Queries from Intermediate Question Decompositions" by Irina Saparina, Anton Osokin

PyTorch Implementation of the paper Learning to Reweight Examples for Robust Deep Learning

Code for "R-GCN: The R Could Stand for Random"

[AI6122] Text Data Management & Processing

Object Database for Super Mario Galaxy 1/2.

Code to train models from "Paraphrastic Representations at Scale".

A mini library for Policy Gradients with Parameter-based Exploration, with reference implementation of the ClipUp optimizer from NNAISENSE.

Code reproduce for paper "Vehicle Re-identification with Viewpoint-aware Metric Learning"

Virtual hand gesture mouse using a webcam

Differentiable Neural Computers, Sparse Access Memory and Sparse Differentiable Neural Computers, for Pytorch

Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting