TRFL

TRFL (pronounced "truffle") is a library built on top of TensorFlow that exposes several useful building blocks for implementing Reinforcement Learning agents.

Installation

TRFL can be installed from pip with the following command:

pip install trfl

TRFL works with both the CPU and GPU versions of TensorFlow. To allow for that, it does not list TensorFlow as a requirement, so you need to install TensorFlow and TensorFlow Probability separately if you haven't already done so.

Usage Example

import tensorflow as tf
import trfl

# Q-values for the previous and next timesteps, shape [batch_size, num_actions].
q_tm1 = tf.get_variable(
    "q_tm1", initializer=[[1., 1., 0.], [1., 2., 0.]], dtype=tf.float32)
q_t = tf.get_variable(
    "q_t", initializer=[[0., 1., 0.], [1., 2., 0.]], dtype=tf.float32)

# Action indices, rewards and discounts, shape [batch_size].
a_tm1 = tf.constant([0, 1], dtype=tf.int32)
r_t = tf.constant([1, 1], dtype=tf.float32)
pcont_t = tf.constant([0, 1], dtype=tf.float32)  # the discount factor

# Q-learning loss, and auxiliary data.
loss, q_learning = trfl.qlearning(q_tm1, a_tm1, r_t, pcont_t, q_t)

Here, loss is a tensor of shape [batch_size] holding the loss for each element of the batch. For Q-learning, it is half the squared difference between the predicted Q-value of the action taken and the TD target. Extra information is in the q_learning namedtuple, including q_learning.td_error and q_learning.target.
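To make these numbers concrete, here is a minimal pure-Python sketch that reproduces the arithmetic for the example batch above. It is not TRFL code: `qlearning_reference` and `QExtra` are hypothetical names used only for illustration, and the sketch assumes the standard Q-learning target (reward plus discount times the maximum next-step Q-value).

```python
from collections import namedtuple

# Hypothetical stand-in for the namedtuple returned alongside the loss.
QExtra = namedtuple("QExtra", ["target", "td_error"])


def qlearning_reference(q_tm1, a_tm1, r_t, pcont_t, q_t):
    """Pure-Python Q-learning loss for one batch (illustrative only)."""
    targets, td_errors, losses = [], [], []
    for q_prev, action, reward, pcont, q_next in zip(
            q_tm1, a_tm1, r_t, pcont_t, q_t):
        target = reward + pcont * max(q_next)  # TD target
        td_error = target - q_prev[action]     # target minus predicted Q-value
        targets.append(target)
        td_errors.append(td_error)
        losses.append(0.5 * td_error ** 2)     # half the squared TD error
    return losses, QExtra(target=targets, td_error=td_errors)


loss, extra = qlearning_reference(
    q_tm1=[[1., 1., 0.], [1., 2., 0.]],
    a_tm1=[0, 1],
    r_t=[1., 1.],
    pcont_t=[0., 1.],
    q_t=[[0., 1., 0.], [1., 2., 0.]],
)
print(loss)            # [0.0, 0.5]
print(extra.target)    # [1.0, 3.0]
print(extra.td_error)  # [0.0, 1.0]
```

In the first batch element the discount is 0, so the target is just the reward and the prediction is already correct. In the second, the target is 1 + 1 * 2 = 3 against a prediction of 2, giving a TD error of 1 and a loss of 0.5.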

The loss tensor can be differentiated to derive the corresponding RL update.

reduced_loss = tf.reduce_mean(loss)
optimizer = tf.train.AdamOptimizer(learning_rate=0.1)
train_op = optimizer.minimize(reduced_loss)
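As a rough illustration of what differentiating the loss yields, the sketch below applies one hand-written gradient step to the example batch. It makes two assumptions that are not taken from the text above: the TD target is treated as a constant when differentiating, and plain gradient descent is used in place of Adam to keep the arithmetic easy to follow. `sgd_step` is a hypothetical helper, not part of TRFL or TensorFlow.

```python
def sgd_step(q_tm1, a_tm1, td_error, learning_rate):
    """One plain gradient-descent step on mean(0.5 * td_error**2).

    Assumes the TD target is held constant, so that
    d(loss_i) / d(q_tm1[i][a_i]) = -td_error_i.
    """
    batch_size = len(q_tm1)
    updated = [list(row) for row in q_tm1]  # copy; leave the input untouched
    for i, (action, delta) in enumerate(zip(a_tm1, td_error)):
        grad = -delta / batch_size          # gradient of the mean loss
        updated[i][action] -= learning_rate * grad
    return updated


new_q_tm1 = sgd_step(
    q_tm1=[[1., 1., 0.], [1., 2., 0.]],
    a_tm1=[0, 1],
    td_error=[0., 1.],  # TD errors for the example batch
    learning_rate=0.1,
)
print(new_q_tm1)
```

Only the Q-value of the action actually taken changes: the first row is untouched because its TD error is 0, while the chosen entry of the second row moves from 2.0 towards the target of 3.0 (to roughly 2.05 with these settings).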

All loss functions in the package return both a loss tensor and a namedtuple with extra information, using the above convention, but different functions may have different extra fields. Check the documentation of each function below for more information.

Documentation

Check out the full documentation page here.

TensorFlow Reinforcement Learning

Owner

DeepMind
