Implementation of H-UCRL Algorithm

This repository is an implementation of the H-UCRL algorithm introduced in Curi, S., Berkenkamp, F., & Krause, A. (2020). Efficient Model-Based Reinforcement Learning through Optimistic Policy Search and Planning.

To install create a conda environment:

$ conda create -n hucrl python=3.7
$ conda activate hucrl

$ pip install -e .[test,logging,experiments]

For Mujoco (license required) Run:

$ pip install -e .[mujoco]

Running an experiment.

For the inverted pendulum experiment run

$ python exps/inverted_pendulum/run.py

For the mujoco (license required) experiment run

$ python exps/mujoco/run.py --environment ENV_NAME --agent AGENT_NAME --action

We support MBHalfCheetah-v0, MBPusher-v0, MBReacher-v0, MBAnt-v0, MBCartPole-v0, MBHopper-v0, MBInvertedDoublePendulum-v0, MBInvertedPendulum-v0, MBReacher-v0, MBReacher3D-v0, MBSwimmer-v0, MBWalker2d-v0

Citing H-UCRL

If you this repo for your research please use the following BibTeX entry:

@article{curi2020efficient,
  title={Efficient model-based reinforcement learning through optimistic policy search and planning},
  author={Curi, Sebastian and Berkenkamp, Felix and Krause, Andreas},
  journal={Advances in Neural Information Processing Systems},
  volume={33},
  year={2020}
}

Implementation of H-UCRL Algorithm

Related tags

Overview

Implementation of H-UCRL Algorithm

Running an experiment.

Citing H-UCRL

Owner

Sebastian Curi

FastyAPI is a Stack boilerplate optimised for heavy loads.

This code provides various models combining dilated convolutions with residual networks

ONNX-PackNet-SfM: Python scripts for performing monocular depth estimation using the PackNet-SfM model in ONNX

Fast and Simple Neural Vocoder, the Multiband RNNMS

Rocket-recycling with Reinforcement Learning

PointRCNN: 3D Object Proposal Generation and Detection from Point Cloud, CVPR 2019.

This is an official pytorch implementation of Lite-HRNet: A Lightweight High-Resolution Network.

EZ graph is an easy to use AI solution that allows you to make and train your neural networks without a single line of code.

Code for the paper: On Pathologies in KL-Regularized Reinforcement Learning from Expert Demonstrations

Implementation of the state-of-the-art vision transformers with tensorflow

Easy-to-use library to boost AI inference leveraging state-of-the-art optimization techniques.

MixRNet(Using mixup as regularization and tuning hyper-parameters for ResNets)

Adds timm pretrained backbone to pytorch's FasterRcnn model

Official PyTorch implementation for paper Context Matters: Graph-based Self-supervised Representation Learning for Medical Images

Introduction to Statistics and Basics of Mathematics for Data Science - The Hacker's Way

From a body shape, infer the anatomic skeleton.

CVPR2021: Temporal Context Aggregation Network for Temporal Action Proposal Refinement

PyAF is an Open Source Python library for Automatic Time Series Forecasting built on top of popular pydata modules.

TimeSHAP explains Recurrent Neural Network predictions.

[IROS'21] SurRoL: An Open-source Reinforcement Learning Centered and dVRK Compatible Platform for Surgical Robot Learning