PyTorch implementation of Munchausen Reinforcement Learning based on DQN and SAC. Handles discrete and continuous action spaces.

Last update: Mar 10, 2022

Overview

Exploring Munchausen Reinforcement Learning

This is my team's project repository for the "Advanced Deep Learning for Robotics" course at TUM. Our project, "Exploring Munchausen Reinforcement Learning", is based on this paper.

For a detailed discussion, see the report and the final presentation.

Setup

Create a virtual environment.
Run pip3 install -r requirements.txt

Code Structure

This repository is structured as follows:

The directories M-DQN and M-SAC contain the implementations of the RL agents DQN and SAC, respectively, each extended with the Munchausen term.
The directory rl-baselines3-zoo contains a copy of this repository, to which we added our M-DQN implementation so that the M-DQN agent can be trained and tested on benchmark environments and compared with other classical agents. To do so, follow the steps described in the original repository and pass M-DQN as the agent argument.
The directory particles-env contains a modified version of this repository. The modified version contains code for a particles environment in which an agent tries to reach a goal while avoiding obstacles. It also includes an M-SAC agent, so that it can be trained and compared with the classical SAC agent.
The directory action-gap contains callbacks for the rl-baselines3-zoo experiment manager that log the action gap to TensorBoard.
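
For readers unfamiliar with the two quantities mentioned above, the following is a minimal sketch of how a Munchausen-DQN regression target and an action gap could be computed in PyTorch. It is not taken from the M-DQN or action-gap directories. The function names, argument names, and tensor shapes are hypothetical, and the default hyperparameters (alpha, tau, l0, gamma) are illustrative values. The sketch assumes the policy is a softmax over target-network Q-values with temperature tau, and that the action gap is the difference between the largest and second-largest Q-value in a state.

```python
import torch
import torch.nn.functional as F


def munchausen_dqn_target(q_next, q_curr, actions, rewards, dones,
                          gamma=0.99, alpha=0.9, tau=0.03, l0=-1.0):
    """Sketch of an M-DQN target (hypothetical helper, not the repo's code).

    q_next, q_curr: target-network Q-values for s' and s, shape (batch, n_actions).
    actions: taken actions, long tensor of shape (batch,).
    rewards, dones: float tensors of shape (batch,); dones is 1.0 at terminal steps.
    """
    # tau * log pi(.|s) with pi = softmax(q / tau); log_softmax keeps this stable.
    scaled_log_pi_curr = tau * F.log_softmax(q_curr / tau, dim=1)
    scaled_log_pi_next = tau * F.log_softmax(q_next / tau, dim=1)
    pi_next = F.softmax(q_next / tau, dim=1)

    # Munchausen term: scaled log-policy of the taken action, clipped to [l0, 0].
    bonus = scaled_log_pi_curr.gather(1, actions.unsqueeze(1)).squeeze(1)
    bonus = bonus.clamp(min=l0, max=0.0)

    # Entropy-regularised (soft) value of the next state.
    soft_v_next = (pi_next * (q_next - scaled_log_pi_next)).sum(dim=1)

    return rewards + alpha * bonus + gamma * (1.0 - dones) * soft_v_next


def action_gap(q_values):
    """Difference between the best and second-best Q-value per state."""
    top2 = q_values.topk(2, dim=1).values
    return top2[:, 0] - top2[:, 1]
```

Setting alpha to 0 removes the Munchausen term and leaves a soft-DQN target, which makes it straightforward to compare the two variants with the same code path.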

Owner

Mohamed Amine Ketata
