A clean and robust PyTorch implementation of PPO for continuous action spaces.

Last update: Dec 16, 2022

Overview

PPO-Continuous-Pytorch

I found that existing implementations of PPO for continuous action spaces are either somewhat complicated or unstable.
This repository is a clean and robust PyTorch implementation of PPO for continuous action spaces. Here is the result:

All experiments were trained with the same hyperparameters.

Dependencies

gym==0.18.3
box2d==2.3.10
numpy==1.21.2
pytorch==1.8.1

How to use my code

Play with trained model

Run 'python main.py --write False --render True --Loadmodel True --ModelIdex 400'
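
The command above passes booleans as the strings 'True' and 'False'. How main.py parses them is not shown here; the sketch below is only one common way to handle such flags with argparse, using a hypothetical str2bool helper. The flag names are copied from the command, while the defaults and the helper itself are assumptions. A converter like this is needed because argparse's plain type=bool treats any non-empty string, including 'False', as True.

```python
import argparse


def str2bool(value):
    """Convert strings such as 'True' or 'False' into real booleans."""
    if isinstance(value, bool):
        return value
    lowered = value.strip().lower()
    if lowered in ("true", "t", "yes", "y", "1"):
        return True
    if lowered in ("false", "f", "no", "n", "0"):
        return False
    raise argparse.ArgumentTypeError(f"expected a boolean, got {value!r}")


def build_parser():
    # Flag names come from the README command; default values are assumed.
    parser = argparse.ArgumentParser()
    parser.add_argument("--write", type=str2bool, default=True)
    parser.add_argument("--render", type=str2bool, default=False)
    parser.add_argument("--Loadmodel", type=str2bool, default=False)
    parser.add_argument("--ModelIdex", type=int, default=0)
    return parser


# Parse the same arguments as the README command.
args = build_parser().parse_args(
    ["--write", "False", "--render", "True", "--Loadmodel", "True", "--ModelIdex", "400"]
)
print(args.write, args.render, args.Loadmodel, args.ModelIdex)
```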

Train from scratch

Run 'python main.py'. The default environment is Pendulum-v0.

Change Environment

To train on a different environment, run 'python main.py --EnvIdex 0'.
--EnvIdex can be set to an integer from 0 to 5, where
'--EnvIdex 0' for 'BipedalWalker-v3'
'--EnvIdex 1' for 'BipedalWalkerHardcore-v3'
'--EnvIdex 2' for 'LunarLanderContinuous-v2'
'--EnvIdex 3' for 'Pendulum-v0'
'--EnvIdex 4' for 'Humanoid-v2'
'--EnvIdex 5' for 'HalfCheetah-v2'
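
The index-to-environment mapping above can be written as a simple lookup. This is a minimal sketch, not the code in main.py: the names ENV_NAMES and env_name_from_index are hypothetical, and only the environment IDs and their order come from the list above.

```python
# Environment IDs in the order given by --EnvIdex 0..5.
ENV_NAMES = [
    "BipedalWalker-v3",
    "BipedalWalkerHardcore-v3",
    "LunarLanderContinuous-v2",
    "Pendulum-v0",
    "Humanoid-v2",
    "HalfCheetah-v2",
]


def env_name_from_index(env_index):
    """Return the Gym environment ID for a given --EnvIdex value."""
    if not 0 <= env_index < len(ENV_NAMES):
        raise ValueError(
            f"EnvIdex must be between 0 and {len(ENV_NAMES) - 1}, got {env_index}"
        )
    return ENV_NAMES[env_index]


print(env_name_from_index(3))
```

The returned ID would then typically be passed to gym.make. With gym 0.18, the Humanoid-v2 and HalfCheetah-v2 tasks likely need a MuJoCo installation in addition to the dependencies listed above.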

Visualize the training curve

You can use TensorBoard to visualize the training curve. Historical training curves are saved in '\runs'.
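
As a rough sketch, TensorBoard can be pointed at the log directory from Python. The helper names tensorboard_command and launch_tensorboard are hypothetical, and the sketch assumes that the tensorboard executable is installed and on the PATH and that the logs live in a 'runs' folder next to main.py.

```python
import subprocess


def tensorboard_command(logdir="runs", port=6006):
    """Build the command line that serves the logs in `logdir`."""
    return ["tensorboard", "--logdir", logdir, "--port", str(port)]


def launch_tensorboard(logdir="runs", port=6006):
    """Start TensorBoard as a background process and return its handle."""
    return subprocess.Popen(tensorboard_command(logdir, port))


# Show the command without starting the server.
print(" ".join(tensorboard_command()))
```

Running the printed command in a terminal, or calling launch_tensorboard(), should serve the curves at http://localhost:6006 in a browser.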

Hyperparameter Setting

For more details on the hyperparameter settings, see 'main.py'.

Owner

XinJingHao
