Official repository for: Continuous Control With Ensemble Deep Deterministic Policy Gradients

Last update: Dec 06, 2021

Continuous Control With Ensemble Deep Deterministic Policy Gradients

This repository is the official implementation of Continuous Control With Ensemble Deep Deterministic Policy Gradients.

Requirements

Before installation, please make sure you have the MuJoCo engine set up on your machine. We use mujoco150 to stay comparable with previous benchmarks on the v2 environments. See this issue

To install requirements:

pip install -r requirements.txt

Training

To train the model(s) in the paper, run this command:

python run.py <experiment_specification path>

The logger automatically pauses training and evaluates the current policy every log_every environment interactions. The results are printed to standard output and saved to disk.
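
The periodic evaluation described above can be pictured with a minimal sketch. This is not the code from run.py: the function name train_with_periodic_eval and the train_step and evaluate callbacks are hypothetical stand-ins, and only the log_every parameter comes from this README. The sketch assumes one call to train_step per environment interaction.

```python
from typing import Callable, List, Tuple


def train_with_periodic_eval(
    total_steps: int,
    log_every: int,
    train_step: Callable[[int], None],
    evaluate: Callable[[], float],
) -> List[Tuple[int, float]]:
    """Hypothetical loop: pause every `log_every` interactions to evaluate.

    `train_step` and `evaluate` are placeholders for the agent's real
    update and evaluation routines, which this sketch does not model.
    """
    if log_every <= 0:
        raise ValueError("log_every must be positive")

    history: List[Tuple[int, float]] = []
    for step in range(1, total_steps + 1):
        # One environment interaction (and any learning update tied to it).
        train_step(step)

        # Training is paused here while the current policy is evaluated.
        if step % log_every == 0:
            score = evaluate()
            print(f"step={step} eval_return={score:.2f}")
            history.append((step, score))
    return history


if __name__ == "__main__":
    # Dummy callbacks so the sketch runs without MuJoCo installed.
    counter = {"updates": 0}

    def dummy_train_step(step: int) -> None:
        counter["updates"] += 1

    def dummy_evaluate() -> float:
        return float(counter["updates"])

    train_with_periodic_eval(10, 5, dummy_train_step, dummy_evaluate)
```

With total_steps=10 and log_every=5, the dummy run evaluates twice, after the 5th and the 10th interaction. The real logger also writes its results to disk, which the sketch leaves out.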

We include specifications for our most important experiments.

Path	Description
specs/ed2_on_mujoco.py	Benchmark of our method
specs/sac_on_mujoco.py	Benchmark of our implementation of SAC
specs/sunrise_on_mujoco.py	Benchmark of our implementation of SUNRISE
specs/sop_on_mujoco.py	Benchmark of our implementation of SOP

Results

Our model achieves the following performance on the MuJoCo suite:
