Guiding evolutionary strategies by (inaccurate) differentiable robot simulators @ NeurIPS, 4th Robot Learning Workshop

Last update: Dec 14, 2021

Overview

Guiding Evolutionary Strategies by Differentiable Robot Simulators

In recent years, Evolutionary Strategies have been actively explored for policy search in robotic tasks, as they provide a simpler alternative to reinforcement learning algorithms. However, this class of algorithms is often claimed to be extremely sample-inefficient. Meanwhile, there is growing interest in Differentiable Robot Simulators (DRS), which can potentially find successful policies with only a handful of trajectories. The resulting gradient, however, is not always useful for first-order optimization. In this work, we demonstrate how the DRS gradient can be used in conjunction with Evolutionary Strategies. Preliminary results suggest that this combination can reduce the sample complexity of Evolutionary Strategies by 3x-5x in both simulation and the real world.
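As a rough illustration of how an inaccurate simulator gradient could steer an evolutionary search, the sketch below follows the Guided ES idea of sampling perturbations that are elongated along a surrogate gradient. It is a minimal NumPy sketch under stated assumptions, not the implementation used in the paper: the quadratic `loss`, the biased `surrogate` vector standing in for a DRS gradient, and all names and hyperparameters (`guided_es_gradient`, `demo`, `sigma`, `alpha`, `beta`, `n_pairs`, `lr`) are hypothetical.

```python
import numpy as np


def guided_es_gradient(loss, x, surrogate_grad, rng, sigma=0.1, alpha=0.5,
                       beta=1.0, n_pairs=10):
    """Antithetic ES gradient estimate with search biased toward surrogate_grad."""
    n = x.size
    norm = np.linalg.norm(surrogate_grad)
    # Unit vector spanning the 1-D guiding subspace (zero if the gradient vanishes).
    u = surrogate_grad / norm if norm > 0 else np.zeros(n)
    grad = np.zeros(n)
    for _ in range(n_pairs):
        # Perturbation covariance: sigma^2 * (alpha / n * I + (1 - alpha) * u u^T).
        eps = sigma * (np.sqrt(alpha / n) * rng.standard_normal(n)
                       + np.sqrt(1.0 - alpha) * rng.standard_normal() * u)
        # Antithetic pair: evaluate the loss at x + eps and x - eps.
        grad += eps * (loss(x + eps) - loss(x - eps))
    return beta / (2.0 * sigma ** 2 * n_pairs) * grad


def demo(steps=50, lr=0.2, seed=0):
    """Minimize a toy quadratic using a deliberately biased surrogate gradient."""
    rng = np.random.default_rng(seed)
    x = rng.standard_normal(20)
    # Assumption: a fixed bias models the error of an inaccurate simulator.
    bias = 0.5 * rng.standard_normal(20)

    def loss(p):
        return 0.5 * float(p @ p)

    first = loss(x)
    for _ in range(steps):
        surrogate = x + bias  # true gradient of the toy loss is x
        x = x - lr * guided_es_gradient(loss, x, surrogate, rng)
    return first, loss(x)


if __name__ == "__main__":
    before, after = demo()
    print(f"loss before: {before:.4f}, after: {after:.4f}")
```

Calling `demo()` returns the loss before and after optimization. On this toy problem the loss should decrease even though the surrogate gradient is biased, since the surrogate only shapes the sampling distribution while the update itself is computed from loss evaluations.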

To appear in the 4th Robot Learning Workshop: Self-Supervised and Lifelong Learning

Paper -- Video -- Poster

Citation

Please use the following BibTeX entry:

@misc{kurenkov2021guiding,
      title={Guiding Evolutionary Strategies by Differentiable Robot Simulators}, 
      author={Vladislav Kurenkov and Bulat Maksudov},
      year={2021},
      eprint={2110.00438},
      archivePrefix={arXiv},
      primaryClass={cs.RO}
}


Owner

Vladislav Kurenkov
