Evolving Neural Networks in JAX

This repository holds code displaying techniques for applying evolutionary network training strategies in JAX. Each script trains a network to solve the same problem: given a sequence of regularly-spaced values on a sine wave, predict the next value. The problem is trivial - the interesting part is intended to be the way in which this is accomplished, by updating network parameters directly and without gradient calculations, in parallel across devices. A lengthy tutorial is included, explaining the ideas and rationale. Much of the code is duplicated between scripts so that readers can run them individually and, if they like, view the differences between files to see what changes in each section.

The evolutionary ideas present here are mainly taken from OpenAI's blog post describing their efforts at scaling evolution strategies (and the associated code.)

tutorial.md

A longform tutorial that explains why I think evolutionary optimization strategies are interesting and some of the JAX techniques that I use to implement them. Individual bits of the code in each of the script files are discussed here.

simple.py

In this file, a very basic evolutionary strategy is implemented, without many optimizations. You can get a grasp here on how some fundamental JAX methods like scan and vmap are used to execute our training routine.

advanced.py

Here, some optimizations that OpenAI made in their code are added to our training routine. The various optimizations are discussed in depth in the article.

parallel.py

In this file, we prepare to scale the network to more than one device and to greater sizes. Vectorization becomes parallelization, and the code is sliced up so that we can calculate our network updates on a single device.

Evolving neural network parameters in JAX.

Related tags

Overview

Evolving Neural Networks in JAX

tutorial.md

simple.py

advanced.py

parallel.py

Owner

Trevor Thackston

Attention-based CNN-LSTM and XGBoost hybrid model for stock prediction

TorchMD-Net provides state-of-the-art graph neural networks and equivariant transformer neural networks potentials for learning molecular potentials

TF Image Segmentation: Image Segmentation framework

A Comparative Framework for Multimodal Recommender Systems

Repositório para arquivos sobre o Módulo 1 do curso Top Coders da Let's Code + Safra

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more

PPO is a very popular Reinforcement Learning algorithm at present.

3D Generative Adversarial Network

GestureSSD CBAM - A gesture recognition web system based on SSD and CBAM, using pytorch, flask and node.js

A SAT-based sudoku solver

PyTorch implementation of saliency map-aided GAN for Auto-demosaic+denosing

A Low Complexity Speech Enhancement Framework for Full-Band Audio (48kHz) based on Deep Filtering.

Train an imgs.ai model on your own dataset

Simple Pose: Rethinking and Improving a Bottom-up Approach for Multi-Person Pose Estimation

The official github repository for Towards Continual Knowledge Learning of Language Models

Code for the paper "Balancing Training for Multilingual Neural Machine Translation, ACL 2020"

On Effective Scheduling of Model-based Reinforcement Learning

Using image super resolution models with vapoursynth and speeding them up with TensorRT

This repository contains the code for "Self-Diagnosis and Self-Debiasing: A Proposal for Reducing Corpus-Based Bias in NLP".

[NeurIPS 2021] Well-tuned Simple Nets Excel on Tabular Datasets