evolvingrl

Supplementary Data for Evolving Reinforcement Learning Algorithms

This dataset contains 1000 loss graphs from two experiments: 500 unique graphs learned from scratch, and 500 unique graphs seeded by the DQN loss.

There are two csv files: from_scratch.csv and dqn_seeded.csv. They have two columns: id and reward. Each file is sorted by reward from highest to lowest. Graph with is visualized in a png file named .png. These graphs are under folders from_scratch_graphs/ and dqn_seeded_graphs/.

Notes on reading the graph:

Input nodes are in green, the output node is in blue.
The directed edges represent the data flow. A red edge represents the 2nd input for a binary operator, and all other edges are in black. Such coloring scheme is necesssary for encoding inputs for non-commutative operators like -, /, etc.
It’s common to have isolated input nodes and intermediate nodes that do not contribute to the final output. We can ignore these nodes.
As an example, Q(s_{t-1}, a_{t-1}) is represented by 5 nodes:
- Q_param → QValueListOp ← s_tm1. This gives Q(s_{t-1}, -).
- QValueListOp → SelectList ← a_{t-1}. This uses a_{t-1} to index into Q(s_{t-1}, -).

Supplementary Data for Evolving Reinforcement Learning Algorithms

Related tags

Overview

evolvingrl

Owner

John Co-Reyes

This is a demo for AAD algorithm.

Dynamic Programming-Join Optimization Algorithm

Implementation of Apriori algorithms via Python

A simple python application to visualize sorting algorithms.

sudoku solver using CSP forward-tracking algorithms.

8-puzzle-solver with UCS, ILS, IDA* algorithm

A raw implementation of the nearest insertion algorithm to resolve TSP problems in a TXT format.

marching rectangles algorithm in python with clean code.

Sign data using symmetric-key algorithm encryption.

Minimal pure Python library for working with little-endian list representation of bit strings.

FPE - Format Preserving Encryption with FF3 in Python

Python implementation of Aho-Corasick algorithm for string searching

marching Squares algorithm in python with clean code.

FLIght SCheduling OPTimization - a simple optimization library for flight scheduling and related problems in the discrete domain

The test data, code and detailed description of the AW t-SNE algorithm

Exam Schedule Generator using Genetic Algorithm

Python Sorted Container Types: Sorted List, Sorted Dict, and Sorted Set

🧬 Performant Evolutionary Algorithms For Python with Ray support

This repository explores an implementation of Grover's Algorithm for knights on a chessboard.

This is an implementation of the QuickHull algorithm in Python. I