Official code repository for Continual Learning In Environments With Polynomial Mixing Times

Last update: Dec 19, 2021

Related tags

Overview

Official code for Continual Learning In Environments With Polynomial Mixing Times

Continual Learning in Environments with Polynomial Mixing Times

This repository provides official code base for the paper "Continual Learning in Environments with Polynomial Mixing Times"

Basic Setup

Clone this repository and then follow this command

cd polynomial-mixing-times

Create either use a python virtualenv or a conda environment and activate it.

pip install virtualenv
virtualenv -p /usr/bin/python3.7 mixing-times
source mixing-times/bin/activate

To install all the relevant packages use the following command:

pip install -e .

Running the experiments

We provide a running script with all relevant hyperparameters used for both baselines and our proposed model. One can run run_bottleneck.sh to run all the models.

To run the experiments of the proposed models on the Example 2 Bottleneck MDP class with 4 rooms, "random" task evolution and a random seed of 1, use the following command

bash run_bottleneck.sh 1 4 "random"

Available Models

Online Q learning
Q learning with Replay
Q learning w/ Dyna
Model based n-step TD
Vanilla Policy Gradient
Onpolicy rho learning
Off-policy rho learning
rho Policy Gradient

List of Environments

ScaleClass-v0
NBottleneckClass-v0
NCycleClass-v0

System requirements

We used python 3.7 version to run all our experiments.

Official code repository for Continual Learning In Environments With Polynomial Mixing Times

Related tags

Overview

Continual Learning in Environments with Polynomial Mixing Times

Basic Setup

Running the experiments

Available Models

List of Environments

System requirements

Owner

Sharath Raparthy

MMFlow is an open source optical flow toolbox based on PyTorch

Interactive Image Segmentation via Backpropagating Refinement Scheme

DANA paper supplementary materials

The Official PyTorch Implementation of "VAEBM: A Symbiosis between Variational Autoencoders and Energy-based Models" (ICLR 2021 spotlight paper)

Official repository for the paper "Self-Supervised Models are Continual Learners" (CVPR 2022)

Reverse engineer your pytorch vision models, in style

An index of algorithms for learning causality with data

FwordCTF 2021 Infrastructure and Source code of Web/Bash challenges

A clean and scalable template to kickstart your deep learning project 🚀 ⚡ 🔥

Extreme Dynamic Classifier Chains - XGBoost for Multi-label Classification

Official PyTorch implementation of "Preemptive Image Robustification for Protecting Users against Man-in-the-Middle Adversarial Attacks" (AAAI 2022)

(CVPR2021) Kaleido-BERT: Vision-Language Pre-training on Fashion Domain

Image Classification - A research on image classification and auto insurance claim prediction, a systematic experiments on modeling techniques and approaches

HybVIO visual-inertial odometry and SLAM system

Vis2Mesh: Efficient Mesh Reconstruction from Unstructured Point Clouds of Large Scenes with Learned Virtual View Visibility ICCV2021

DUE: End-to-End Document Understanding Benchmark

This is implementation of AlexNet(2012) with 3D Convolution on TensorFlow (AlexNet 3D).

A python-image-classification web application project, written in Python and served through the Flask Microframework. This Project implements the VGG16 covolutional neural network, through Keras and Tensorflow wrappers, to make predictions on uploaded images.

A pytorch-based deep learning framework for multi-modal 2D/3D medical image segmentation

Powerful and efficient Computer Vision Annotation Tool (CVAT)