PyTorch implementation of the ideas presented in the paper Interaction Grounded Learning (IGL)

Last update: Aug 31, 2022

Overview

Interaction Grounded Learning

This repository contains a simple PyTorch implementation of the ideas presented in the paper Interaction Grounded Learning (IGL) from Xie et al., 2021. This repository is also accompanied by a short blog post I wrote on the topic, which is available here.

In IGL, the learner is not given a reward signal by the environment. Instead, it receives a feedback signal that corresponds in some way to the true latent reward. The task is to learn both a policy that optimizes the true reward and a decoder that recovers a proxy reward from the feedback signal.
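To make the setup concrete, here is a minimal sketch of a single IGL interaction on a toy digit task, written in plain Python rather than PyTorch. It is not the code from the notebook. The names `igl_step` and `decode_reward`, the Gaussian feedback model, and the fixed threshold are all assumptions made for illustration. In particular, the decoder below is hand-set only to show its role; in IGL it has to be learned alongside the policy.

```python
import random

NUM_ACTIONS = 10  # assumption: one action per digit class


def igl_step(true_label, action, rng):
    """Run one toy IGL interaction.

    Returns (feedback, latent_reward). A real learner would only ever
    observe `feedback`; `latent_reward` is returned here for inspection.
    """
    latent_reward = 1 if action == true_label else 0
    # Assumed feedback model: a noisy scalar whose distribution depends
    # only on the latent reward (a stand-in for a richer signal).
    feedback = rng.gauss(float(latent_reward), 0.1)
    return feedback, latent_reward


def decode_reward(feedback, threshold=0.5):
    """Hypothetical fixed decoder mapping feedback to a proxy reward."""
    return 1 if feedback > threshold else 0


rng = random.Random(0)
steps = 1000
matches = 0
for _ in range(steps):
    label = rng.randrange(NUM_ACTIONS)
    action = rng.randrange(NUM_ACTIONS)  # uniform random policy
    feedback, latent_reward = igl_step(label, action, rng)
    matches += int(decode_reward(feedback) == latent_reward)

# Fraction of steps where the proxy reward equals the latent reward.
agreement = matches / steps
```

The point of the sketch is the information flow: the policy chooses an action, the environment returns only feedback, and the decoder turns that feedback into a proxy reward that can stand in for the reward the learner never sees.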

My implementation differs slightly from that of the original paper, but it converges consistently on the MNIST digit identification task and is robust to hyperparameters and initialization seeds. The performance of the IGL method is comparable to that of a contextual bandit with access to the ground-truth reward.

The code can be found in the Jupyter notebook here.

Requirements

Python 3
PyTorch
TorchVision
Matplotlib (PyPlot)
JupyterLab

Owner

Arthur Juliani
