Multimodal Reinforcement Learning

JAX implementations of the following multimodal reinforcement learning approaches.

Dual-coding Episodic Memory from "Grounded Language Learning Fast and Slow"

In this setting, the agent is first shown several objects with made-up names, each introduced by a statement of the form "This is a _____", and is then given an instruction such as "Move the wazzle to the table." The task requires the agent to learn long-term language and vision representations for things that carry over between episodes, such as the phrase "This is a" and familiar objects like "table", while also forming one-shot representations of novel objects and their names.
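To make the one-shot binding concrete, here is a minimal JAX sketch of an episodic memory that stores (language embedding, vision embedding) pairs and recalls a visual embedding from a language query. It is an illustration only, not the code in this repository: the function names `write` and `read`, the cosine-similarity lookup, and the softmax-weighted top-k readout are all assumptions made for the example.

```python
import jax
import jax.numpy as jnp


def write(memory, lang_emb, vis_emb):
    """Append one (language, vision) pair to the memory (hypothetical helper)."""
    keys, values = memory
    keys = jnp.concatenate([keys, lang_emb[None, :]], axis=0)
    values = jnp.concatenate([values, vis_emb[None, :]], axis=0)
    return keys, values


def read(memory, query, k=1):
    """Recall a vision embedding from a language query (hypothetical helper).

    Assumed scheme: cosine similarity between the query and the stored
    language keys, then a softmax-weighted average of the vision values
    belonging to the k most similar keys.
    """
    keys, values = memory
    keys_n = keys / (jnp.linalg.norm(keys, axis=1, keepdims=True) + 1e-8)
    query_n = query / (jnp.linalg.norm(query) + 1e-8)
    sims = keys_n @ query_n
    top_sims, top_idx = jax.lax.top_k(sims, k)
    weights = jax.nn.softmax(top_sims)
    return weights @ values[top_idx]


# Toy episode: three novel names, each bound to one "seen" object.
lang_dim, vis_dim = 4, 3
memory = (jnp.zeros((0, lang_dim)), jnp.zeros((0, vis_dim)))
names = jnp.eye(lang_dim)[:3]   # stand-ins for embedded made-up names
objects = jnp.eye(vis_dim)      # stand-ins for embedded object views
for name, obj in zip(names, objects):
    memory = write(memory, name, obj)

# Hearing the second name again recalls the second object's embedding.
recalled = read(memory, names[1], k=1)
```

In this toy version the memory is rebuilt from empty at the start of each episode, which is what makes the bindings one-shot; the long-term representations would live in the encoders that produce the embeddings, which are not modelled here.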

Usage

Start by setting up the environment locally:

poetry install
poetry shell

The learning environment depends on Docker; on a Mac, Docker Desktop must be running. Once it is, you can run the default environment (fast mapping with 3 objects, from the paper):

python fast_slow_learning/main.py

Solving reinforcement learning tasks which require language and vision

Owner

Henry Prior
