PyTorch implementation of the supervised learning experiments from the paper Model-Agnostic Meta-Learning (MAML)

Last update: Jan 05, 2023

Related tags

Overview

pytorch-maml

This is a PyTorch implementation of the supervised learning experiments from the paper Model-Agnostic Meta-Learning (MAML): https://arxiv.org/abs/1703.03400

Important: You will need the latest version of PyTorch, v.0.2.0 to run this code (otherwise you will get errors about double backwards not being supported).

Currently, only the Omniglot experiments have been replicated here. The hyper-parameters are the same as those used in the original Tensorflow implementation, except that only 1 random seed is used here.

5-way 1-shot training, best performance 98.9%

20-way 1-shot training, best performance 92%

Note: the 20-way performance is slightly lower than that reported in the paper (they report 95.8%). If you can see why this might be, please let me know. Also in this experiment, we can see evidence of overfitting to the meta-training set.

The 5-way results are achieved by simply meta-testing the network trained on the 1-shot task on the 5-shot task (e.g. for the 5-way 5-shot result, test the 5-way 1-shot trained network with 5-shots). Again the 20-way result is lower here than reported in the paper.

This repo also contains code for running maml experiments on permuted MNIST (tasks are created by shuffling the labels). This is a nice sanity check task.

license

This software is distributed under the MIT license.

to-do

port to pytorch 0.4 from 0.2 and python 3 from 2
investigate performance difference from TF version
add first-order version

PyTorch implementation of the supervised learning experiments from the paper Model-Agnostic Meta-Learning (MAML)

Related tags

Overview

pytorch-maml

license

to-do

Owner

Kate Rakelly

Repo for code associated with Modeling the Mitral Valve.

Repository for publicly available deep learning models developed in Rosetta community

Element selection for functional materials discovery by integrated machine learning of atomic contributions to properties

GT4SD, an open-source library to accelerate hypothesis generation in the scientific discovery process.

Pytorch implementation of AngularGrad: A New Optimization Technique for Angular Convergence of Convolutional Neural Networks

[IROS2021] NYU-VPR: Long-Term Visual Place Recognition Benchmark with View Direction and Data Anonymization Influences

Implementation of TabTransformer, attention network for tabular data, in Pytorch

PPO is a very popular Reinforcement Learning algorithm at present.

This is an official implementation for "Exploiting Temporal Contexts with Strided Transformer for 3D Human Pose Estimation".

Code for "Unsupervised State Representation Learning in Atari"

PyGRANSO: A PyTorch-enabled port of GRANSO with auto-differentiation

Running Google MoveNet Multipose Tracking models on OpenVINO.

Multiple custom object count and detection using YOLOv3-Tiny method

face2comics by Sxela (Alex Spirin) - face2comics datasets

working repo for my xumx-sliCQ submissions to the ISMIR 2021 MDX

Code for the SIGGRAPH 2021 paper "Consistent Depth of Moving Objects in Video".

Much faster than SORT(Simple Online and Realtime Tracking), a little worse than SORT

UnFlow: Unsupervised Learning of Optical Flow with a Bidirectional Census Loss

Tutorial to set up TensorFlow Object Detection API on the Raspberry Pi

TensorFlow Tutorials with YouTube Videos