Unofficial PyTorch implementation of TokenLearner by Google AI

Last update: Dec 20, 2022

Related tags

Deep Learning tokenlearner-pytorch

Overview

tokenlearner-pytorch

Unofficial PyTorch implementation of TokenLearner by Ryoo et al. from Google AI (abs, pdf)

Installation

You can install TokenLearner via pip:

pip install tokenlearner-pytorch

Usage

You can access the TokenLearner class from the tokenlearner_pytorch package. You can use this layer with a Vision Transformer, MLPMixer, or Video Vision Transformer as done in the paper.

import torch
from tokenlearner_pytorch import TokenLearner

tklr = TokenLearner(S=8)
x = torch.rand(512, 32, 32, 3)
y = tklr(x) # [512, 8, 3]

You can also use TokenLearner and TokenFuser together with Multi-head Self-Attention as done in the paper:

import torch
import torch.nn as nn
from tokenlearner_pytorch import TokenLearner, TokenFuser

mhsa = nn.MultiheadAttention(3, 1)
tklr = TokenLearner(S=8)
tkfr = TokenFuser(H=32, W=32, C=3, S=8)

x = torch.rand(512, 32, 32, 3) # a batch of images

y = tklr(x)
y = y.view(8, 512, 3)
y, _ = mhsa(y, y, y) # ignore attn weights
y = y.view(512, 8, 3)

out = tkfr(y, x) # [512, 32, 23, 3]

TODO

Add support for temporal dimension T
Implement TokenFuser with ViT
Implement TokenFuser with ViViT

Contributions

If I've made any errors or you have any suggestions, feel free to raise an Issue or PR. All contributions welcome!!

License

MIT

Unofficial PyTorch implementation of TokenLearner by Google AI

Related tags

Overview

tokenlearner-pytorch

Installation

Usage

TODO

Contributions

License

Owner

Rishabh Anand

Building Ellee — A GPT-3 and Computer Vision Powered Talking Robotic Teddy Bear With Human Level Conversation Intelligence

Self-Adaptable Point Processes with Nonparametric Time Decays

Beancount-mercury - Beancount importer for Mercury Startup Checking

Codebase for Inducing Causal Structure for Interpretable Neural Networks

Hierarchical Memory Matching Network for Video Object Segmentation (ICCV 2021)

Planner_backend - Academic planner application designed for students and counselors.

This is official implementaion of paper "Token Shift Transformer for Video Classification".

clDice - a Novel Topology-Preserving Loss Function for Tubular Structure Segmentation

Keras implementations of Generative Adversarial Networks.

Official Pytorch implementation of "Learning Debiased Representation via Disentangled Feature Augmentation (Neurips 2021, Oral)"

Data-Uncertainty Guided Multi-Phase Learning for Semi-supervised Object Detection

Full Transformer Framework for Robust Point Cloud Registration with Deep Information Interaction

This code is an unofficial implementation of HiFiSinger.

Code for our paper "Interactive Analysis of CNN Robustness"

Recurrent Neural Network Tutorial, Part 2 - Implementing a RNN in Python and Theano

Unofficial JAX implementations of Deep Learning models

LIVECell - A large-scale dataset for label-free live cell segmentation

Machine Learning Platform for Kubernetes

PyTorch implementation of EigenGAN

Config files for my GitHub profile.