Implementation of Cross Transformer for spatially-aware few-shot transfer, in Pytorch

Last update: Dec 22, 2022

Overview

Cross Transformers - Pytorch (wip)

Implementation of Cross Transformer for spatially-aware few-shot transfer, in Pytorch

Install

$ pip install cross-transformers-pytorch

Usage

import torch
from torch import nn
import torch.nn.functional as F
from torchvision import models
from cross_transformers_pytorch import CrossTransformer

resnet = models.resnet34(pretrained = True)
model = nn.Sequential(*[*resnet.children()][:-2])

cross_transformer = CrossTransformer(
    dim = 512,
    dim_key = 128,
    dim_value = 128
)

# (batch, channels, height, width)
img_query = torch.randn(1, 3, 224, 224)

# (batch, classes, num supports, channels, height, width)
img_supports = torch.randn(1, 2, 4, 3, 224, 224)

labels = torch.randint(0, 2, (1,))

dists = cross_transformer(model, img_query, img_supports) # (1, 2)

loss = F.cross_entropy(dists, labels)
loss.backward()

Citations

@misc{doersch2020crosstransformers,
    title={CrossTransformers: spatially-aware few-shot transfer}, 
    author={Carl Doersch and Ankush Gupta and Andrew Zisserman},
    year={2020},
    eprint={2007.11498},
    archivePrefix={arXiv},
    primaryClass={cs.CV}
}

You might also like...

Official PyTorch code for Mutual Affine Network for Spatially Variant Kernel Estimation in Blind Image Super-Resolution (MANet, ICCV2021)

Mutual Affine Network for Spatially Variant Kernel Estimation in Blind Image Super-Resolution (MANet, ICCV2021) This repository is the official PyTorc

139 Dec 29, 2022

Official PyTorch implementation of MX-Font (Multiple Heads are Better than One: Few-shot Font Generation with Multiple Localized Experts)

Introduction Pytorch implementation of Multiple Heads are Better than One: Few-shot Font Generation with Multiple Localized Expert. | paper Song Park1

97 Dec 23, 2022

Official PyTorch Implementation of Hypercorrelation Squeeze for Few-Shot Segmentation, arXiv 2021

Hypercorrelation Squeeze for Few-Shot Segmentation This is the implementation of the paper "Hypercorrelation Squeeze for Few-Shot Segmentation" by Juh

165 Dec 28, 2022

Pytorch implementation of few-shot semantic image synthesis

Few-shot Semantic Image Synthesis Using StyleGAN Prior Our method can synthesize photorealistic images from dense or sparse semantic annotations using

40 Sep 26, 2022

Pytorch Implementation for CVPR2018 Paper: Learning to Compare: Relation Network for Few-Shot Learning

LearningToCompare Pytorch Implementation for Paper: Learning to Compare: Relation Network for Few-Shot Learning Howto download mini-imagenet and make

246 Dec 19, 2022

Pytorch implementation of the paper "Optimization as a Model for Few-Shot Learning"

Optimization as a Model for Few-Shot Learning This repo provides a Pytorch implementation for the Optimization as a Model for Few-Shot Learning paper.

238 Jan 4, 2023

(ICCV'21) Official PyTorch implementation of Relational Embedding for Few-Shot Classification

Relational Embedding for Few-Shot Classification (ICCV 2021) Dahyun Kang, Heeseung Kwon, Juhong Min, Minsu Cho [paper], [project hompage] We propose t

82 Dec 24, 2022

PyTorch implementation of D2C: Diffuison-Decoding Models for Few-shot Conditional Generation.

D2C: Diffuison-Decoding Models for Few-shot Conditional Generation Project | Paper PyTorch implementation of D2C: Diffuison-Decoding Models for Few-sh

90 Dec 27, 2022

Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch

🦩 Flamingo - Pytorch Implementation of Flamingo, state-of-the-art few-shot visual question answering attention net, in Pytorch. It will include the p

630 Dec 28, 2022

Implementation of Cross Transformer for spatially-aware few-shot transfer, in Pytorch

Related tags

Overview

Cross Transformers - Pytorch (wip)

Install

Usage

Citations

You might also like...

Official PyTorch code for Mutual Affine Network for Spatially Variant Kernel Estimation in Blind Image Super-Resolution (MANet, ICCV2021)

Official PyTorch implementation of MX-Font (Multiple Heads are Better than One: Few-shot Font Generation with Multiple Localized Experts)

Official PyTorch Implementation of Hypercorrelation Squeeze for Few-Shot Segmentation, arXiv 2021

Pytorch implementation of few-shot semantic image synthesis

Pytorch Implementation for CVPR2018 Paper: Learning to Compare: Relation Network for Few-Shot Learning

Pytorch implementation of the paper "Optimization as a Model for Few-Shot Learning"

(ICCV'21) Official PyTorch implementation of Relational Embedding for Few-Shot Classification

PyTorch implementation of D2C: Diffuison-Decoding Models for Few-shot Conditional Generation.

Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch

Releases(0.0.2)

0.0.2(Mar 30, 2021)

0.0.1(Dec 16, 2020)

Owner

Phil Wang

Implementation of Multistream Transformers in Pytorch

Deploy optimized transformer based models on Nvidia Triton server

Implementation for Paper "Inverting Generative Adversarial Renderer for Face Reconstruction"

GLANet - The code for Global and Local Alignment Networks for Unpaired Image-to-Image Translation arxiv

This repository collects project-relevant Isabelle/HOL formalizations.

Official Pytorch implementation of ICLR 2018 paper Deep Learning for Physical Processes: Integrating Prior Scientific Knowledge.

Repository of continual learning papers

Propagate Yourself: Exploring Pixel-Level Consistency for Unsupervised Visual Representation Learning, CVPR 2021

LBBA-boosted WSOD

Meshed-Memory Transformer for Image Captioning. CVPR 2020

Benchmarking Pipeline for Prediction of Protein-Protein Interactions

PyGCL: Graph Contrastive Learning Library for PyTorch

Hummingbird compiles trained ML models into tensor computation for faster inference.

Dataset VSD4K includes 6 popular categories: game, sport, dance, vlog, interview and city.

User-friendly bulk RNAseq deconvolution using simulated annealing

SEAN: Image Synthesis with Semantic Region-Adaptive Normalization (CVPR 2020, Oral)

This repo provides function call to track multi-objects in videos

REBEL: Relation Extraction By End-to-end Language generation

[ICLR 2021] Heteroskedastic and Imbalanced Deep Learning with Adaptive Regularization

Automatically creates genre collections for your Plex media