Attention for PyTorch with Linear Memory Footprint

Unofficially implements https://arxiv.org/abs/2112.05682 to get Linear Memory Cost on Attention (+ some sidekick speedup on the GPU when compared to reference implementation in JAX)

Usage:

git clone https://github.com/CHARM-Tx/linear_mem_attention_pytorch
cd linear_mem_attention_pytorch
python setup.py install

Usage:

High Level

from linear_mem_attention_torch.fast_attn import Attention

batch, length, features = 2, 2**8, 64
x, ctx = torch.randn(2, batch, length, features)
mask = torch.randn(batch, length) < 1.

attn = Attention(dim=features, heads = 8, dim_head = 64, bias=False)

# self-attn
v_self = attn(x, x, mask, query_chunk_size=1024, key_chunk_size=4096)

# cross-attn
v_cross = attn(x, ctx, mask, query_chunk_size=1024, key_chunk_size=4096)

Low level

from linear_mem_attention_torch import attention

batch, length, heads, features = 2, 2**8, 8, 64
mask = torch.randn(batch, length) < 1.
q, k, v = torch.randn(3, batch, length, heads, features)

v_ = attention(q, k, v, mask, query_chunk_size=1024, key_chunk_size=4096)

Benchmarks

See examples/example_benchamrk.ipynb for more information.

Citations:

@misc{rabe2021selfattention,
      title={Self-attention Does Not Need $O(n^2)$ Memory}, 
      author={Markus N. Rabe and Charles Staats},
      year={2021},
      eprint={2112.05682},
      archivePrefix={arXiv},
      primaryClass={cs.LG}
}

Attention for PyTorch with Linear Memory Footprint

Related tags

Overview

Attention for PyTorch with Linear Memory Footprint

Usage:

Usage:

High Level

Low level

Benchmarks

Citations:

Owner

Official Implementation of CVPR 2022 paper: "Mimicking the Oracle: An Initial Phase Decorrelation Approach for Class Incremental Learning"

ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators

OpenMMLab Semantic Segmentation Toolbox and Benchmark.

Semantic code search implementation using Tensorflow framework and the source code data from the CodeSearchNet project

Video Frame Interpolation with Transformer (CVPR2022)

CURL: Contrastive Unsupervised Representations for Reinforcement Learning

Industrial knn-based anomaly detection for images. Visit streamlit link to check out the demo.

This project is the PyTorch implementation of our CVPR 2022 paper:

Large-scale open domain KNOwledge grounded conVERsation system based on PaddlePaddle

[NAACL & ACL 2021] SapBERT: Self-alignment pretraining for BERT.

Anomaly detection related books, papers, videos, and toolboxes

Point detection through multi-instance deep heatmap regression for sutures in endoscopy

Ontologysim: a Owlready2 library for applied production simulation

AI4Good project for detecting waste in the environment

This project provides an unsupervised framework for mining and tagging quality phrases on text corpora with pretrained language models (KDD'21).

Official PyTorch implementation of Learning Intra-Batch Connections for Deep Metric Learning (ICML 2021) published at International Conference on Machine Learning

YOLOX Win10 Project

This repository contains demos I made with the Transformers library by HuggingFace.

Baseline inference Algorithm for the STOIC2021 challenge.

CAMPARI: Camera-Aware Decomposed Generative Neural Radiance Fields