An implementation of the efficient attention module.

Last update: Dec 15, 2022

Overview

Efficient Attention

An implementation of the efficient attention module.

Description

Efficient attention is an attention mechanism that substantially optimizes the memory and computational efficiency while retaining exactly the same expressive power as the conventional dot-product attention. The illustration above compares the two types of attention. The efficient attention module is a drop-in replacement for the non-local module (Wang et al., 2018), while it:

uses less resources to achieve the same accuracy;
achieves higher accuracy with the same resource constraints (by allowing more insertions); and
is applicable in domains and models where the non-local module is not (due to resource constraints).

Resources

YouTube:

Presentation: https://youtu.be/_wnjhTM04NM

bilibili (for users in Mainland China):

Presentation: https://www.bilibili.com/video/BV1tK4y1f7Rm
Presentation in Chinese: https://www.bilibili.com/video/bv1Gt4y1Y7E3

Implementation details

This repository implements the efficient attention module with softmax normalization, output reprojection, and residual connection.

Features not in the paper

This repository implements additionally implements the multi-head mechanism which was not in the paper. To learn more about the mechanism, refer to Vaswani et al.

Citation

The paper will appear at WACV 2021. If you use, compare with, or refer to this work, please cite

@inproceedings{shen2021efficient,
    author = {Zhuoran Shen and Mingyuan Zhang and Haiyu Zhao and Shuai Yi and Hongsheng Li},
    title = {Efficient Attention: Attention with Linear Complexities},
    booktitle = {WACV},
    year = {2021},
}

An implementation of the efficient attention module.

Related tags

Overview

Efficient Attention

Description

Resources

Implementation details

Features not in the paper

Citation

Owner

Shen Zhuoran

This project aims to segment 4 common retinal lesions from Fundus Images.

Official project website for the CVPR 2021 paper "Exploring intermediate representation for monocular vehicle pose estimation"

Implementation of ResMLP, an all MLP solution to image classification, in Pytorch

Randomizes the warps in a stock pokeemerald repo.

FinEAS: Financial Embedding Analysis of Sentiment 📈

Official Implementation for Fast Training of Neural Lumigraph Representations using Meta Learning.

Implementation of "Glancing Transformer for Non-Autoregressive Neural Machine Translation"

Deep ViT Features as Dense Visual Descriptors

This repository focus on Image Captioning & Video Captioning & Seq-to-Seq Learning & NLP

Birthday-problem - The birthday problem asks for the probability that, in a set of n randomly chosen people, at least two will share a birthday

A curated list of awesome resources combining Transformers with Neural Architecture Search

Genetic Algorithm, Particle Swarm Optimization, Simulated Annealing, Ant Colony Optimization Algorithm,Immune Algorithm, Artificial Fish Swarm Algorithm, Differential Evolution and TSP(Traveling salesman)

Efficient neural networks for analog audio effect modeling

CasualHealthcare's Pneumonia detection with Artificial Intelligence (Convolutional Neural Network)

It's like Shape Editor in Maya but works with skeletons (transforms).

Code for the Paper: Alexandra Lindt and Emiel Hoogeboom.

Source code of our BMVC 2021 paper: AniFormer: Data-driven 3D Animation with Transformer

EMNLP 2021: Single-dataset Experts for Multi-dataset Question-Answering

FaceQgen: Semi-Supervised Deep Learning for Face Image Quality Assessment

Collection of generative models in Tensorflow