Unofficial PyTorch implementation of Attention Free Transformer (AFT) layers by Apple Inc.

Last update: Dec 12, 2022

Overview

aft-pytorch

Unofficial PyTorch implementation of the layers from "An Attention Free Transformer" by Zhai et al. (Apple Inc.).

Installation

You can install aft-pytorch via pip:

pip install aft-pytorch

Usage

You can import the AFT-Full or AFT-Simple layer (as described in the paper) from the package like so:

`AFTFull`

import torch
from aft_pytorch import AFTFull

layer = AFTFull(
    max_seqlen=20,
    dim=512,
    hidden_dim=64
)

# a batch of 32 sequences, each with 10 timesteps (within max_seqlen=20) of 512 features
x = torch.rand(32, 10, 512)
y = layer(x)  # output shape: [32, 10, 512]
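
For intuition about what an AFT-Full layer computes, the paper defines each output as Y_t = sigmoid(Q_t) * sum_t'(exp(K_t' + w[t][t']) * V_t') / sum_t'(exp(K_t' + w[t][t'])), where w is a learned table of pairwise position biases. The following is a minimal, dependency-free sketch of that formula for a single sequence. It assumes q, k and v have already been projected, and the function name `aft_full` is hypothetical; it is not the package's implementation.

```python
import math

def aft_full(q, k, v, w):
    """Sketch of the AFT-Full formula for one sequence (not the package's code).

    q, k, v: T x D nested lists of floats (already projected).
    w: T x T nested list of pairwise position biases, w[t][s].
    """
    T, D = len(q), len(q[0])
    out = []
    for t in range(T):
        row = []
        for d in range(D):
            # Keys plus position bias act as per-feature logits over source positions s.
            logits = [k[s][d] + w[t][s] for s in range(T)]
            m = max(logits)  # subtract the max for numerical stability
            weights = [math.exp(l - m) for l in logits]
            context = sum(wt * v[s][d] for s, wt in enumerate(weights)) / sum(weights)
            # Gate the weighted average of the values with sigmoid(q).
            row.append(context / (1.0 + math.exp(-q[t][d])))
        out.append(row)
    return out

q = [[0.0], [0.0]]
k = [[0.0], [0.0]]
v = [[1.0], [3.0]]
uniform = aft_full(q, k, v, [[0.0, 0.0], [0.0, 0.0]])
local = aft_full(q, k, v, [[0.0, -1e9], [-1e9, 0.0]])
```

With zero biases every position sees the same average of the values; with strongly negative off-diagonal biases each position effectively sees only itself.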

`AFTSimple`

import torch
from aft_pytorch import AFTSimple

layer = AFTSimple(
    max_seqlen=20,
    dim=512,
    hidden_dim=64
)

# a batch of 32 sequences, each with 10 timesteps (within max_seqlen=20) of 512 features
x = torch.rand(32, 10, 512)
y = layer(x)  # output shape: [32, 10, 512]
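
AFT-Simple drops the position biases, so per the paper each output reduces to Y_t = sigmoid(Q_t) * sum_t'(softmax(K)_t' * V_t'), with the softmax taken over the time axis. The sketch below illustrates this in plain Python for a single sequence. It assumes q, k and v are already projected, and `aft_simple` is a hypothetical name used only for illustration, not the package's implementation.

```python
import math

def aft_simple(q, k, v):
    """Sketch of the AFT-Simple formula for one sequence (not the package's code).

    q, k, v: T x D nested lists of floats (already projected).
    """
    T, D = len(q), len(q[0])
    # One global context vector: softmax of the keys over time, per feature,
    # used to average the values.
    context = []
    for d in range(D):
        col_max = max(k[t][d] for t in range(T))  # for numerical stability
        weights = [math.exp(k[t][d] - col_max) for t in range(T)]
        total = sum(weights)
        context.append(sum(wt * v[t][d] for t, wt in enumerate(weights)) / total)
    # Every position shares the same context and gates it with sigmoid(q).
    return [
        [context[d] / (1.0 + math.exp(-q[t][d])) for d in range(D)]
        for t in range(T)
    ]

q = [[0.0, 0.0], [0.0, 0.0]]
k = [[0.0, 0.0], [0.0, 0.0]]
v = [[1.0, 2.0], [3.0, 4.0]]
y = aft_simple(q, k, v)
```

Because the context vector is computed once and reused for every position, the cost grows linearly with sequence length in this formulation.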

These layers are designed to be plug-and-play with your existing networks and Transformers: you can swap a self-attention layer for one of the layers in this package with minimal changes.
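
As a rough illustration of such a swap, here is a sketch of a pre-norm Transformer-style block whose token-mixing layer is pluggable. The `MixerBlock` class, its `mlp_ratio` parameter and the block layout are assumptions made for this example and are not part of aft-pytorch. The only interface assumed of the mixer is that it maps `[batch, seq, dim]` to `[batch, seq, dim]`, as the usage examples above do. If the package is not installed, the sketch falls back to `nn.Identity` purely so that it still runs.

```python
import torch
from torch import nn

class MixerBlock(nn.Module):
    """Hypothetical pre-norm block with a pluggable token-mixing layer.

    `mixer` is any module mapping [batch, seq, dim] -> [batch, seq, dim],
    e.g. a self-attention wrapper or an AFT layer.
    """

    def __init__(self, mixer: nn.Module, dim: int, mlp_ratio: int = 4):
        super().__init__()
        self.norm1 = nn.LayerNorm(dim)
        self.mixer = mixer
        self.norm2 = nn.LayerNorm(dim)
        self.mlp = nn.Sequential(
            nn.Linear(dim, mlp_ratio * dim),
            nn.GELU(),
            nn.Linear(mlp_ratio * dim, dim),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        x = x + self.mixer(self.norm1(x))  # token mixing with a residual connection
        x = x + self.mlp(self.norm2(x))    # position-wise MLP with a residual connection
        return x

try:
    from aft_pytorch import AFTSimple
    mixer = AFTSimple(max_seqlen=20, dim=512, hidden_dim=64)
except ImportError:
    # Stand-in so the sketch runs without the package; it performs no token mixing.
    mixer = nn.Identity()

block = MixerBlock(mixer, dim=512)
x = torch.rand(32, 10, 512)
y = block(x)  # same shape as the input
```

Keeping the mixer as a constructor argument means the rest of the block does not need to change when switching between self-attention, AFTSimple and AFTFull.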

TODO

Add the full AFT architecture
Add variants such as AFTConv and AFTLocal

Contributing

If you like this repo, please leave a star! If you have any amendments or suggestions, feel free to open a PR or an issue.

Credits

@misc{attention-free-transformer,
title = {An Attention Free Transformer},
author = {Shuangfei Zhai and Walter Talbott and Nitish Srivastava and Chen Huang and Hanlin Goh and Ruixiang Zhang and Josh Susskind},
year = {2021},
URL = {https://arxiv.org/pdf/2105.14103.pdf}
}

License

MIT


Owner

Rishabh Anand
