Adaptive Fourier Neural Operators: Efficient Token Mixers for Transformers

This repository contains a PyTorch implementation of the Adaptive Fourier Neural Operator (AFNO) token mixer. Classification code is also provided in the classification folder.

The Adaptive Fourier Neural Operator (AFNO) is a token mixer that learns to mix in the Fourier domain. AFNO builds on the principled foundation of operator learning, which allows us to frame token mixing as a continuous global convolution that does not depend on the input resolution. This principle was previously used to design FNO, which solves global convolution efficiently in the Fourier domain and has shown promise in learning challenging PDEs. To handle challenges in visual representation learning, such as discontinuities in images and high-resolution inputs, we propose principled architectural modifications to FNO that improve memory and computational efficiency. These include imposing a block-diagonal structure on the channel mixing weights, adaptively sharing weights across tokens, and sparsifying the frequency modes via soft-thresholding and shrinkage. The resulting model is highly parallel, with quasi-linear complexity and linear memory in the sequence size.
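The three modifications described above can be sketched in a few lines of PyTorch. This is a minimal illustration over a 1D token sequence, not the repository's AFNO1D or AFNO2D code: the class name `FourierMixerSketch`, its arguments, the two-layer MLP with ReLU, the initialization scale, and the residual connection are all assumptions made for the example.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class FourierMixerSketch(nn.Module):
    """Hypothetical AFNO-style mixer; names and defaults are illustrative only."""

    def __init__(self, dim=64, num_blocks=8, threshold=0.01):
        super().__init__()
        assert dim % num_blocks == 0, "dim must be divisible by num_blocks"
        self.num_blocks = num_blocks
        self.block = dim // num_blocks
        self.threshold = threshold
        # Block-diagonal complex weights, shared by every frequency mode.
        # The leading axis of size 2 stores the (real, imaginary) parts.
        shape_w = (2, num_blocks, self.block, self.block)
        shape_b = (2, num_blocks, self.block)
        self.w1 = nn.Parameter(0.02 * torch.randn(*shape_w))
        self.b1 = nn.Parameter(0.02 * torch.randn(*shape_b))
        self.w2 = nn.Parameter(0.02 * torch.randn(*shape_w))
        self.b2 = nn.Parameter(0.02 * torch.randn(*shape_b))

    @staticmethod
    def _cmul(re, im, w, b):
        # Complex multiply (re + i*im) @ (w[0] + i*w[1]), one matrix per block.
        out_re = (torch.einsum("nfbi,bio->nfbo", re, w[0])
                  - torch.einsum("nfbi,bio->nfbo", im, w[1]) + b[0])
        out_im = (torch.einsum("nfbi,bio->nfbo", im, w[0])
                  + torch.einsum("nfbi,bio->nfbo", re, w[1]) + b[1])
        return out_re, out_im

    def forward(self, x):
        # x: (batch, tokens, dim)
        batch, tokens, dim = x.shape
        # 1) Move the token axis to the Fourier domain.
        X = torch.fft.rfft(x, dim=1, norm="ortho")
        X = X.reshape(batch, X.shape[1], self.num_blocks, self.block)
        # 2) Two-layer block-diagonal channel mixing on each frequency mode.
        re, im = self._cmul(X.real, X.imag, self.w1, self.b1)
        re, im = F.relu(re), F.relu(im)
        re, im = self._cmul(re, im, self.w2, self.b2)
        # 3) Soft-threshold real and imaginary parts to sparsify the modes.
        Z = F.softshrink(torch.stack([re, im], dim=-1), lambd=self.threshold)
        Z = torch.view_as_complex(Z.contiguous()).reshape(batch, -1, dim)
        # 4) Return to token space; n=tokens restores the original length.
        out = torch.fft.irfft(Z, n=tokens, dim=1, norm="ortho")
        return out + x


mixer = FourierMixerSketch(dim=64, num_blocks=8)
y_short = mixer(torch.randn(2, 16, 64))
y_long = mixer(torch.randn(2, 24, 64))  # same weights, different sequence length
```

Because the weights act on channels of each frequency mode rather than on token positions, the same module accepts sequences of different lengths, which is the resolution independence the paragraph refers to.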

[arXiv]

Usage

Requirements

torch>=1.8.0
torchvision
timm

Note: The rfft2 and irfft2 functions require PyTorch>=1.8.0. Complex numbers are supported after PyTorch 1.6.0, but the FFT API in those earlier versions differs slightly from the current one.
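As a quick check that the installed PyTorch exposes the FFT functions mentioned in the note, the following sketch runs a 2D real FFT round trip. The tensor layout (batch, height, width, channels) and the sizes are arbitrary choices for illustration, not values taken from this repository.

```python
import torch

# Arbitrary example input: (batch, height, width, channels).
x = torch.randn(2, 14, 14, 32)

# Real-input 2D FFT over the two spatial axes. Only the non-redundant half
# of the last transformed axis is kept, so width 14 becomes 14 // 2 + 1 = 8.
X = torch.fft.rfft2(x, dim=(1, 2), norm="ortho")

# Inverse transform; s restores the original spatial size explicitly.
x_rec = torch.fft.irfft2(X, s=(14, 14), dim=(1, 2), norm="ortho")

round_trip_ok = torch.allclose(x, x_rec, atol=1e-5)
```

If `torch.fft.rfft2` raises an AttributeError here, the installed PyTorch predates the API this code relies on.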

Installation

pip install -e .

Example

from afno import AFNO1D, AFNO2D

mixer = AFNO1D()
mixer = AFNO2D()

Citation

If you find our work useful in your research, please consider citing:

@inproceedings{guibas2021efficient,
  title={Efficient Token Mixing for Transformers via Adaptive Fourier Neural Operators},
  author={Guibas, John and Mardani, Morteza and Li, Zongyi and Tao, Andrew and Anandkumar, Anima and Catanzaro, Bryan},
  booktitle={International Conference on Learning Representations},
  year={2021}
}

Adaptive FNO transformer - official PyTorch implementation

Owner

NVIDIA Research Projects
