Skyformer: Remodel Self-Attention with Gaussian Kernel and Nystr\"om Method (NeurIPS 2021)

Last update: Sep 20, 2022

Related tags

Overview

Skyformer

This repository is the official implementation of Skyformer: Remodel Self-Attention with Gaussian Kernel and Nystr"om Method (NeurIPS 2021).

Requirements

To install requirements in a conda environment:

conda create -n skyformer python=3.6
conda activate skyformer
pip install -r requirements.txt

Note: Specific requirements for data preprocessing are not included here.

Data Preparation

Processed files can be downloaded here, or processed with the following steps:

Requirements

tensorboard>=2.3.0
tensorflow>=2.3.1
tensorflow-datasets>=4.0.1

Download the TFDS files for pathfinder and then set _PATHFINER_TFDS_PATH to the unzipped directory (following https://github.com/google-research/long-range-arena/issues/11)
Download lra_release.gz (7.7 GB).
Unzip lra-release and put under ./data/.

cd data
wget https://storage.googleapis.com/long-range-arena/lra_release.gz
tar zxvf lra-release.gz

Create a directory lra_processed under ./data/.

mkdir lra_processed
cd ..

6.The directory structure would be (assuming the root dir is code)

./data/lra-processed
./data/long-range-arena-main
./data/lra_release

Create train, dev, and test dataset pickle files for each task.

cd preprocess
python create_pathfinder.py
python create_listops.py
python create_retrieval.py
python create_text.py
python create_cifar10.py

Note: most source code comes from LRA repo.

Run

Modify the configuration in config.py and run

python main.py --mode train --attn skyformer --task lra-text

mode: train, eval
attn: softmax, nystrom, linformer, reformer, perfromer, informer, bigbird, kernelized, skyformer
task: lra-listops, lra-pathfinder, lra-retrieval, lra-text, lra-image

Reference

@inproceedings{Skyformer,
    title={Skyformer: Remodel Self-Attention with Gaussian Kernel and Nystr\"om Method}, 
    author={Yifan Chen and Qi Zeng and Heng Ji and Yun Yang},
    booktitle={NeurIPS},
    year={2021}
}

Skyformer: Remodel Self-Attention with Gaussian Kernel and Nystr\"om Method (NeurIPS 2021)

Related tags

Overview

Skyformer

Requirements

Data Preparation

Run

Reference

Owner

Qi Zeng

Plug-n-Play Reinforcement Learning in Python with OpenAI Gym and JAX

Python project to take sound as input and output as RGB + Brightness values suitable for DMX

Luminous is a framework for testing the performance of Embodied AI (EAI) models in indoor tasks.

Python code for loading the Aschaffenburg Pose Dataset.

DeepDiffusion: Unsupervised Learning of Retrieval-adapted Representations via Diffusion-based Ranking on Latent Feature Manifold

Adversarial Autoencoders

The pytorch implementation of the paper "text-guided neural image inpainting" at MM'2020

Official code for article "Expression is enough: Improving traﬀic signal control with advanced traﬀic state representation"

A flexible submap-based framework towards spatio-temporally consistent volumetric mapping and scene understanding.

Node for thenewboston digital currency network.

Implementing DropPath/StochasticDepth in PyTorch

prior-based-losses-for-medical-image-segmentation

Gans-in-action - Companion repository to GANs in Action: Deep learning with Generative Adversarial Networks

On-device wake word detection powered by deep learning.

Exadel CompreFace is a free and open-source face recognition GitHub project

Memory Efficient Attention (O(sqrt(n)) for Jax and PyTorch

Original Pytorch Implementation of FLAME: Facial Landmark Heatmap Activated Multimodal Gaze Estimation

Awesome Artificial Intelligence, Machine Learning and Deep Learning as we learn it

Augmented Traffic Control: A tool to simulate network conditions

Official Python implementation of the 'Sparse deconvolution'-v0.3.0