Stochastic Positional Encoding (SPE)

This is the source code repository for the ICML 2021 paper Relative Positional Encoding for Transformers with Linear Complexity by Antoine Liutkus, Ondřej Cífka, Shih-Lun Wu, Umut Şimşekli, Yi-Hsuan Yang and Gaël Richard.

In this paper, we propose Stochastic Positional Encoding (SPE), which provably behaves like relative PE while being compatible with linear-complexity Transformers. We do this by drawing a connection between positional encoding and cross-covariance structures of correlated Gaussian processes.

Check out also the companion website with music examples.

Citation:

@inproceedings{pmlr-v139-liutkus21a,
  title = 	 {Relative Positional Encoding for {Transformers} with Linear Complexity},
  author =       {Liutkus, Antoine and C{\'i}fka, Ond{\v r}ej and Wu, Shih-Lun and {\c S}im{\c s}ekli, Umut and Yang, Yi-Hsuan and Richard, Ga{\"e}l},
  booktitle = 	 {Proceedings of the 38th International Conference on Machine Learning},
  pages = 	 {7067--7079},
  year = 	 {2021},
  editor = 	 {Meila, Marina and Zhang, Tong},
  volume = 	 {139},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {18--24 Jul},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v139/liutkus21a/liutkus21a.pdf},
  url = 	 {http://proceedings.mlr.press/v139/liutkus21a.html}
}

SPE implementation

We have implemented SPE in PyTorch and JAX/Flax. Each implementation is available as a separate Python package under src.

Experiments

Each of the 3 experiments (LRA, pop piano generation, groove continuation) has a dedicated directory under experiments. See the README files there for how to set up the environment and prepare the datasets. To make sure you have the custom dependencies for each experiment, clone this repository with --recurse-submodules or run git submodule init && git submodule update after cloning.

Relative Positional Encoding for Transformers with Linear Complexity

Related tags

Overview

Stochastic Positional Encoding (SPE)

SPE implementation

Experiments

Owner

Antoine Liutkus

Iterative Normalization: Beyond Standardization towards Efficient Whitening

PClean: A Domain-Specific Probabilistic Programming Language for Bayesian Data Cleaning

A containerized REST API around OpenAI's CLIP model.

FastFace: Lightweight Face Detection Framework

Single cell current best practices tutorial case study for the paper:Luecken and Theis, "Current best practices in single-cell RNA-seq analysis: a tutorial"

An open software package to develop BCI based brain and cognitive computing technology for recognizing user's intention using deep learning

[CVPR 2022] Deep Equilibrium Optical Flow Estimation

Line-level Handwritten Text Recognition (HTR) system implemented with TensorFlow.

Codes for the ICCV'21 paper "FREE: Feature Refinement for Generalized Zero-Shot Learning"

Music Source Separation; Train & Eval & Inference piplines and pretrained models we used for 2021 ISMIR MDX Challenge.

Python scripts form performing stereo depth estimation using the CoEx model in ONNX.

This is the accompanying toolbox for the paper "A Survey on GANs for Anomaly Detection"

Autonomous Perception: 3D Object Detection with Complex-YOLO

Unofficial PyTorch implementation of SimCLR by Google Brain

Semi-Supervised Learning for Fine-Grained Classification

ROS support for Velodyne 3D LIDARs

Region-aware Contrastive Learning for Semantic Segmentation, ICCV 2021

Subdivision-based Mesh Convolutional Networks

DiffQ performs differentiable quantization using pseudo quantization noise. It can automatically tune the number of bits used per weight or group of weights, in order to achieve a given trade-off between model size and accuracy.

Aligning Latent and Image Spaces to Connect the Unconnectable