Pervasive Attention: 2D Convolutional Networks for Sequence-to-Sequence Prediction

Last update: Dec 15, 2022

Overview

This is a fork of Fairseq(-py) with implementations of the following models:

Pervasive Attention - 2D Convolutional Neural Networks for Sequence-to-Sequence Prediction

An NMT models with two-dimensional convolutions to jointly encode the source and the target sequences.

Pervasive Attention also provides an extensive decoding grid that we leverage to efficiently train wait-k models.

See README.

Efficient Wait-k Models for Simultaneous Machine Translation

Transformer Wait-k models (Ma et al., 2019) with unidirectional encoders and with joint training of multiple wait-k paths.

See README.

Fairseq Requirements and Installation

PyTorch version >= 1.4.0
Python version >= 3.6
For training new models, you'll also need an NVIDIA GPU and NCCL

Installing Fairseq

git clone https://github.com/elbayadm/attn2d
cd attn2d
pip install --editable .

License

fairseq(-py) is MIT-licensed. The license applies to the pre-trained models as well.

Citation

For Pervasive Attention, please cite:

@InProceedings{elbayad18conll,
    author ="Elbayad, Maha and Besacier, Laurent and Verbeek, Jakob",
    title = "Pervasive Attention: 2D Convolutional Neural Networks for Sequence-to-Sequence Prediction",
    booktitle = "Proceedings of the 22nd Conference on Computational Natural Language Learning",
    year = "2018",
 }

For our wait-k models, please cite:

@article{elbayad20waitk,
    title={Efficient Wait-k Models for Simultaneous Machine Translation},
    author={Elbayad, Maha and Besacier, Laurent and Verbeek, Jakob},
    journal={arXiv preprint arXiv:2005.08595},
    year={2020}
}

For Fairseq, please cite:

@inproceedings{ott2019fairseq,
  title = {fairseq: A Fast, Extensible Toolkit for Sequence Modeling},
  author = {Myle Ott and Sergey Edunov and Alexei Baevski and Angela Fan and Sam Gross and Nathan Ng and David Grangier and Michael Auli},
  booktitle = {Proceedings of NAACL-HLT 2019: Demonstrations},
  year = {2019},
}

Pervasive Attention: 2D Convolutional Networks for Sequence-to-Sequence Prediction

Related tags

Overview

Pervasive Attention - 2D Convolutional Neural Networks for Sequence-to-Sequence Prediction

Efficient Wait-k Models for Simultaneous Machine Translation

Fairseq Requirements and Installation

License

Citation

Owner

Maha

Python scripts using the Mediapipe models for Halloween.

这是一个利用facenet和retinaface实现人脸识别的库，可以进行在线的人脸识别。

Diabet Feature Engineering - Predict whether people have diabetes when their characteristics are specified

An adaptive hierarchical energy management strategy for hybrid electric vehicles

Reproduces the results of the paper "Finite Basis Physics-Informed Neural Networks (FBPINNs): a scalable domain decomposition approach for solving differential equations".

This repository contains answers of the Shopify Summer 2022 Data Science Intern Challenge.

A PyTorch re-implementation of Neural Radiance Fields

An Approach to Explore Logistic Regression Models

NeurIPS 2021, self-supervised 6D pose on category level

Technical experimentations to beat the stock market using deep learning :chart_with_upwards_trend:

Speech recognition tool to convert audio to text transcripts, for Linux and Raspberry Pi.

The official PyTorch implementation of the paper: Xili Dai, Xiaojun Yuan, Haigang Gong, Yi Ma. "Fully Convolutional Line Parsing." .

Code for Neural-GIF: Neural Generalized Implicit Functions for Animating People in Clothing(ICCV21)

SLAMP: Stochastic Latent Appearance and Motion Prediction

URIE: Universal Image Enhancementfor Visual Recognition in the Wild

Self-supervised Label Augmentation via Input Transformations (ICML 2020)

performing moving objects segmentation using image processing techniques with opencv and numpy

Simple API for UCI Machine Learning Dataset Repository (search, download, analyze)

This project uses reinforcement learning on stock market and agent tries to learn trading. The goal is to check if the agent can learn to read tape. The project is dedicated to hero in life great Jesse Livermore.

High accurate tool for automatic faces detection with landmarks

Pervasive Attention: 2D Convolutional Networks for Sequence-to-Sequence Prediction

Related tags

Overview

Pervasive Attention - 2D Convolutional Neural Networks for Sequence-to-Sequence Prediction

Efficient Wait-k Models for Simultaneous Machine Translation

Fairseq Requirements and Installation

License

Citation

Owner

Maha

Python scripts using the Mediapipe models for Halloween.

这是一个利用facenet和retinaface实现人脸识别的库，可以进行在线的人脸识别。

Diabet Feature Engineering - Predict whether people have diabetes when their characteristics are specified

An adaptive hierarchical energy management strategy for hybrid electric vehicles

Reproduces the results of the paper "Finite Basis Physics-Informed Neural Networks (FBPINNs): a scalable domain decomposition approach for solving differential equations".

This repository contains answers of the Shopify Summer 2022 Data Science Intern Challenge.

A PyTorch re-implementation of Neural Radiance Fields

An Approach to Explore Logistic Regression Models

NeurIPS 2021, self-supervised 6D pose on category level

Technical experimentations to beat the stock market using deep learning :chart_with_upwards_trend:

Speech recognition tool to convert audio to text transcripts, for Linux and Raspberry Pi.

The official PyTorch implementation of the paper: *Xili Dai, Xiaojun Yuan, Haigang Gong, Yi Ma. "Fully Convolutional Line Parsing." *.

Code for Neural-GIF: Neural Generalized Implicit Functions for Animating People in Clothing(ICCV21)

SLAMP: Stochastic Latent Appearance and Motion Prediction

URIE: Universal Image Enhancementfor Visual Recognition in the Wild

Self-supervised Label Augmentation via Input Transformations (ICML 2020)

performing moving objects segmentation using image processing techniques with opencv and numpy

Simple API for UCI Machine Learning Dataset Repository (search, download, analyze)

This project uses reinforcement learning on stock market and agent tries to learn trading. The goal is to check if the agent can learn to read tape. The project is dedicated to hero in life great Jesse Livermore.

High accurate tool for automatic faces detection with landmarks

The official PyTorch implementation of the paper: Xili Dai, Xiaojun Yuan, Haigang Gong, Yi Ma. "Fully Convolutional Line Parsing." .