Long-Short Transformer (Transformer-LS)

This repository hosts the code and models for the paper:

Long-Short Transformer: Efficient Transformers for Language and Vision

Updates

July 23, 2021: Release the code and models for ImageNet classification and Long-Range Arena

Architecture

Long-short Transformer substitutes the full self attention of the original Transformer models with an efficient attention that considers both long-range and short-term correlations. Each query attends to tokens from the segment-wise sliding window to capture short-term correlations, and the dynamically projected features to capture long-range correlations. To align the norms of the original and projected feature vectors and improve the efficacy of the aggregation, we normalize the original and project feature vectors with two sets of Layer Normalizations.

Tasks

>>> Transformer-LS for ImageNet classification
>>> Transformer-LS for Long Range Areana
>>> Transformer-LS for autoregressive language modeling

Official implementation of Long-Short Transformer in PyTorch.

Related tags

Overview

Long-Short Transformer (Transformer-LS)

Updates

Architecture

Tasks

Owner

NVIDIA Corporation

JORLDY an open-source Reinforcement Learning (RL) framework provided by KakaoEnterprise

Astrostatistics class for the MSc degree in Astrophysics at the University of Milan-Bicocca (Italy)

CFNet: Cascade and Fused Cost Volume for Robust Stereo Matching（CVPR2021）

Code for the paper "JANUS: Parallel Tempered Genetic Algorithm Guided by Deep Neural Networks for Inverse Molecular Design"

Code for the ACL2021 paper "Lexicon Enhanced Chinese Sequence Labelling Using BERT Adapter"

Official code implementation for "Personalized Federated Learning using Hypernetworks"

Code for the paper "Unsupervised Contrastive Learning of Sound Event Representations", ICASSP 2021.

Predicts an answer in yes or no.

P-Tuning v2: Prompt Tuning Can Be Comparable to Finetuning Universally Across Scales and Tasks

FusionNet: A deep fully residual convolutional neural network for image segmentation in connectomics

OBBDetection is a oriented object detection library, which is based on MMdetection.

[NeurIPS 2021 Spotlight] Aligning Pretraining for Detection via Object-Level Contrastive Learning

Code for one-stage adaptive set-based HOI detector AS-Net.

Code for our paper "Interactive Analysis of CNN Robustness"

Repository for MeshTalk supplemental material and code once the (already approved) 16 GHS captures our lab will make publicly available are released.

Meshed-Memory Transformer for Image Captioning. CVPR 2020

A PyTorch implementation for PyramidNets (Deep Pyramidal Residual Networks)

Density-aware Single Image De-raining using a Multi-stream Dense Network (CVPR 2018)

A Structured Self-attentive Sentence Embedding

Noether Networks: meta-learning useful conserved quantities