Code for the paper "BERT Loses Patience: Fast and Robust Inference with Early Exit".

Last update: Jan 04, 2023

Patience-based Early Exit

NEWS: We now have a better and tidier implementation integrated into Hugging Face transformers!

Citation

If you use this code in your research, please cite our paper:

@inproceedings{zhou2020bert,
 author = {Zhou, Wangchunshu and Xu, Canwen and Ge, Tao and McAuley, Julian and Xu, Ke and Wei, Furu},
 booktitle = {Advances in Neural Information Processing Systems},
 pages = {18330--18341},
 publisher = {Curran Associates, Inc.},
 title = {BERT Loses Patience: Fast and Robust Inference with Early Exit},
 url = {https://proceedings.neurips.cc/paper/2020/file/d4dd111a4fd973394238aca5c05bebe3-Paper.pdf},
 volume = {33},
 year = {2020}
}

Requirement

Our code is built on huggingface/transformers, so you must clone and install huggingface/transformers before using it.

Training

You can fine-tune a pretrained language model and train the internal classifiers by configuring and running finetune_bert.sh and finetune_albert.sh.

Inference

You can run inference with different patience settings by configuring and running patience_infer_albert.sh and patience_infer_bert.sh.
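
For intuition about what a patience setting controls, here is a minimal sketch of the exit rule as described in the paper: inference stops once the internal classifiers' predictions stay unchanged for a given number of consecutive layers. This is not the code from this repository. The function name patience_early_exit and its inputs are hypothetical, and the per-layer predictions are passed in as a plain list rather than computed by a model.

```python
from typing import Sequence, Tuple


def patience_early_exit(layer_predictions: Sequence[int], patience: int) -> Tuple[int, int]:
    """Illustrative patience-based exit rule (hypothetical helper, not the repo's API).

    layer_predictions[i] is the label predicted by the internal classifier
    attached to layer i + 1. Returns (final_prediction, layers_used).
    """
    if patience < 1:
        raise ValueError("patience must be at least 1")
    if not layer_predictions:
        raise ValueError("layer_predictions must not be empty")

    streak = 0       # how many consecutive layers agreed with the previous one
    previous = None  # prediction of the previous internal classifier

    for layers_used, prediction in enumerate(layer_predictions, start=1):
        if previous is not None and prediction == previous:
            streak += 1
        else:
            streak = 0  # prediction changed (or first layer): reset the counter
        previous = prediction

        if streak >= patience:
            # Predictions have been stable for `patience` layers: exit early.
            return prediction, layers_used

    # Patience was never reached: fall back to the last layer's prediction.
    return previous, len(layer_predictions)


if __name__ == "__main__":
    predictions = [1, 0, 0, 0, 2, 2]
    print(patience_early_exit(predictions, patience=2))
    print(patience_early_exit(predictions, patience=5))
```

In this sketch a smaller patience exits sooner and saves computation, while a larger patience uses more layers before committing to a prediction.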

Bug Report and Contribution

If you'd like to contribute by adding more tasks (only GLUE is available at the moment), please submit a pull request and contact me. If you find a problem or bug, please open an issue. Thanks!

Owner

Kevin Canwen Xu
