DeeBERT: Dynamic Early Exiting for Accelerating BERT Inference

Last update: Nov 14, 2022

Related tags

Overview

DeeBERT

This is the code base for the paper DeeBERT: Dynamic Early Exiting for Accelerating BERT Inference.

Code in this repository is also available in the Huggingface Transformer repo (with minor modification for version compatibility). Check this page for models that we have trained in advance (the latest version of Huggingface Transformers Library is needed).

Installation

This repo is tested on Python 3.7.5, PyTorch 1.3.1, and Cuda 10.1. Using a virtulaenv or conda environemnt is recommended, for example:

conda install pytorch==1.3.1 torchvision cudatoolkit=10.1 -c pytorch

After installing the required environment, clone this repo, and install the following requirements:

git clone https://github.com/castorini/deebert
cd deebert
pip install -r ./requirements.txt
pip install -r ./examples/requirements.txt

Usage

There are four scripts in the scripts folder, which can be run from the repo root, e.g., scripts/train.sh.

In each script, there are several things to modify before running:

path to the GLUE dataset. Check this for more details.
path for saving fine-tuned models. Default: ./saved_models.
path for saving evaluation results. Default: ./plotting. Results are printed to stdout and also saved to npy files in this directory to facilitate plotting figures and further analyses.
model_type (bert or roberta)
model_size (base or large)
dataset (SST-2, MRPC, RTE, QNLI, QQP, or MNLI)

train.sh

This is for fine-tuning and evaluating models as in the original BERT paper.

train_highway.sh

This is for fine-tuning DeeBERT models.

eval_highway.sh

This is for evaluating each exit layer for fine-tuned DeeBERT models.

eval_entropy.sh

This is for evaluating fine-tuned DeeBERT models, given a number of different early exit entropy thresholds.

Citation

Please cite our paper if you find the repository useful:

@inproceedings{xin-etal-2020-deebert,
    title = "{D}ee{BERT}: Dynamic Early Exiting for Accelerating {BERT} Inference",
    author = "Xin, Ji  and
      Tang, Raphael  and
      Lee, Jaejun  and
      Yu, Yaoliang  and
      Lin, Jimmy",
    booktitle = "Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics",
    month = jul,
    year = "2020",
    address = "Online",
    publisher = "Association for Computational Linguistics",
    url = "https://www.aclweb.org/anthology/2020.acl-main.204",
    pages = "2246--2251",
}

DeeBERT: Dynamic Early Exiting for Accelerating BERT Inference

Related tags

Overview

DeeBERT

Installation

Usage

train.sh

train_highway.sh

eval_highway.sh

eval_entropy.sh

Citation

Owner

Castorini

Code for using and evaluating SpanBERT.

BiQE: Code and dataset for the BiQE paper

PyTorch Implementation of "Bridging Pre-trained Language Models and Hand-crafted Features for Unsupervised POS Tagging" (Findings of ACL 2022)

The code for two papers: Feedback Transformer and Expire-Span.

The code from the whylogs workshop in DataTalks.Club on 29 March 2022

自然言語で書かれた時間情報表現を抽出/規格化するルールベースの解析器

This is a really simple text-to-speech app made with python and tkinter.

Translate U is capable of translating the text present in an image from one language to the other.

LV-BERT: Exploiting Layer Variety for BERT (Findings of ACL 2021)

Share constant definitions between programming languages and make your constants constant again

中文医疗信息处理基准CBLUE: A Chinese Biomedical LanguageUnderstanding Evaluation Benchmark

GrammarTagger — A Neural Multilingual Grammar Profiler for Language Learning

Nested Named Entity Recognition

An ActivityWatch watcher to pose questions to the user and record her answers.

Download videos from YouTube/Twitch/Twitter right in the Windows Explorer, without installing any shady shareware apps

Repo for Enhanced Seq2Seq Autoencoder via Contrastive Learning for Abstractive Text Summarization

A Streamlit web app that generates Rick and Morty stories using GPT2.

Indobenchmark are collections of Natural Language Understanding (IndoNLU) and Natural Language Generation (IndoNLG)

:house_with_garden: Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.

基于百度的语音识别，用python实现，pyaudio+pyqt