DeeBERT: Dynamic Early Exiting for Accelerating BERT Inference

Last update: Nov 14, 2022

DeeBERT

This is the code base for the paper DeeBERT: Dynamic Early Exiting for Accelerating BERT Inference.

Code in this repository is also available in the Hugging Face Transformers repo (with minor modifications for version compatibility). Check this page for models that we have trained in advance (the latest version of the Hugging Face Transformers library is needed).

Installation

This repo is tested on Python 3.7.5, PyTorch 1.3.1, and CUDA 10.1. Using a virtualenv or conda environment is recommended, for example:

conda install pytorch==1.3.1 torchvision cudatoolkit=10.1 -c pytorch

After installing the required environment, clone this repo, and install the following requirements:

git clone https://github.com/castorini/deebert
cd deebert
pip install -r ./requirements.txt
pip install -r ./examples/requirements.txt

Usage

There are four scripts in the scripts folder, which can be run from the repo root, e.g., scripts/train.sh.

In each script, there are several things to modify before running:

path to the GLUE dataset. Check this for more details.
path for saving fine-tuned models. Default: ./saved_models.
path for saving evaluation results. Default: ./plotting. Results are printed to stdout and also saved to npy files in this directory to facilitate plotting figures and further analyses.
model_type (bert or roberta)
model_size (base or large)
dataset (SST-2, MRPC, RTE, QNLI, QQP, or MNLI)

train.sh

This is for fine-tuning and evaluating models as in the original BERT paper.

train_highway.sh

This is for fine-tuning DeeBERT models.

eval_highway.sh

This is for evaluating each exit layer for fine-tuned DeeBERT models.
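As a rough illustration of what per-exit-layer evaluation means, the sketch below computes accuracy separately for each exit layer from already-collected predictions. It is not the code in eval_highway.sh; the function name `accuracy_per_exit` and the input layout are assumptions made for this example.

```python
def accuracy_per_exit(per_layer_preds, labels):
    """Compute accuracy for each exit layer.

    per_layer_preds[i][j] is the class predicted by exit layer i
    for example j (hypothetical layout, chosen for this sketch).
    """
    if not labels:
        raise ValueError("labels must not be empty")
    accuracies = []
    for preds in per_layer_preds:
        # Count the examples this exit layer classifies correctly.
        correct = sum(int(p == y) for p, y in zip(preds, labels))
        accuracies.append(correct / len(labels))
    return accuracies


if __name__ == "__main__":
    labels = [1, 0, 1, 1]
    per_layer_preds = [
        [0, 0, 0, 1],  # exit layer 1
        [1, 0, 0, 1],  # exit layer 2
        [1, 0, 1, 1],  # exit layer 3
    ]
    for layer, acc in enumerate(accuracy_per_exit(per_layer_preds, labels), start=1):
        print(f"exit layer {layer}: accuracy {acc:.2f}")
```

Comparing these per-layer numbers shows how much accuracy is given up by exiting at a shallower layer.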

eval_entropy.sh

This is for evaluating fine-tuned DeeBERT models, given a number of different early exit entropy thresholds.
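The entropy threshold controls the exit decision: inference stops at the first exit layer whose output distribution has entropy below the threshold, and falls back to the last layer otherwise. The following is a minimal sketch of that rule in plain Python, not the repository's implementation; the names `softmax`, `entropy`, and `early_exit`, and the list-of-logits input, are assumptions for illustration.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def entropy(probs):
    """Shannon entropy (natural log) of a probability distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def early_exit(per_layer_logits, threshold):
    """Return (exit_layer, predicted_class); layers are 1-indexed.

    per_layer_logits[i] holds the logits produced by exit layer i + 1
    (hypothetical layout, chosen for this sketch).
    """
    num_layers = len(per_layer_logits)
    for layer, logits in enumerate(per_layer_logits, start=1):
        probs = softmax(logits)
        # Exit when the prediction is confident enough, or at the last layer.
        if entropy(probs) < threshold or layer == num_layers:
            return layer, probs.index(max(probs))
    raise ValueError("per_layer_logits must not be empty")


if __name__ == "__main__":
    # Toy logits for one example: confidence grows with depth.
    logits = [[0.1, 0.2], [0.5, 1.5], [0.0, 6.0]]
    for threshold in (0.0, 0.3, 0.6, 0.7):
        layer, pred = early_exit(logits, threshold)
        print(f"threshold {threshold}: exit at layer {layer}, class {pred}")
```

A higher threshold lets more examples exit early (faster, possibly less accurate); a threshold of 0 means every example runs through all layers.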

Citation

Please cite our paper if you find the repository useful:

@inproceedings{xin-etal-2020-deebert,
    title = "{D}ee{BERT}: Dynamic Early Exiting for Accelerating {BERT} Inference",
    author = "Xin, Ji  and
      Tang, Raphael  and
      Lee, Jaejun  and
      Yu, Yaoliang  and
      Lin, Jimmy",
    booktitle = "Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics",
    month = jul,
    year = "2020",
    address = "Online",
    publisher = "Association for Computational Linguistics",
    url = "https://www.aclweb.org/anthology/2020.acl-main.204",
    pages = "2246--2251",
}

Owner

Castorini
