Fastseq

基于ONNXRUNTIME的文本生成加速框架

1. 环境配置

# 创建onnx conda环境
conda create -n onnx_py38 python=3.8
conda activate onnx_py38
conda install pytorch cudatoolkit=10.2 -c pytorch

# 安装onnxruntime-gpu(目前只有1.5.2版本测试成功)
pip install onnxruntime-gpu==1.5.2

# 安装transformers==3.1.0版本
pip install transformers==3.1.0

2. ONNX转换

# 将huggingface保存的 模型/checkpoint 转换为onnx格式。这里使用onnxruntime自带的转换工具。
python -m onnxruntime.transformers.convert_to_onnx \
    -m "path_to_checkpoint/model_name(gpt2)" \
    --model_class GPT2LMHeadModel \
    --output gpt2_fp32.onnx \
    -p fp32

3. DEMO测试

CUDA_VISIBLE_DEVICES=3 python demo.py \
    --onnx_model_path "./gpt2_fp32.onnx" \
    --model_name_or_path "path_to_checkpoint" \
    --prompt_text "here is an example of gpt2 model" \
    --do_sample_top_k 5

Fastseq 基于ONNXRUNTIME的文本生成加速框架

Related tags

Overview

Fastseq

1. 环境配置

2. ONNX转换

3. DEMO测试

4. TODO

Owner

Jun Gao

A python package to fine-tune transformer-based models for named entity recognition (NER).

voice2json is a collection of command-line tools for offline speech/intent recognition on Linux

The proliferation of disinformation across social media has led the application of deep learning techniques to detect fake news.

【原神】自动演奏风物之诗琴的程序

In this workshop we will be exploring NLP state of the art transformers, with SOTA models like T5 and BERT, then build a model using HugginFace transformers framework.

Easy Language Model Pretraining leveraging Huggingface's Transformers and Datasets

End-2-end speech synthesis with recurrent neural networks

DeBERTa: Decoding-enhanced BERT with Disentangled Attention

Natural language processing summarizer using 3 state of the art Transformer models: BERT, GPT2, and T5

Code for EMNLP20 paper: "ProphetNet: Predicting Future N-gram for Sequence-to-Sequence Pre-training"

A PyTorch implementation of paper "Learning Shared Semantic Space for Speech-to-Text Translation", ACL (Findings) 2021

Tools and data for measuring the popularity & growth of various programming languages.

Stanford CoreNLP provides a set of natural language analysis tools written in Java

Exploring dimension-reduced embeddings

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Simple, Fast, Powerful and Easily extensible python package for extracting patterns from text, with over than 60 predefined Regular Expressions.

COVID-19 Related NLP Papers

Contract Understanding Atticus Dataset

Easily train your own text-generating neural network of any size and complexity on any text dataset with a few lines of code.